NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039761240|ref|XP_017174574|]
View 

bromodomain adjacent to zinc finger domain protein 2B isoform X1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Bromo_BAZ2A_B_like cd05503
Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B ...
2094-2190 2.00e-64

Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


:

Pssm-ID: 99935  Cd Length: 97  Bit Score: 213.39  E-value: 2.00e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05503      1 DLALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNEDDS 80
                           90
                   ....*....|....*..
gi 1039761240 2174 DIGRAGHSMRKYFEKKW 2190
Cdd:cd05503     81 EVGRAGHNMRKFFEKRW 97
PHD_BAZ2B cd15630
PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also ...
1961-2009 4.24e-32

PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also termed WALp4, is a bromodomain-containing protein whose biological role is still elusive. It shows high sequence similarly with BAZ2A, which is the large subunit of the SNF2h-containing chromatin-remodeling complex NoRC that induces nucleosome sliding in an ATP-and histone H4 tail-dependent fashion. BAZ2B contains a TAM (TIP5/ARBP/MBD) domain, an Apolipophorin-III like domain, a DDT domain, four AT-hooks, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


:

Pssm-ID: 277100  Cd Length: 49  Bit Score: 119.69  E-value: 4.24e-32
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1961 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACIS 2009
Cdd:cd15630      1 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPEGDWFCPACIA 49
MBD super family cl00110
MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of ...
758-827 1.89e-25

MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.


The actual alignment was detected with superfamily member cd01397:

Pssm-ID: 469618 [Multi-domain]  Cd Length: 73  Bit Score: 101.33  E-value: 1.89e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039761240  758 ELRIPLDYGWQRETRVRNFGGRLPGEVAYYAPCGKKLRQCPDMVKGMQW---CLLKEEDVIPRIRAMDGRRGR 827
Cdd:cd01397      1 ELRVPLELGWRRETRIRGLGGRIQGEVAYYAPCGKKLRQYPEVIKYLSKngiSLLSRENFSFSARAPVGDFYE 73
TPH super family cl38442
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
835-1052 2.79e-15

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


The actual alignment was detected with superfamily member pfam13868:

Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 79.58  E-value: 2.79e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKgrppnvgsaefldntdaKLLRKLQAQ-EIARQAAQIKLLRKLQKQEQ----ARVAKEAKKQQAIMAAE 909
Cdd:pfam13868  109 RIQEEDQAEAEE-----------------KLEKQRQLReEIDEFNEEQAEWKELEKEEEreedERILEYLKEKAEREEER 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  910 E-------KRKQKEQMKIIKQQEKIKRIQQIRmeKELRAQQILEakkkkkeeaanakllEAEKRTKEKELRRQQAVLLKH 982
Cdd:pfam13868  172 EaereeieEEKEREIARLRAQQEKAQDEKAER--DELRAKLYQE---------------EQERKERQKEREEAEKKARQR 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  983 QELERHRLDMVWERERRRQH------VMLMKAMEARKKAEEKERLKQEKRDEKRL------------NKERKLEQRRLEL 1044
Cdd:pfam13868  235 QELQQAREEQIELKERRLAEeaereeEEFERMLRKQAEDEEIEQEEAEKRRMKRLehrrelekqieeREEQRAAEREEEL 314

                   ....*...
gi 1039761240 1045 EMAKELKK 1052
Cdd:pfam13868  315 EEGERLRE 322
DDT smart00571
domain in different transcription and chromosome remodeling factors;
1079-1142 3.11e-14

domain in different transcription and chromosome remodeling factors;


:

Pssm-ID: 214726  Cd Length: 63  Bit Score: 69.20  E-value: 3.11e-14
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240  1079 TTFSDCLMVVQFLRNFGKVLGFDVNIDVpnLSVLQEGLLNIgDSMGEVQDLLVRLLSAAVCDPG 1142
Cdd:smart00571    2 EAFGDLLMVYEFLRSFGKVLGLSPFRAT--LEDFIAALKCR-DQNGLLTEVHVVLLRAILKDEG 62
WSD super family cl21412
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1363-1395 5.98e-06

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


The actual alignment was detected with superfamily member pfam15613:

Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 45.60  E-value: 5.98e-06
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1039761240 1363 LRSMMFGQDRYRRRYWIL-PQCGGIFVEGMESGE 1395
Cdd:pfam15613    1 IRSLPLGRDRRYNRYWWFdPGTGRLFVESPSDGE 34
WSD super family cl21412
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1706-1745 3.04e-05

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


The actual alignment was detected with superfamily member pfam15613:

Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 43.68  E-value: 3.04e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1706 EEMQFGWWRIID-PEDLKTLLKVLHLRGIREKALQKQIQKH 1745
Cdd:pfam15613   28 ESPSDGEWGVYSsKEQLDALIASLNPRGVRESALKEALEKI 68
WHIM1 pfam15612
WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and ...
1183-1223 2.64e-03

WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and WHIM3 motifs, and the DDT domain comprise an alpha helical module found in diverse eukaryotic chromatin proteins.Based on the Ioc3 structure, this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The conserved basic residue in WHIM1 is involved in packing with the DDT motif. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognising and DNA binding domains, some of which discriminate methylated DNA.


:

Pssm-ID: 464774 [Multi-domain]  Cd Length: 46  Bit Score: 37.48  E-value: 2.64e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1183 ELTESLKTKAFQAHTPAQKASILAFLVNELACSKSVVSEID 1223
Cdd:pfam15612    6 GLLETLKKGGYYELSPEEKLKILKALCDLLLSSSAIRDEIE 46
 
Name Accession Description Interval E-value
Bromo_BAZ2A_B_like cd05503
Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B ...
2094-2190 2.00e-64

Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99935  Cd Length: 97  Bit Score: 213.39  E-value: 2.00e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05503      1 DLALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNEDDS 80
                           90
                   ....*....|....*..
gi 1039761240 2174 DIGRAGHSMRKYFEKKW 2190
Cdd:cd05503     81 EVGRAGHNMRKFFEKRW 97
BROMO smart00297
bromo domain;
2088-2192 3.73e-35

bromo domain;


Pssm-ID: 197636 [Multi-domain]  Cd Length: 107  Bit Score: 130.48  E-value: 3.73e-35
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  2088 KKDESRDLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCET 2167
Cdd:smart00297    2 PKLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEAPDYYDIIKKPMDLKTIKKKLENGKYSSVEEFVADFNLMFSNART 81
                            90       100
                    ....*....|....*....|....*
gi 1039761240  2168 FNEDDSDIGRAGHSMRKYFEKKWTD 2192
Cdd:smart00297   82 YNGPDSEVYKDAKKLEKFFEKKLRE 106
PHD_BAZ2B cd15630
PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also ...
1961-2009 4.24e-32

PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also termed WALp4, is a bromodomain-containing protein whose biological role is still elusive. It shows high sequence similarly with BAZ2A, which is the large subunit of the SNF2h-containing chromatin-remodeling complex NoRC that induces nucleosome sliding in an ATP-and histone H4 tail-dependent fashion. BAZ2B contains a TAM (TIP5/ARBP/MBD) domain, an Apolipophorin-III like domain, a DDT domain, four AT-hooks, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277100  Cd Length: 49  Bit Score: 119.69  E-value: 4.24e-32
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1961 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACIS 2009
Cdd:cd15630      1 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPEGDWFCPACIA 49
HAT_MBD cd01397
Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as ...
758-827 1.89e-25

Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.


Pssm-ID: 238691 [Multi-domain]  Cd Length: 73  Bit Score: 101.33  E-value: 1.89e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039761240  758 ELRIPLDYGWQRETRVRNFGGRLPGEVAYYAPCGKKLRQCPDMVKGMQW---CLLKEEDVIPRIRAMDGRRGR 827
Cdd:cd01397      1 ELRVPLELGWRRETRIRGLGGRIQGEVAYYAPCGKKLRQYPEVIKYLSKngiSLLSRENFSFSARAPVGDFYE 73
Bromodomain pfam00439
Bromodomain; Bromodomains are 110 amino acid long domains, that are found in many chromatin ...
2098-2180 4.93e-22

Bromodomain; Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 425683 [Multi-domain]  Cd Length: 84  Bit Score: 91.99  E-value: 4.93e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2098 CSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGR 2177
Cdd:pfam00439    1 CLEILDKLMEHPIAAPFLEPVDPDEYPDYYSVIKKPMDLSTIKKKLENGEYKSLAEFLADVKLIFSNARTYNGPGSVIYK 80

                   ...
gi 1039761240 2178 AGH 2180
Cdd:pfam00439   81 AAE 83
PHD pfam00628
PHD-finger; PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar ...
1962-2010 8.52e-18

PHD-finger; PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.


Pssm-ID: 425785 [Multi-domain]  Cd Length: 51  Bit Score: 78.69  E-value: 8.52e-18
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITT--IPDGDWFCPACISK 2010
Cdd:pfam00628    1 YCAVCGKSDDGGELVQCDGCDDWFHLACLGPPLDPaeIPSGEWLCPECKPK 51
COG5076 COG5076
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin ...
2014-2189 1.00e-17

Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227408 [Multi-domain]  Cd Length: 371  Bit Score: 87.55  E-value: 1.00e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2014 QSIKIKKIHVKGKKTNDSKKTKKGNVAGDTEDEDSASTSSSLKRGSKElkKRKMEETTSLNLSKAESTTSIKKPKKDESR 2093
Cdd:COG5076     62 TSIVDDREPGSMANVNDDLENVGGITYSPFEKNRPESLRFDEIVFLAI--ESVTPESGLGSLLMAHLKTSVKKRKTPKIE 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLAL---CSMILTEMET------HEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDN 2164
Cdd:COG5076    140 DELLyadNKAIAKFKKQlflrdgRFLSSIFLGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDN 219
                          170       180
                   ....*....|....*....|....*
gi 1039761240 2165 CETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:COG5076    220 CKLYNGPDSSVYVDAKELEKYFLKL 244
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
835-1052 2.79e-15

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 79.58  E-value: 2.79e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKgrppnvgsaefldntdaKLLRKLQAQ-EIARQAAQIKLLRKLQKQEQ----ARVAKEAKKQQAIMAAE 909
Cdd:pfam13868  109 RIQEEDQAEAEE-----------------KLEKQRQLReEIDEFNEEQAEWKELEKEEEreedERILEYLKEKAEREEER 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  910 E-------KRKQKEQMKIIKQQEKIKRIQQIRmeKELRAQQILEakkkkkeeaanakllEAEKRTKEKELRRQQAVLLKH 982
Cdd:pfam13868  172 EaereeieEEKEREIARLRAQQEKAQDEKAER--DELRAKLYQE---------------EQERKERQKEREEAEKKARQR 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  983 QELERHRLDMVWERERRRQH------VMLMKAMEARKKAEEKERLKQEKRDEKRL------------NKERKLEQRRLEL 1044
Cdd:pfam13868  235 QELQQAREEQIELKERRLAEeaereeEEFERMLRKQAEDEEIEQEEAEKRRMKRLehrrelekqieeREEQRAAEREEEL 314

                   ....*...
gi 1039761240 1045 EMAKELKK 1052
Cdd:pfam13868  315 EEGERLRE 322
PTZ00121 PTZ00121
MAEBL; Provisional
832-1056 9.33e-15

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 80.96  E-value: 9.33e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  832 DRPRAREESRMKRRKGRppnvgSAEfldntDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEK 911
Cdd:PTZ00121  1500 DEAKKAAEAKKKADEAK-----KAE-----EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEA 1569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  912 RKQKEQMKI-IKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRL 990
Cdd:PTZ00121  1570 KKAEEDKNMaLRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240  991 DMVWERERRRQhvmLMKAMEARKKAEEK---ERLKQEKRDEKRlnKERKLEQRRLELEMAKELKKPKED 1056
Cdd:PTZ00121  1650 EELKKAEEENK---IKAAEEAKKAEEDKkkaEEAKKAEEDEKK--AAEALKKEAEEAKKAEELKKKEAE 1713
PHD smart00249
PHD zinc finger; The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in ...
1962-2007 1.20e-14

PHD zinc finger; The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RING finger and the FYVE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RING fingers misidentified as PHD fingers.


Pssm-ID: 214584 [Multi-domain]  Cd Length: 47  Bit Score: 69.93  E-value: 1.20e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1039761240  1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITT-IPDGDWFCPAC 2007
Cdd:smart00249    1 YCSVCGKPDDGGELLQCDGCDRWYHQTCLGPPLLEeEPDGKWYCPKC 47
DDT smart00571
domain in different transcription and chromosome remodeling factors;
1079-1142 3.11e-14

domain in different transcription and chromosome remodeling factors;


Pssm-ID: 214726  Cd Length: 63  Bit Score: 69.20  E-value: 3.11e-14
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240  1079 TTFSDCLMVVQFLRNFGKVLGFDVNIDVpnLSVLQEGLLNIgDSMGEVQDLLVRLLSAAVCDPG 1142
Cdd:smart00571    2 EAFGDLLMVYEFLRSFGKVLGLSPFRAT--LEDFIAALKCR-DQNGLLTEVHVVLLRAILKDEG 62
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
868-1049 6.88e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 68.04  E-value: 6.88e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  868 KLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIK----RIQQIRMEKELRAQQ 943
Cdd:COG1196    252 EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRReleeRLEELEEELAELEEE 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  944 ILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKhQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQ 1023
Cdd:COG1196    332 LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE-AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEE 410
                          170       180
                   ....*....|....*....|....*...
gi 1039761240 1024 --EKRDEKRLNKERKLEQRRLELEMAKE 1049
Cdd:COG1196    411 alLERLERLEEELEELEEALAELEEEEE 438
MBD pfam01429
Methyl-CpG binding domain; The Methyl-CpG binding domain (MBD) binds to DNA that contains one ...
753-802 7.96e-11

Methyl-CpG binding domain; The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase.


Pssm-ID: 396147 [Multi-domain]  Cd Length: 76  Bit Score: 59.68  E-value: 7.96e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240  753 VADDQELRIPLDYGWQRETRVRNFGG-RLPGEVAYYAPCGKKLRQCPDMVK 802
Cdd:pfam01429    1 IERKREDRLPLPPGWRREERQRKSGSkAGKVDVFYYSPTGKKLRSKSEVAR 51
MBD smart00391
Methyl-CpG binding domain; Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, ...
758-802 4.04e-09

Methyl-CpG binding domain; Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain


Pssm-ID: 128673  Cd Length: 77  Bit Score: 55.07  E-value: 4.04e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1039761240   758 ELRIPLDYGWQRETRVRNFG-GRLPGEVAYYAPCGKKLRQCPDMVK 802
Cdd:smart00391    3 PLRLPLPCGWRRETKQRKSGrSAGKFDVYYISPCGKKLRSKSELAR 48
DDT pfam02791
DDT domain; The DDT domain is named after (DNA binding homeobox and Different Transcription ...
1080-1140 1.25e-08

DDT domain; The DDT domain is named after (DNA binding homeobox and Different Transcription factors) and is approximately 60 residues in length. Along with the WHIM motifs, it comprises an entirely alpha helical module found in diverse eukaryotic chromatin proteins. Based on the structure of Ioc3, this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. In particular, the DDT domain, in combination with the WHIM1 and WHIM2 motifs form the SLIDE domain binding pocket.


Pssm-ID: 460696  Cd Length: 58  Bit Score: 52.89  E-value: 1.25e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1080 TFSDCLMVVQFLRNFGKVLGfdvnIDVPNLSVLQEGLLNIGDSMGEVQDLLVRLLSAAVCD 1140
Cdd:pfam02791    2 AFGDLLMVWEFLNSFGEVLG----LSPFTLDDFEEALLCTEEPSELLDEIHCALLKALVRD 58
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
870-1056 1.76e-08

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 58.70  E-value: 1.76e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  870 QAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAimaaEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQILEAKK 949
Cdd:TIGR02794   51 QANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRA----AEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKA 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  950 KKkeeaanakllEAE-KRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQEKRDE 1028
Cdd:TIGR02794  127 KQ----------AAEaKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKA 196
                          170       180
                   ....*....|....*....|....*....
gi 1039761240 1029 KRLNKERKLEQRRLEL-EMAKELKKPKED 1056
Cdd:TIGR02794  197 KAEAAKAKAAAEAAAKaEAEAAAAAAAEA 225
WSD pfam15613
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1363-1395 5.98e-06

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 45.60  E-value: 5.98e-06
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1039761240 1363 LRSMMFGQDRYRRRYWIL-PQCGGIFVEGMESGE 1395
Cdd:pfam15613    1 IRSLPLGRDRRYNRYWWFdPGTGRLFVESPSDGE 34
WSD pfam15613
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1706-1745 3.04e-05

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 43.68  E-value: 3.04e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1706 EEMQFGWWRIID-PEDLKTLLKVLHLRGIREKALQKQIQKH 1745
Cdd:pfam15613   28 ESPSDGEWGVYSsKEQLDALIASLNPRGVRESALKEALEKI 68
COG5141 COG5141
PHD zinc finger-containing protein [General function prediction only];
1962-2008 5.53e-04

PHD zinc finger-containing protein [General function prediction only];


Pssm-ID: 227470 [Multi-domain]  Cd Length: 669  Bit Score: 44.97  E-value: 5.53e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1962 YCQICRK-----GDNEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACI 2008
Cdd:COG5141    192 FDDICTKctsthNENSNAIVFCDGCEICVHQSCY--GIQFLPEGFWLCRKCI 241
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
870-932 1.24e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 41.03  E-value: 1.24e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240   870 QAQEIARQAAQIK-LLRKLQKQ-----EQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQ 932
Cdd:smart00935   30 RQAELEKLEKELQkLKEKLQKDaatlsEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILD 98
WHIM1 pfam15612
WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and ...
1183-1223 2.64e-03

WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and WHIM3 motifs, and the DDT domain comprise an alpha helical module found in diverse eukaryotic chromatin proteins.Based on the Ioc3 structure, this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The conserved basic residue in WHIM1 is involved in packing with the DDT motif. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognising and DNA binding domains, some of which discriminate methylated DNA.


Pssm-ID: 464774 [Multi-domain]  Cd Length: 46  Bit Score: 37.48  E-value: 2.64e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1183 ELTESLKTKAFQAHTPAQKASILAFLVNELACSKSVVSEID 1223
Cdd:pfam15612    6 GLLETLKKGGYYELSPEEKLKILKALCDLLLSSSAIRDEIE 46
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
960-1051 2.78e-03

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 42.18  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  960 LLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQH----VMLMKAM--EARKKAEEKERLKQEK-RDEKRLN 1032
Cdd:cd16269    193 LTEKEKEIEAERAKAEAAEQERKLLEEQQRELEQKLEDQERSYeehlRQLKEKMeeERENLLKEQERALESKlKEQEALL 272
                           90
                   ....*....|....*....
gi 1039761240 1033 KERKLEQRRLELEMAKELK 1051
Cdd:cd16269    273 EEGFKEQAELLQEEIRSLK 291
growth_prot_Scy NF041483
polarized growth protein Scy;
863-1056 5.97e-03

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 42.12  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKlQAQE-IARQAAQIKLLRKlQKQEQArvakEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELR- 940
Cdd:NF041483   515 ATTLRR-QAEEtLERTRAEAERLRA-EAEEQA----EEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERl 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  941 --AQQILEAKKKKKEEAANAKLLEAEK-RTKEKELRR---QQAvllkHQELERHRLDMVWERERRR---QHVMLMKAMEA 1011
Cdd:NF041483   589 taAEEALADARAEAERIRREAAEETERlRTEAAERIRtlqAQA----EQEAERLRTEAAADASAARaegENVAVRLRSEA 664
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1012 rkkAEEKERLKQEKRDE-KRLNKERKLEQRRLELEMAKELKKPKED 1056
Cdd:NF041483   665 ---AAEAERLKSEAQESaDRVRAEAAAAAERVGTEAAEALAAAQEE 707
 
Name Accession Description Interval E-value
Bromo_BAZ2A_B_like cd05503
Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B ...
2094-2190 2.00e-64

Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99935  Cd Length: 97  Bit Score: 213.39  E-value: 2.00e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05503      1 DLALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNEDDS 80
                           90
                   ....*....|....*..
gi 1039761240 2174 DIGRAGHSMRKYFEKKW 2190
Cdd:cd05503     81 EVGRAGHNMRKFFEKRW 97
BROMO smart00297
bromo domain;
2088-2192 3.73e-35

bromo domain;


Pssm-ID: 197636 [Multi-domain]  Cd Length: 107  Bit Score: 130.48  E-value: 3.73e-35
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  2088 KKDESRDLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCET 2167
Cdd:smart00297    2 PKLQKKLQELLKAVLDKLDSHPLSWPFLKPVSRKEAPDYYDIIKKPMDLKTIKKKLENGKYSSVEEFVADFNLMFSNART 81
                            90       100
                    ....*....|....*....|....*
gi 1039761240  2168 FNEDDSDIGRAGHSMRKYFEKKWTD 2192
Cdd:smart00297   82 YNGPDSEVYKDAKKLEKFFEKKLRE 106
Bromodomain cd04369
Bromodomain. Bromodomains are found in many chromatin-associated proteins and in nuclear ...
2094-2190 1.58e-33

Bromodomain. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.


Pssm-ID: 99922 [Multi-domain]  Cd Length: 99  Bit Score: 125.18  E-value: 1.58e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEMETH--EDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNED 2171
Cdd:cd04369      1 LKKKLRSLLDALKKLkrDLSEPFLEPVDPKEAPDYYEVIKNPMDLSTIKKKLKNGEYKSLEEFEADVRLIFSNAKTYNGP 80
                           90
                   ....*....|....*....
gi 1039761240 2172 DSDIGRAGHSMRKYFEKKW 2190
Cdd:cd04369     81 GSPIYKDAKKLEKLFEKLL 99
PHD_BAZ2B cd15630
PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also ...
1961-2009 4.24e-32

PHD finger found in bromodomain adjacent to zinc finger domain protein 2B (BAZ2B); BAZ2B, also termed WALp4, is a bromodomain-containing protein whose biological role is still elusive. It shows high sequence similarly with BAZ2A, which is the large subunit of the SNF2h-containing chromatin-remodeling complex NoRC that induces nucleosome sliding in an ATP-and histone H4 tail-dependent fashion. BAZ2B contains a TAM (TIP5/ARBP/MBD) domain, an Apolipophorin-III like domain, a DDT domain, four AT-hooks, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277100  Cd Length: 49  Bit Score: 119.69  E-value: 4.24e-32
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1961 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACIS 2009
Cdd:cd15630      1 VYCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPEGDWFCPACIA 49
Bromo_gcn5_like cd05509
Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates ...
2096-2189 4.27e-31

Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99941 [Multi-domain]  Cd Length: 101  Bit Score: 118.43  E-value: 4.27e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2096 ALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDI 2175
Cdd:cd05509      4 TQLKKVLDSLKNHKSAWPFLEPVDKEEAPDYYDVIKKPMDLSTMEEKLENGYYVTLEEFVADLKLIFDNCRLYNGPDTEY 83
                           90
                   ....*....|....
gi 1039761240 2176 GRAGHSMRKYFEKK 2189
Cdd:cd05509     84 YKCANKLEKFFWKK 97
PHD_BAZ2A_like cd15545
PHD finger found in bromodomain adjacent to zinc finger domain protein 2A (BAZ2A) and 2B ...
1962-2007 7.45e-31

PHD finger found in bromodomain adjacent to zinc finger domain protein 2A (BAZ2A) and 2B (BAZ2B); BAZ2A, also termed transcription termination factor I-interacting protein 5 (TTF-I-interacting protein 5, or Tip5), or WALp3, is an epigenetic regulator. It has been implicated in epigenetic rRNA gene silencing, as the large subunit of the SNF2h-containing chromatin-remodeling complex NoRC that induces nucleosome sliding in an ATP- and histone H4 tail-dependent fashion. BAZ2A has also been shown to be broadly overexpressed in prostate cancer, to regulate numerous protein-coding genes and to cooperate with EZH2 (enhancer of zeste homolog 2) to maintain epigenetic silencing at genes repressed in prostate cancer metastasis. Its overexpression is tightly associated with a prostate cancer subtype displaying CpG island methylator phenotype (CIMP) in tumors and with prostate cancer recurrence in patients. BAZ2B, also termed WALp4, is a bromodomain-containing protein whose biological role is still elusive. It shows high sequence similarly with BAZ2A. Both BAZ2A and BAZ2B contain a TAM (TIP5/ARBP/MBD) domain, a DDT domain, four AT-hooks, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain. BAZ2B also harbors an extra Apolipophorin-III like domain in its N-terminal region.


Pssm-ID: 277020 [Multi-domain]  Cd Length: 46  Bit Score: 115.87  E-value: 7.45e-31
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15545      1 SCQICRSGDNEDQLLLCDGCDRGYHTYCFKPKMTNVPEGDWFCPEC 46
Bromo_Acf1_like cd05504
Bromodomain; Acf1_like or BAZ1A_like subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was ...
2095-2189 2.76e-30

Bromodomain; Acf1_like or BAZ1A_like subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was identified as a novel human bromodomain gene by cDNA library screening. The Drosophila homologue, Acf1, is part of the CHRAC (chromatin accessibility complex) and regulates ISWI-induced nucleosome remodeling. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99936  Cd Length: 115  Bit Score: 116.73  E-value: 2.76e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2095 LALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSD 2174
Cdd:cd05504     14 LSALEQLLVEIVKHKDSWPFLRPVSKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSDIQLVFSNCFLYNPEHTS 93
                           90
                   ....*....|....*
gi 1039761240 2175 IGRAGHSMRKYFEKK 2189
Cdd:cd05504     94 VYKAGTRLQRFFIKR 108
Bromo_tif1_like cd05502
Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of ...
2093-2189 4.28e-28

Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99934 [Multi-domain]  Cd Length: 109  Bit Score: 110.07  E-value: 4.28e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2093 RDLALCSMILTEMETHEDSWPFLLPVNlKLVPGYKKVIKKPMDFSTIREKL---NNGQYPNFETFALDVRLVFDNCETFN 2169
Cdd:cd05502      4 IDQRKCERLLLELYCHELSLPFHEPVS-PSVPNYYKIIKTPMDLSLIRKKLqpkSPQHYSSPEEFVADVRLMFKNCYKFN 82
                           90       100
                   ....*....|....*....|
gi 1039761240 2170 EDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05502     83 EEDSEVAQAGKELELFFEEQ 102
PHD_BAZ2A cd15629
PHD finger found in bromodomain adjacent to zinc finger domain protein 2A (BAZ2A); BAZ2A, also ...
1963-2008 1.30e-27

PHD finger found in bromodomain adjacent to zinc finger domain protein 2A (BAZ2A); BAZ2A, also termed transcription termination factor I-interacting protein 5 (TTF-I-interacting protein 5, or Tip5), or WALp3, is an epigenetic regulator. It has been implicated in epigenetic rRNA gene silencing, as the large subunit of the SNF2h-containing chromatin-remodeling complex NoRC that induces nucleosome sliding in an ATP- and histone H4 tail-dependent fashion. BAZ2A has also been shown to be broadly overexpressed in prostate cancer, to regulate numerous protein-coding genes and to cooperate with EZH2 (enhancer of zeste homolog 2) to maintain epigenetic silencing at genes repressed in prostate cancer metastasis. Its overexpression is tightly associated with a prostate cancer subtype displaying CpG island methylator phenotype (CIMP) in tumors and with prostate cancer recurrence in patients. It contains a TAM (TIP5/ARBP/MBD) domain, a DDT domain, four AT-hooks, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277099  Cd Length: 47  Bit Score: 106.86  E-value: 1.30e-27
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACI 2008
Cdd:cd15629      2 CLVCRKGDNDEYLLLCDGCDRGCHMYCHRPKMLQVPEGDWFCPNCV 47
Bromo_Brdt_II_like cd05498
Bromodomain, Brdt_like subfamily, repeat II. Human Brdt is a testis-specific member of the BET ...
2095-2190 1.87e-27

Bromodomain, Brdt_like subfamily, repeat II. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99930  Cd Length: 102  Bit Score: 108.13  E-value: 1.87e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2095 LALCSMILTEM--ETHED-SWPFLLPVN---LKLvPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETF 2168
Cdd:cd05498      2 LKFCSGILKELfsKKHKAyAWPFYKPVDpeaLGL-HDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKY 80
                           90       100
                   ....*....|....*....|..
gi 1039761240 2169 NEDDSDIGRAGHSMRKYFEKKW 2190
Cdd:cd05498     81 NPPDHPVHAMARKLQDVFEDRW 102
Bromo_BDF1_2_II cd05499
Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat II. BDF1 and BDF2 are yeast ...
2094-2190 4.97e-26

Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat II. BDF1 and BDF2 are yeast transcription factors involved in the expression of a wide range of genes, including snRNAs; they are required for sporulation and DNA repair and protect histone H4 from deacetylation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99931  Cd Length: 102  Bit Score: 103.90  E-value: 4.97e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEM---ETHEDSWPFLLPVN--LKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETF 2168
Cdd:cd05499      1 ELKFCEEVLKELmkpKHSAYNWPFLDPVDpvALNIPNYFSIIKKPMDLGTISKKLQNGQYQSAKEFERDVRLIFKNCYTF 80
                           90       100
                   ....*....|....*....|..
gi 1039761240 2169 NEDDSDIGRAGHSMRKYFEKKW 2190
Cdd:cd05499     81 NPEGTDVYMMGHQLEEVFNDKW 102
HAT_MBD cd01397
Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as ...
758-827 1.89e-25

Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.


Pssm-ID: 238691 [Multi-domain]  Cd Length: 73  Bit Score: 101.33  E-value: 1.89e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039761240  758 ELRIPLDYGWQRETRVRNFGGRLPGEVAYYAPCGKKLRQCPDMVKGMQW---CLLKEEDVIPRIRAMDGRRGR 827
Cdd:cd01397      1 ELRVPLELGWRRETRIRGLGGRIQGEVAYYAPCGKKLRQYPEVIKYLSKngiSLLSRENFSFSARAPVGDFYE 73
Bromo_plant1 cd05506
Bromodomain, uncharacterized subfamily specific to plants. Might function as a global ...
2098-2190 9.96e-24

Bromodomain, uncharacterized subfamily specific to plants. Might function as a global transcription factor. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99938  Cd Length: 99  Bit Score: 97.40  E-value: 9.96e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2098 CSMILTEMETHEDSWPFLLPVNLKL--VPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDI 2175
Cdd:cd05506      5 CGTLLRKLMKHKWGWVFNAPVDVVAlgLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEFAADVRLTFANAMRYNPPGNDV 84
                           90
                   ....*....|....*
gi 1039761240 2176 GRAGHSMRKYFEKKW 2190
Cdd:cd05506     85 HTMAKELLKIFETRW 99
Bromo_TFIID cd05511
Bromodomain, TFIID-like subfamily. Human TAFII250 (or TAF250) is the largest subunit of TFIID, ...
2097-2189 2.52e-22

Bromodomain, TFIID-like subfamily. Human TAFII250 (or TAF250) is the largest subunit of TFIID, a large multi-domain complex, which initiates the assembly of the transcription machinery. TAFII250 contains two bromodomains that specifically bind to acetylated histone H4. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99943 [Multi-domain]  Cd Length: 112  Bit Score: 93.87  E-value: 2.52e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2097 LCSM---ILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05511      1 LSFIldeIVNELKNLPDSWPFHTPVNKKKVPDYYKIIKRPMDLQTIRKKISKHKYQSREEFLEDIELIVDNSVLYNGPDS 80
                           90
                   ....*....|....*.
gi 1039761240 2174 DIGRAGHSMRKYFEKK 2189
Cdd:cd05511     81 VYTKKAKEMLELAEEL 96
Bromo_BDF1_2_I cd05500
Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat I. BDF1 and BDF2 are yeast ...
2110-2189 4.93e-22

Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat I. BDF1 and BDF2 are yeast transcription factors involved in the expression of a wide range of genes, including snRNAs; they are required for sporulation and DNA repair and protect histone H4 from deacetylation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99932  Cd Length: 103  Bit Score: 92.76  E-value: 4.93e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2110 DSWPFLLPVN-LKL-VPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFE 2187
Cdd:cd05500     21 DARPFLVPVDpVKLnIPHYPTIIKKPMDLGTIERKLKSNVYTSVEEFTADFNLMVDNCLTFNGPEHPVSQMGKRLQAAFE 100

                   ..
gi 1039761240 2188 KK 2189
Cdd:cd05500    101 KH 102
Bromodomain pfam00439
Bromodomain; Bromodomains are 110 amino acid long domains, that are found in many chromatin ...
2098-2180 4.93e-22

Bromodomain; Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 425683 [Multi-domain]  Cd Length: 84  Bit Score: 91.99  E-value: 4.93e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2098 CSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGR 2177
Cdd:pfam00439    1 CLEILDKLMEHPIAAPFLEPVDPDEYPDYYSVIKKPMDLSTIKKKLENGEYKSLAEFLADVKLIFSNARTYNGPGSVIYK 80

                   ...
gi 1039761240 2178 AGH 2180
Cdd:pfam00439   81 AAE 83
PHD_BAZ1A cd15627
PHD finger found in bromodomain adjacent to zinc finger domain protein 1A (BAZ1A); BAZ1A, also ...
1963-2007 3.14e-21

PHD finger found in bromodomain adjacent to zinc finger domain protein 1A (BAZ1A); BAZ1A, also termed ATP-dependent chromatin-remodeling protein, or ATP-utilizing chromatin assembly and remodeling factor 1 (ACF1), or CHRAC subunit ACF1, or Williams syndrome transcription factor-related chromatin-remodeling factor 180 (WCRF180), or WALp1, is a subunit of the conserved imitation switch (ISWI)-family ATP-dependent chromatin assembly and remodeling factor (ACF)/chromatin accessibility complex (CHRAC) chromatin remodeling complex, which is required for DNA replication through heterochromatin. It alters the remodeling properties of the ATPase motor protein sucrose nonfermenting-2 homolog (SNF2H). Moreover, BAZ1A and its complexes play important roles in DNA double-strand break (DSB) repair. It is essential for averting improper gene expression during spermatogenesis. It also regulates transcriptional repression of vitamin D3 receptor-regulated genes. BAZ1A contains a WAC motif, a DDT domain, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277097 [Multi-domain]  Cd Length: 46  Bit Score: 88.60  E-value: 3.14e-21
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15627      2 CRICRRKGDAEKMLLCDGCDRGHHMYCLRPPLKKVPEGDWFCPDC 46
PHD_RSF1 cd15543
PHD finger found in Remodeling and spacing factor 1 (Rsf-1); Rsf-1, also termed HBV ...
1962-2007 1.09e-20

PHD finger found in Remodeling and spacing factor 1 (Rsf-1); Rsf-1, also termed HBV pX-associated protein 8, or Hepatitis B virus X-associated protein alpha (HBxAPalpha), or p325 subunit of RSF chromatin-remodeling complex, is a novel nuclear protein with histone chaperon function. It is a subunit of an ISWI chromatin remodeling complex, remodeling and spacing factor (RSF), and plays a role in mediating ATPase-dependent chromatin remodeling and conferring tumor aggressiveness in common carcinomas. As an ataxia-telangiectasia mutated (ATM)-dependent chromatin remodeler, Rsf-1 facilitates DNA damage checkpoints and homologous recombination repair. It regulates the mitotic spindle checkpoint and chromosome instability through the association with serine/threonine kinase BubR1 (BubR1) and Hepatitis B virus (HBV) X protein (HBx) in the chromatin fraction during mitosis. It also interacts with cyclin E1 and promotes tumor development. Rsf-1 contains a plant homeodomain (PHD) finger.


Pssm-ID: 277018 [Multi-domain]  Cd Length: 46  Bit Score: 86.94  E-value: 1.09e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15543      1 PCRKCGLSDHPEWILLCDRCDAGYHTACLRPPLMIIPDGNWFCPPC 46
PHD_BAZ1A_like cd15544
PHD finger found in bromodomain adjacent to zinc finger domain protein BAZ1A and BAZ1B; BAZ1A, ...
1963-2007 1.45e-19

PHD finger found in bromodomain adjacent to zinc finger domain protein BAZ1A and BAZ1B; BAZ1A, also termed ATP-dependent chromatin-remodeling protein, or ATP-utilizing chromatin assembly and remodeling factor 1 (ACF1), or CHRAC subunit ACF1, or Williams syndrome transcription factor-related chromatin-remodeling factor 180 (WCRF180), or WALp1, is a subunit of the conserved imitation switch (ISWI)-family ATP-dependent chromatin assembly and remodeling factor (ACF)/chromatin accessibility complex (CHRAC) chromatin remodeling complex, which is required for DNA replication through heterochromatin. It alters the remodeling properties of the ATPase motor protein sucrose nonfermenting-2 homolog (SNF2H). Moreover, BAZ1A and its complexes play important roles in DNA double-strand break (DSB) repair. It is essential for averting improper gene expression during spermatogenesis. It also regulates transcriptional repression of vitamin D3 receptor-regulated genes. BAZ1B, also termed Tyrosine-protein kinase BAZ1B, or Williams syndrome transcription factor (WSTF), or Williams-Beuren syndrome chromosomal region 10 protein, Williams-Beuren syndrome chromosomal region 9 protein, or WALp2, is a multifunctional protein implicated in several nuclear processes, including replication, transcription, and the DNA damage response. BAZ1B/WSTF, together with the imitation switch (ISWI) ATPase, forms a WSTF-ISWI chromatin remodeling complex (WICH), which transiently associates with the human inactive X chromosome (Xi) during late S-phase prior to BRCA1 and gamma-H2AX. Moreover, BAZ1B/WSTF, SNF2h, and nuclear myosin 1 (NM1) forms the chromatin remodeling complex B-WICH that is involved in regulating rDNA transcription. Both BAZ1A and BAZ1B contain a WAC motif, a DDT domain, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277019  Cd Length: 46  Bit Score: 83.61  E-value: 1.45e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15544      2 CKVCRKKGDPDNMILCDGCDKAFHLYCLRPALREVPSGDWFCPAC 46
PHD1_Lid2p_like cd15519
PHD finger 1 found in Schizosaccharomyces pombe Lid2 complex component Lid2p and similar ...
1963-2007 5.21e-19

PHD finger 1 found in Schizosaccharomyces pombe Lid2 complex component Lid2p and similar proteins; Lid2p is a trimethyl H3K4 (H3K4me3) demethylase responsible for H3K4 hypomethylation in heterochromatin. It interacts with the histone lysine-9 methyltransferase, Clr4, through the Dos1/Clr8-Rik1 complex, and mediates H3K9 methylation and small RNA production. It also acts cooperatively with the histone modification enzymes Set1 and Lsd1 and plays an essential role in cross-talk between H3K4 and H3K9 methylation in euchromatin. Lid2p contains a JmjC domain, three PHD fingers and a JmjN domain. This model corresponds to the first PHD finger.


Pssm-ID: 276994 [Multi-domain]  Cd Length: 46  Bit Score: 82.13  E-value: 5.21e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15519      2 CEVCGLDDNEGEVLLCDGCDAEYHTSCLDPPLGEIPPGTWFCPSC 46
PHD1_KDM5A_like cd15515
PHD finger 1 found in Lysine-specific demethylase KDM5A, KDM5B, KDM5C, KDM5D and similar ...
1963-2007 8.18e-19

PHD finger 1 found in Lysine-specific demethylase KDM5A, KDM5B, KDM5C, KDM5D and similar proteins; The JARID subfamily within the JmjC proteins includes Lysine-specific demethylase KDM5A, KDM5B, KDM5C, KDM5D and a Drosophila homolog, protein little imaginal discs (Lid). KDM5A was originally identified as a retinoblastoma protein (Rb)-binding partner and its inactivation may be important for Rb to promote differentiation. It is involved in transcription through interacting with TBP, p107, nuclear receptors, Myc, Sin3/HDAC, Mad1, RBP-J, CLOCK and BMAL1. KDM5B has a restricted expression pattern in the testis, ovary, and transiently in the mammary gland of the pregnant female and has been shown to be upregulated in breast cancer, prostate cancer, and lung cancer, suggesting a potential role in tumorigenesis. Both KDM5A and KDM5B function as trimethylated histone H3 lysine 4 (H3K4me3) demethylases. KDM5C is a H3K4 trimethyl-histone demethylase that catalyzes demethylation of H3K4me3 and H3K4me2 to H3K4me1. It plays a role in neuronal survival and dendrite development. KDM5C defects are associated with X-linked mental retardation (XLMR). KDM5D is a male-specific antigen that shows a demethylase activity specific for di- and tri-methylated histone H3K4 (H3K4me2 and H3K4me3), and has a male-specific function as a histone H3K4 demethylase by recruiting a meiosis-regulatory protein, MSH5, to condensed DNA. KDM5D directly interacts with a polycomb-like protein Ring6a/MBLR, and plays a role in regulation of transcriptional initiation through H3K4 demethylation. This family also includes Drosophila melanogaster protein little imaginal discs (Lid) that functions as a JmjC-dependent H3K4me3 demethylase, which is required for dMyc-induced cell growth. It positively regulates Hox gene expression in S2 cells. Members in this family contain the catalytic JmjC domain, JmjN, the BRIGHT domain, which is an AT-rich interacting domain (ARID), and a Cys5HisCys2 zinc finger, as well as two or three plant homeodomain (PHD) fingers. This model corresponds to the first PHD finger.


Pssm-ID: 276990  Cd Length: 46  Bit Score: 81.67  E-value: 8.18e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15515      2 CQVCGRGDDEDKLLLCDGCDDSYHTFCLIPPLPDIPPGDWRCPKC 46
PHD pfam00628
PHD-finger; PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar ...
1962-2010 8.52e-18

PHD-finger; PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.


Pssm-ID: 425785 [Multi-domain]  Cd Length: 51  Bit Score: 78.69  E-value: 8.52e-18
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITT--IPDGDWFCPACISK 2010
Cdd:pfam00628    1 YCAVCGKSDDGGELVQCDGCDDWFHLACLGPPLDPaeIPSGEWLCPECKPK 51
COG5076 COG5076
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin ...
2014-2189 1.00e-17

Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227408 [Multi-domain]  Cd Length: 371  Bit Score: 87.55  E-value: 1.00e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2014 QSIKIKKIHVKGKKTNDSKKTKKGNVAGDTEDEDSASTSSSLKRGSKElkKRKMEETTSLNLSKAESTTSIKKPKKDESR 2093
Cdd:COG5076     62 TSIVDDREPGSMANVNDDLENVGGITYSPFEKNRPESLRFDEIVFLAI--ESVTPESGLGSLLMAHLKTSVKKRKTPKIE 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLAL---CSMILTEMET------HEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDN 2164
Cdd:COG5076    140 DELLyadNKAIAKFKKQlflrdgRFLSSIFLGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDN 219
                          170       180
                   ....*....|....*....|....*
gi 1039761240 2165 CETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:COG5076    220 CKLYNGPDSSVYVDAKELEKYFLKL 244
Bromo_SPT7_like cd05510
Bromodomain; SPT7_like subfamily. SPT7 is a yeast protein that functions as a component of the ...
2098-2184 1.51e-17

Bromodomain; SPT7_like subfamily. SPT7 is a yeast protein that functions as a component of the transcription regulatory histone acetylation (HAT) complexes SAGA, SALSA, and SLIK. SAGA is involved in the RNA polymerase II-dependent transcriptional regulation of about 10% of all yeast genes. The SPT7 bromodomain has been shown to weakly interact with acetylated histone H3, but not H4. The human representative of this subfamily is cat eye syndrome critical region protein 2 (CECR2). Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99942 [Multi-domain]  Cd Length: 112  Bit Score: 80.18  E-value: 1.51e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2098 CSMILTEMETH-EDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIG 2176
Cdd:cd05510     12 LDKVLNELKTYtEHSTPFLTKVSKREAPDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCLLYNSDPSHPL 91

                   ....*....
gi 1039761240 2177 RA-GHSMRK 2184
Cdd:cd05510     92 RRhANFMKK 100
Bromo_brd1_like cd05512
Bromodomain; brd1_like subfamily. BRD1 is a mammalian gene which encodes for a nuclear protein ...
2114-2183 2.40e-16

Bromodomain; brd1_like subfamily. BRD1 is a mammalian gene which encodes for a nuclear protein assumed to be a transcriptional regulator. BRD1 has been implicated with brain development and susceptibility to schizophrenia and bipolar affective disorder. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99944  Cd Length: 98  Bit Score: 76.28  E-value: 2.40e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2114 FLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMR 2183
Cdd:cd05512     22 FSEPVDLSEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYNAKDTIFYRAAVRLR 91
Bromo_WSTF_like cd05505
Bromodomain; Williams syndrome transcription factor-like subfamily (WSTF-like). The ...
2094-2175 2.68e-16

Bromodomain; Williams syndrome transcription factor-like subfamily (WSTF-like). The Williams-Beuren syndrome deletion transcript 9 is a putative transcriptional regulator. WSTF was found to play a role in vitamin D-mediated transcription as part of two chromatin remodeling complexes, WINAC and WICH. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99937  Cd Length: 97  Bit Score: 76.04  E-value: 2.68e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2094 DLALCSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05505      1 ELQKCEEILSKILKYRFSWPFREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYSSVQEFLDDMKLVFSNAEKYYENGS 80

                   ..
gi 1039761240 2174 DI 2175
Cdd:cd05505     81 YV 82
PHD1_Lid_like cd15605
PHD finger 1 found in Drosophila melanogaster protein little imaginal discs (Lid) and similar ...
1963-2007 5.29e-16

PHD finger 1 found in Drosophila melanogaster protein little imaginal discs (Lid) and similar proteins; Drosophila melanogaster Lid, also termed Retinoblastoma-binding protein 2 homolog, is identified genetically as a trithorax group (trxG) protein that is a Drosophila homolog of the human protein JARID1A/kdm5A, a member of the JARID subfamily within the JmjC proteins. Lid functions as a JmjC-dependent trimethyl histone H3K4 (H3K4me3) demethylase, which is required for dMyc-induced cell growth. It positively regulates Hox gene expression in S2 cells. Lid contains the catalytic JmjC domain, JmjN, the BRIGHT domain, which is an AT-rich interacting domain (ARID), and a Cys5HisCys2 zinc finger, as well as three plant homeodomain (PHD) fingers. This model corresponds to the first PHD finger of Lid.


Pssm-ID: 277078  Cd Length: 46  Bit Score: 73.64  E-value: 5.29e-16
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15605      2 CHTCGRGDGEESMLLCDGCDDSYHTFCLLPPLSEVPKGDWRCPKC 46
PHD_PHRF1 cd15536
PHD finger found in PHD and RING finger domain-containing protein 1 (PHRF1); PHRF1, also ...
1962-2007 6.66e-16

PHD finger found in PHD and RING finger domain-containing protein 1 (PHRF1); PHRF1, also termed KIAA1542, or CTD-binding SR-like protein rA9, is a ubiquitin ligase that induces the ubiquitination of TGIF (TG-interacting factor) at lysine 130. It acts as a tumor suppressor that promotes the transforming growth factor (TGF)-beta cytostatic program through selective release of TGIF-driven promyelocytic leukemia protein (PML) inactivation. PHRF1 contains a plant homeodomain (PHD) finger and a RING finger.


Pssm-ID: 277011  Cd Length: 46  Bit Score: 73.22  E-value: 6.66e-16
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15536      1 YCEVCGRSDREDRLLLCDGCDAGYHMECLTPPLDEVPIEEWFCPEC 46
PHD_BAZ1B cd15628
PHD finger found in bromodomain adjacent to zinc finger domain protein 1B (BAZ1B); BAZ1B, also ...
1963-2007 7.09e-16

PHD finger found in bromodomain adjacent to zinc finger domain protein 1B (BAZ1B); BAZ1B, also termed Tyrosine-protein kinase BAZ1B, or Williams syndrome transcription factor (WSTF), or Williams-Beuren syndrome chromosomal region 10 protein, Williams-Beuren syndrome chromosomal region 9 protein, or WALp2, is a multifunctional protein implicated in several nuclear processes, including replication, transcription, and the DNA damage response. BAZ1B/WSTF, together with the imitation switch (ISWI) ATPase, forms a WSTF-ISWI chromatin remodeling complex (WICH), which transiently associates with the human inactive X chromosome (Xi) during late S-phase prior to BRCA1 and gamma-H2AX. Moreover, BAZ1B/WSTF, SNF2h, and nuclear myosin 1 (NM1) forms the chromatin remodeling complex B-WICH that is involved in regulating rDNA transcription. BAZ1B contains a WAC motif, a DDT domain, BAZ 1 and BAZ 2 motifs, a WAKZ (WSTF/Acf1/KIAA0314/ZK783.4) motif, a plant homeodomain (PHD) finger, and a bromodomain.


Pssm-ID: 277098  Cd Length: 46  Bit Score: 73.24  E-value: 7.09e-16
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15628      2 CKVCRKKGEDDKLILCDECNQAFHLFCLRPALYEVPDGEWMCPAC 46
Bromo_Brdt_I_like cd05497
Bromodomain, Brdt_like subfamily, repeat I. Human Brdt is a testis-specific member of the BET ...
2108-2189 1.91e-15

Bromodomain, Brdt_like subfamily, repeat I. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99929  Cd Length: 107  Bit Score: 74.00  E-value: 1.91e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2108 HEDSWPFLLPVN-LKL-VPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKY 2185
Cdd:cd05497     20 HKFAWPFQQPVDaVKLnLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQDFNTMFTNCYIYNKPGDDVVLMAQTLEKL 99

                   ....
gi 1039761240 2186 FEKK 2189
Cdd:cd05497    100 FLQK 103
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
835-1052 2.79e-15

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 79.58  E-value: 2.79e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKgrppnvgsaefldntdaKLLRKLQAQ-EIARQAAQIKLLRKLQKQEQ----ARVAKEAKKQQAIMAAE 909
Cdd:pfam13868  109 RIQEEDQAEAEE-----------------KLEKQRQLReEIDEFNEEQAEWKELEKEEEreedERILEYLKEKAEREEER 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  910 E-------KRKQKEQMKIIKQQEKIKRIQQIRmeKELRAQQILEakkkkkeeaanakllEAEKRTKEKELRRQQAVLLKH 982
Cdd:pfam13868  172 EaereeieEEKEREIARLRAQQEKAQDEKAER--DELRAKLYQE---------------EQERKERQKEREEAEKKARQR 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  983 QELERHRLDMVWERERRRQH------VMLMKAMEARKKAEEKERLKQEKRDEKRL------------NKERKLEQRRLEL 1044
Cdd:pfam13868  235 QELQQAREEQIELKERRLAEeaereeEEFERMLRKQAEDEEIEQEEAEKRRMKRLehrrelekqieeREEQRAAEREEEL 314

                   ....*...
gi 1039761240 1045 EMAKELKK 1052
Cdd:pfam13868  315 EEGERLRE 322
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
862-1063 4.37e-15

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 79.19  E-value: 4.37e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  862 DAKLLRKLQAQEIARQA--AQIKLLRKLQKQEQARVAKEAKKQQAIMAAEE--------------KRKQKEQMKIIKQQE 925
Cdd:pfam13868   61 EEKEEERKEERKRYRQEleEQIEEREQKRQEEYEEKLQEREQMDEIVERIQeedqaeaeeklekqRQLREEIDEFNEEQA 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  926 KIKRIQQIRMEKELR--AQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQ-ELERHRLDMVWErERRRQH 1002
Cdd:pfam13868  141 EWKELEKEEEREEDEriLEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKaERDELRAKLYQE-EQERKE 219
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1003 VMLMKaMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKE-LKKPKEDMCLADQK 1063
Cdd:pfam13868  220 RQKER-EEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERmLRKQAEDEEIEQEE 280
PHD1_KDM5A cd15602
PHD finger 1 found in Lysine-specific demethylase 5A (KDM5A); KDM5A (also termed Histone ...
1963-2010 4.93e-15

PHD finger 1 found in Lysine-specific demethylase 5A (KDM5A); KDM5A (also termed Histone demethylase JARID1A, Jumonji/ARID domain-containing protein 1A, or Retinoblastoma-binding protein 2 (RBBP-2 or RBP2)) was originally identified as a retinoblastoma protein (Rb)-binding partner and its inactivation may be important for Rb to promote differentiation. It is involved in transcription through interacting with TBP, p107, nuclear receptors, Myc, Sin3/HDAC, Mad1, RBP-J, CLOCK and BMAL1. KDM5A functions as a trimethylated histone H3 lysine 4 (H3K4me3) demethylase that belongs to the JARID subfamily within the JmjC proteins. It also displays DNA-binding activities that can recognize the specific DNA sequence CCGCCC. KDM5A contains the catalytic JmjC domain, JmjN, the BRIGHT domain, which is an AT-rich interacting domain (ARID), and a Cys5HisCys2 zinc finger, as well as three plant homeodomain (PHD) fingers. This model corresponds to the first PHD finger.


Pssm-ID: 277075  Cd Length: 49  Bit Score: 71.13  E-value: 4.93e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACISK 2010
Cdd:cd15602      2 CLFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKGDWRCPKCVAE 49
Bromo_cbp_like cd05495
Bromodomain, cbp_like subfamily. Cbp (CREB binding protein or CREBBP) is an acetyltransferase ...
2109-2187 8.72e-15

Bromodomain, cbp_like subfamily. Cbp (CREB binding protein or CREBBP) is an acetyltransferase acting on histone, which gives a specific tag for transcriptional activation and also acetylates non-histone proteins. CREBBP binds specifically to phosphorylated CREB protein and augments the activity of phosphorylated CREB to activate transcription of cAMP-responsive genes. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99927  Cd Length: 108  Bit Score: 72.09  E-value: 8.72e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2109 EDSWPFLLPVNLKL--VPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYF 2186
Cdd:cd05495     20 PESLPFRQPVDPKLlgIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFDNAWLYNRKTSRVYKYCTKLAEVF 99

                   .
gi 1039761240 2187 E 2187
Cdd:cd05495    100 E 100
PTZ00121 PTZ00121
MAEBL; Provisional
832-1056 9.33e-15

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 80.96  E-value: 9.33e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  832 DRPRAREESRMKRRKGRppnvgSAEfldntDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEK 911
Cdd:PTZ00121  1500 DEAKKAAEAKKKADEAK-----KAE-----EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEA 1569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  912 RKQKEQMKI-IKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRL 990
Cdd:PTZ00121  1570 KKAEEDKNMaLRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240  991 DMVWERERRRQhvmLMKAMEARKKAEEK---ERLKQEKRDEKRlnKERKLEQRRLELEMAKELKKPKED 1056
Cdd:PTZ00121  1650 EELKKAEEENK---IKAAEEAKKAEEDKkkaEEAKKAEEDEKK--AAEALKKEAEEAKKAEELKKKEAE 1713
PHD smart00249
PHD zinc finger; The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in ...
1962-2007 1.20e-14

PHD zinc finger; The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RING finger and the FYVE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RING fingers misidentified as PHD fingers.


Pssm-ID: 214584 [Multi-domain]  Cd Length: 47  Bit Score: 69.93  E-value: 1.20e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1039761240  1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITT-IPDGDWFCPAC 2007
Cdd:smart00249    1 YCSVCGKPDDGGELLQCDGCDRWYHQTCLGPPLLEeEPDGKWYCPKC 47
Bromo_AAA cd05528
Bromodomain; sub-family co-occurring with AAA domains. Bromodomains are 110 amino acid long ...
2093-2177 1.97e-14

Bromodomain; sub-family co-occurring with AAA domains. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine. The structure(2DKW) in this alignment is an uncharacterized protein predicted from analysis of cDNA clones from human fetal liver


Pssm-ID: 99957  Cd Length: 112  Bit Score: 71.23  E-value: 1.97e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2093 RDLALC-SMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNED 2171
Cdd:cd05528      2 RELRLFlRDVLKRLASDKRFNAFTKPVDEEEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYNPD 81

                   ....*.
gi 1039761240 2172 DSDIGR 2177
Cdd:cd05528     82 RDPADK 87
PHD1_KDM5C_5D cd15604
PHD finger 1 found in Lysine-specific demethylase 5C (KDM5C) and 5D (KDM5D); The family ...
1963-2007 1.99e-14

PHD finger 1 found in Lysine-specific demethylase 5C (KDM5C) and 5D (KDM5D); The family includes KDM5C and KDM5D, both of which belong to the JARID subfamily within the JmjC proteins. KDM5C (also termed Histone demethylase JARID1C, Jumonji/ARID domain-containing protein 1C, SmcX, or Xe169) is a H3K4 trimethyl-histone demethylase that catalyzes demethylation of H3K4me3 and H3K4me2 to H3K4me1. It plays a role in neuronal survival and dendrite development. KDM5C defects are associated with X-linked mental retardation (XLMR). KDM5D (also termed Histocompatibility Y antigen (H-Y), Histone demethylase JARID1D, Jumonji/ARID domain-containing protein 1D, or SmcY) is a male-specific antigen that shows a demethylase activity specific for di- and tri-methylated histone H3K4 (H3K4me3 andH3K4me2), and has a male-specific function as a histone H3K4 demethylase by recruiting a meiosis-regulatory protein, MSH5, to condensed DNA. KDM5D directly interacts with a polycomb-like protein Ring6a/MBLR, and plays a role in regulation of transcriptional initiation through H3K4 demethylation. Both KDM5C and KDM5D contain the catalytic JmjC domain, JmjN, the BRIGHT domain, which is an AT-rich interacting domain (ARID), and a Cys5HisCys2 zinc finger, as well as two plant homeodomain (PHD) fingers. This model corresponds to the first PHD finger.


Pssm-ID: 277077  Cd Length: 46  Bit Score: 69.10  E-value: 1.99e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15604      2 CRMCSRGDEDDKLLLCDGCDDNYHTFCLLPPLPEPPKGIWRCPKC 46
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
877-1052 2.18e-14

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 76.88  E-value: 2.18e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  877 QAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRaQQIleakkkkkEEAA 956
Cdd:pfam13868   16 LAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELE-EQI--------EERE 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  957 NAKLLEAEKRTKEKELRRQQAVLLKHQELERHRldmvwERERRRQHVM--LMKAMEARKKAEEKERLKQEKRDEKRLNKE 1034
Cdd:pfam13868   87 QKRQEEYEEKLQEREQMDEIVERIQEEDQAEAE-----EKLEKQRQLReeIDEFNEEQAEWKELEKEEEREEDERILEYL 161
                          170
                   ....*....|....*...
gi 1039761240 1035 RKLEQRRLELEMAKELKK 1052
Cdd:pfam13868  162 KEKAEREEEREAEREEIE 179
Bromo_WDR9_II cd05496
Bromodomain; WDR9 repeat II_like subfamily. WDR9 is a human gene located in the Down Syndrome ...
2098-2169 2.89e-14

Bromodomain; WDR9 repeat II_like subfamily. WDR9 is a human gene located in the Down Syndrome critical region-2 of chromosome 21. It encodes for a nuclear protein containing WD40 repeats and two bromodomains, which may function as a transcriptional regulator involved in chromatin remodeling and play a role in embryonic development. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99928  Cd Length: 119  Bit Score: 70.95  E-value: 2.89e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 2098 CSMILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFN 2169
Cdd:cd05496     10 CKELVNLMWDCEDSEPFRQPVDLLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPMEFAKDVRLIFSNSKSYT 81
DDT smart00571
domain in different transcription and chromosome remodeling factors;
1079-1142 3.11e-14

domain in different transcription and chromosome remodeling factors;


Pssm-ID: 214726  Cd Length: 63  Bit Score: 69.20  E-value: 3.11e-14
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240  1079 TTFSDCLMVVQFLRNFGKVLGFDVNIDVpnLSVLQEGLLNIgDSMGEVQDLLVRLLSAAVCDPG 1142
Cdd:smart00571    2 EAFGDLLMVYEFLRSFGKVLGLSPFRAT--LEDFIAALKCR-DQNGLLTEVHVVLLRAILKDEG 62
PHD2_KAT6A_6B cd15527
PHD finger 2 found in monocytic leukemia zinc-finger protein (MOZ) and its factor (MORF); MOZ, ...
1963-2007 3.73e-14

PHD finger 2 found in monocytic leukemia zinc-finger protein (MOZ) and its factor (MORF); MOZ, also termed histone acetyltransferase KAT6A, YBF2/SAS3, SAS2 and TIP60 protein 3 (MYST-3), or runt-related transcription factor-binding protein 2, or zinc finger protein 220, is a MYST-type histone acetyltransferase (HAT) that functions as a coactivator for acute myeloid leukemia 1 protein (AML1)- and p53-dependent transcription. It possesses intrinsic HAT activity to acetylate both itself and lysine (K) residues on histone H2B, histone H3 (K14) and histone H4 (K5, K8, K12 and K16) in vitro and H3K9 in vivo. MOZ-related factor (MORF), also termed MOZ2, or histone acetyltransferase KAT6B, or MOZ, YBF2/SAS3, SAS2 and TIP60 protein 4 (MYST4), is a ubiquitously expressed transcriptional regulator with intrinsic HAT activity. It can interact with the Runt-domain transcription factor Runx2 and form a tetrameric complex with BRPFs, ING5, and EAF6. Both MOZ and MORF are catalytic subunits of HAT complexes that are required for normal developmental programs, such as hematopoiesis, neurogenesis, and skeletogenesis, and are also implicated in human leukemias. MOZ is also the catalytic subunit of a tetrameric inhibitor of growth 5 (ING5) complex, which specifically acetylates nucleosomal histone H3K14. Moreover, MOZ and MORF are involved in regulating transcriptional activation mediated by Runx2 (or Cbfa1), a Runt-domain transcription factor known to play important roles in T cell lymphomagenesis and bone development, and its homologs. MOZ contains a linker histone 1 and histone 5 domains and two plant homeodomain (PHD) fingers. In contrast, MORF contains an N-terminal region containing two PHD fingers, a putative HAT domain, an acidic region, and a C-terminal Ser/Met-rich domain. The family corresponds to the first PHD finger.


Pssm-ID: 277002  Cd Length: 46  Bit Score: 68.17  E-value: 3.73e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15527      2 CSVCQDSGNADNLLFCDACDKGFHMECHDPPLTRMPKGKWVCQIC 46
MBD cd00122
MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of ...
758-802 4.49e-14

MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.


Pssm-ID: 238069  Cd Length: 62  Bit Score: 68.50  E-value: 4.49e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240  758 ELRIPLDYGWQRETRVRNFGGRLPGEVAYYAPCGKKLRQCPDMVK 802
Cdd:cd00122      1 PLRDPLPPGWKRELVIRKSGSAGKGDVYYYSPCGKKLRSKPEVAR 45
PHD1_KDM5B cd15603
PHD finger 1 found in lysine-specific demethylase 5B (KDM5B); KDM5B (also termed Cancer/testis ...
1963-2007 7.11e-14

PHD finger 1 found in lysine-specific demethylase 5B (KDM5B); KDM5B (also termed Cancer/testis antigen 31 (CT31), Histone demethylase JARID1B, Jumonji/ARID domain-containing protein 1B (JARID1B), PLU-1, or retinoblastoma-binding protein 2 homolog 1 (RBP2-H1 or RBBP2H1A)) is a member of the JARID subfamily within the JmjC proteins. It has a restricted expression pattern in the testis, ovary, and transiently in the mammary gland of pregnant females and has been shown to be upregulated in breast cancer, prostate cancer, and lung cancer, suggesting a potential role in tumorigenesis. KDM5B acts as a histone demethylase that catalyzes the removal of trimethylation of lysine 4 on histone H3 (H3K4me3), induced by polychlorinated biphenyls (PCBs). It also mediates demethylation of H3K4me2 and H3K4me1. Moreover, KDM5B functions as a negative regulator of hematopoietic stem cell (HSC) self-renewal and progenitor cell activity. KDM5B has also been shown to interact with the DNA binding transcription factors BF-1 and PAX9, as well as TIEG1/KLF10 (transforming growth factor-beta inducible early gene-1/Kruppel-like transcription factor 10), and possibly function as a transcriptional corepressor. KDM5B contains the catalytic JmjC domain, JmjN, the BRIGHT domain, which is an AT-rich interacting domain (ARID), and a Cys5HisCys2 zinc finger, as well as three plant homeodomain (PHD) fingers. This model corresponds to the first PHD finger.


Pssm-ID: 277076  Cd Length: 46  Bit Score: 67.67  E-value: 7.11e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15603      2 CLVCGSGNDEDRLLLCDGCDDSYHTFCLIPPLHDVPKGDWRCPKC 46
Bromo_brd7_like cd05513
Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown ...
2114-2173 1.47e-13

Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99945  Cd Length: 98  Bit Score: 68.21  E-value: 1.47e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2114 FLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05513     22 FAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNNDYQSIEEFKDDFKLMCENAMKYNKPDT 81
Bromo_polybromo_V cd05515
Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which ...
2103-2175 2.41e-13

Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.


Pssm-ID: 99946  Cd Length: 105  Bit Score: 68.10  E-value: 2.41e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240 2103 TEMETHEDSWPFL-LPvNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDI 2175
Cdd:cd05515     16 TDGRGRRLSLIFMrLP-SKSEYPDYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQI 88
PHD2_KMT2C_like cd15510
PHD finger 2 found in Histone-lysine N-methyltransferase 2C (KMT2C) and 2D (KMT2D); KMT2C, ...
1963-2007 3.35e-13

PHD finger 2 found in Histone-lysine N-methyltransferase 2C (KMT2C) and 2D (KMT2D); KMT2C, also termed myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3) or homologous to ALR protein, is a histone H3 lysine 4 (H3K4) lysine methyltransferase that functions as a circadian factor contributing to genome-scale circadian transcription. It is a component of a large complex that acts as a coactivator of multiple transcription factors, including the bile acid (BA)-activated nuclear receptor, farnesoid X receptor (FXR), a critical player in BA homeostasis. The MLL3 complex is essential for p53 transactivation of small heterodimer partner (SHP). KMT2C is also a part of activating signal cointegrator-2 (ASC-2)-containing complex (ASCOM) that contains the transcriptional coactivator nuclear receptor coactivator 6 (NCOA6), KMT2C and its paralog MLL4. The ASCOM complex is critical for nuclear receptor (NR) activation of bile acid transporter genes and is down regulated in cholestasis. KMT2D, also termed ALL1-related protein (ALR), is encoded by the gene that was named MLL4, a fourth human homolog of Drosophila trithorax, located on chromosome 12. It enzymatically generates trimethylated histone H3 Lysine 4 (H3K4me3). It plays an essential role in differentiating the human pluripotent embryonal carcinoma cell line NTERA-2 clone D1 (NT2/D1) stem cells by activating differentiation-specific genes, such as HOXA1-3 and NESTIN. KMT2D is also a part of ASCOM. Both KMT2C and KMT2D contain the catalytic domain SET, five plant homeodomain (PHD) fingers, two extended PHD (ePHD) fingers, Cys2HisCys5HisCys2His, a RING finger, an HMG (high-mobilitygroup)-binding motif, and two FY-rich regions. This model corresponds to the second PHD finger.


Pssm-ID: 276985  Cd Length: 46  Bit Score: 65.53  E-value: 3.35e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15510      2 CQACRQPGDDTKMLVCETCDKGYHTSCLRPVMSSIPKYGWKCKNC 46
PHD2_CHD_II cd15532
PHD finger 2 found in class II Chromodomain-Helicase-DNA binding (CHD) proteins; Class II CHD ...
1962-2007 3.69e-13

PHD finger 2 found in class II Chromodomain-Helicase-DNA binding (CHD) proteins; Class II CHD proteins includes chromodomain-helicase-DNA-binding protein CHD3, CHD4, and CHD5, which are nuclear and ubiquitously expressed chromatin remodelling ATPases generally associated with histone deacetylases (HDACs). They are involved in DNA Double Strand Break (DSB) signaling, DSB repair and/or p53-dependent pathways such as apoptosis and senescence, as well as in the maintenance of genomic stability, and/or cancer prevention. They function as subunits of the Nucleosome Remodelling and Deacetylase (NuRD) complex, which is generally associated with gene repression, heterochromatin formation, and overall chromatin compaction. In contrast to the class I CHD enzymes (CHD1 and CHD2), class II CHD proteins lack identifiable DNA-binding domains, but possess a C-terminal coiled-coil region. Moreover, in addition to the tandem chromodomains and a helicase domain, they all harbor tandem plant homeodomain (PHD) zinc fingers involved in the recognition of methylated histone tails. This model corresponds to the second PHD finger.


Pssm-ID: 277007 [Multi-domain]  Cd Length: 43  Bit Score: 65.38  E-value: 3.69e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15532      1 FCRVCKDGGE---LLCCDGCPSSYHLHCLNPPLAEIPDGDWFCPRC 43
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
835-1048 7.65e-13

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 74.00  E-value: 7.65e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLRKLQ---KQEQARVAKEAKKQQAIMAAEEK 911
Cdd:pfam17380  376 RMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQeeaRQREVRRLEEERAREMERVRLEE 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  912 RKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQileakkkkkeeaanakllEAEkrtkekELRRQqavlLKHQELERHRLD 991
Cdd:pfam17380  456 QERQQQVERLRQQEEERKRKKLELEKEKRDRK------------------RAE------EQRRK----ILEKELEERKQA 507
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761240  992 MVwERERRRQhvMLMKAMEARKKAEEKERLKQEKRDEKRLNKERKlEQRRLELEMAK 1048
Cdd:pfam17380  508 MI-EEERKRK--LLEKEMEERQKAIYEEERRREAEEERRKQQEME-ERRRIQEQMRK 560
PHD2_KMT2D cd15595
PHD finger 2 found in Histone-lysine N-methyltransferase 2D (KMT2D); KMT2D, also termed ...
1963-2007 2.08e-12

PHD finger 2 found in Histone-lysine N-methyltransferase 2D (KMT2D); KMT2D, also termed ALL1-related protein (ALR), is encoded by the gene that was named myeloid/lymphoid or mixed-lineage leukemia 4 (MLL4), a fourth human homolog of Drosophila trithorax, located on chromosome 12. KMT2D enzymatically generates trimethylated histone H3 Lys 4 (H3K4me3). It plays an essential role in differentiating the human pluripotent embryonal carcinoma cell line NTERA-2 clone D1 (NT2/D1) stem cells by activating differentiation-specific genes, such asHOXA1-3 and NESTIN. It is also a part of activating signal cointegrator-2 (ASC-2)-containing complex (ASCOM) that contains the transcriptional coactivator nuclear receptor coactivator 6 (NCOA6), KMT2C and KMT2D. The ASCOM complex is critical for nuclear receptor (NR) activation of bile acid transporter genes and is down regulated in cholestasis. KMT2D contains the catalytic domain SET, five plant homeodomain (PHD) fingers, two extended PHD (ePHD) fingers, Cys2HisCys5HisCys2His, a RING finger, an HMG (high-mobility group)-binding motif, and two FY-rich regions. This model corresponds to the second PHD finger.


Pssm-ID: 277070  Cd Length: 46  Bit Score: 63.48  E-value: 2.08e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15595      2 CQTCRKPGEDSKMLVCEACDKGYHTFCLKPAMESLPTDSWKCKAC 46
PHD_SF cd15489
PHD finger superfamily; The PHD finger superfamily includes a canonical plant homeodomain (PHD) ...
1962-2007 2.61e-12

PHD finger superfamily; The PHD finger superfamily includes a canonical plant homeodomain (PHD) finger typically characterized as Cys4HisCys3, and a non-canonical extended PHD finger, characterized as Cys2HisCys5HisCys2His. Variations include the RAG2 PHD finger characterized by Cys3His2Cys2His and the PHD finger 5 found in nuclear receptor-binding SET domain-containing proteins characterized by Cys4HisCys2His. The PHD finger is also termed LAP (leukemia-associated protein) motif or TTC (trithorax consensus) domain. Single or multiple copies of PHD fingers have been found in a variety of eukaryotic proteins involved in the control of gene transcription and chromatin dynamics. PHD fingers can recognize the unmodified and modified histone H3 tail, and some have been found to interact with non-histone proteins. They also function as epigenome readers controlling gene expression through molecular recruitment of multi-protein complexes of chromatin regulators and transcription factors. The PHD finger domain SF is structurally similar to the RING and FYVE_like superfamilies.


Pssm-ID: 276966 [Multi-domain]  Cd Length: 48  Bit Score: 63.10  E-value: 2.61e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1962 YCQICRK-GDNEELLLLCDGCDKGCHTYCHRPKITT-IPDGDWFCPAC 2007
Cdd:cd15489      1 SCIVCGKgGDLGGELLQCDGCGKWFHADCLGPPLSSfVPNGKWICPVC 48
PTZ00121 PTZ00121
MAEBL; Provisional
830-1062 3.19e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 72.48  E-value: 3.19e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  830 NPDRPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLRKLQKQ-EQARVAKEAKKQqaimaA 908
Cdd:PTZ00121  1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKaEEAKKADEAKKK-----A 1324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  909 EEKRKQKEQMKiiKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLE---AEKRTKEKELRRQQAVLLKHQEL 985
Cdd:PTZ00121  1325 EEAKKKADAAK--KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAkkkADAAKKKAEEKKKADEAKKKAEE 1402
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761240  986 ERHRLDMVWERERRRQhvmlmKAMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDMCLADQ 1062
Cdd:PTZ00121  1403 DKKKADELKKAAAAKK-----KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADE 1474
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
867-1063 4.50e-12

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 71.69  E-value: 4.50e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAqikllrkLQKQ-----EQARVAKEAKKQQAIMAAEEKRKQKEQMK---IIKQQEKIKRIQQIRMEKE 938
Cdd:pfam17380  316 RKLEEAEKARQAE-------MDRQaaiyaEQERMAMERERELERIRQEERKRELERIRqeeIAMEISRMRELERLQMERQ 388
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQIleakKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQhvMLMKAMEARKKAEEK 1018
Cdd:pfam17380  389 QKNERV----RQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERARE--MERVRLEEQERQQQV 462
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1039761240 1019 ERLKQEKRDEKR--LNKERKL-------EQRRLELEmaKELKKPKEDMCLADQK 1063
Cdd:pfam17380  463 ERLRQQEEERKRkkLELEKEKrdrkraeEQRRKILE--KELEERKQAMIEEERK 514
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
862-1052 5.22e-12

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 69.56  E-value: 5.22e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  862 DAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKE---AKKQQAIMAAEEKRKQKEQM-KIIKQQEKIK--RIQQIRM 935
Cdd:pfam13868  168 EEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDelrAKLYQEEQERKERQKEREEAeKKARQRQELQqaREEQIEL 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  936 EKELRAQQIleakkkKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRqhvmlmkAMEARKKA 1015
Cdd:pfam13868  248 KERRLAEEA------EREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQR-------AAEREEEL 314
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1039761240 1016 EEKERLKQEKRDekrlnkerklEQRRLELEMAKELKK 1052
Cdd:pfam13868  315 EEGERLREEEAE----------RRERIEEERQKKLKE 341
PHD2_KMT2C cd15594
PHD finger 2 found in Histone-lysine N-methyltransferase 2C (KMT2C); KMT2C, also termed ...
1963-2007 6.34e-12

PHD finger 2 found in Histone-lysine N-methyltransferase 2C (KMT2C); KMT2C, also termed myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3) or homologous to ALR protein, is a histone H3 lysine 4 (H3K4) lysine methyltransferase that functions as a circadian factor contributing to genome-scale circadian transcription. It is a component of a large complex that acts as a coactivator of multiple transcription factors, including the bile acid (BA)-activated nuclear receptor, farnesoid X receptor (FXR), a critical player in BA homeostasis. The MLL3 complex is essential for p53 transactivation of small heterodimer partner (SHP). KMT2C is also a part of activating signal cointegrator-2 (ASC-2)-containing complex (ASCOM) that contains the transcriptional coactivator nuclear receptor coactivator 6 (NCOA6), KMT2C and its paralog MLL4. The ASCOM complex is critical for nuclear receptor (NR) activation of bile acid transporter genes and is down regulated in cholestasis. KMT2C contains several plant homeodomain (PHD) fingers, two extended PHD (ePHD) fingers, Cys2HisCys5HisCys2His, an ATPase alpha beta signature, a high mobility group (HMG)-1 box, a SET (Suppressor of variegation, Enhancer of zeste, Trithorax) domain and two FY (phenylalanine tyrosine)-rich domains. This model corresponds to the second PHD finger.


Pssm-ID: 277069  Cd Length: 46  Bit Score: 61.88  E-value: 6.34e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15594      2 CQTCRQPGDDNKMLVCDTCDKGYHTFCLQPVMTTIPKNGWKCKNC 46
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
864-1057 6.54e-12

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 69.18  E-value: 6.54e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  864 KLLRKLQAQEIARQAAQIKLLRKLQKQeqarvaKEAKKQQAIMAAEEKRKQKEQMK--IIKQQEKIKRIQQIRMEKELRA 941
Cdd:pfam13868   55 RALEEEEEKEEERKEERKRYRQELEEQ------IEEREQKRQEEYEEKLQEREQMDeiVERIQEEDQAEAEEKLEKQRQL 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  942 QQILEAKkkkkeeaanaklLEAEKRTKEKELRRQQavllkhqELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERL 1021
Cdd:pfam13868  129 REEIDEF------------NEEQAEWKELEKEEER-------EEDERILEYLKEKAEREEEREAEREEIEEEKEREIARL 189
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039761240 1022 --KQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDM 1057
Cdd:pfam13868  190 raQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEA 227
PTZ00121 PTZ00121
MAEBL; Provisional
832-1055 6.86e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 71.33  E-value: 6.86e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  832 DRPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLR--KLQKQEQARVAKEAK-KQQAIMAA 908
Cdd:PTZ00121  1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEeeKKMKAEEAKKAEEAKiKAEELKKA 1628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  909 EEKRKQKEQMKIiKQQEKIKRIQQIRMEKElrAQQILEAKKKKKEEAANAkllEAEKRTKEKELRRQQAVLLKHQELERH 988
Cdd:PTZ00121  1629 EEEKKKVEQLKK-KEAEEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKK---KAEEAKKAEEDEKKAAEALKKEAEEAK 1702
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761240  989 RLDMVWERE----------RRRQHVMLMKAMEARKKAEEKERLKQE-KRDEKRLNKERKLeqRRLELEMAKELKKPKE 1055
Cdd:PTZ00121  1703 KAEELKKKEaeekkkaeelKKAEEENKIKAEEAKKEAEEDKKKAEEaKKDEEEKKKIAHL--KKEEEKKAEEIRKEKE 1778
Bromo_SP100C_like cd05501
Bromodomain, SP100C_like subfamily. The SP100C protein is a splice variant of SP100, a major ...
2126-2194 8.05e-12

Bromodomain, SP100C_like subfamily. The SP100C protein is a splice variant of SP100, a major component of PML-SP100 nuclear bodies (NBs), which are poorly understood. It is covalently modified by SUMO-1 and may play a role in processes at the chromatin level. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99933  Cd Length: 102  Bit Score: 63.60  E-value: 8.05e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240 2126 YKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDsDIGRAGHSMRKYFEKKWTDTF 2194
Cdd:cd05501     33 YCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYKDD-DFGQVGITLEKKFEKNFKEVF 100
PTZ00121 PTZ00121
MAEBL; Provisional
868-1062 8.33e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 71.33  E-value: 8.33e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  868 KLQAQEIARQAAQIKLLRKLQKQ-EQARVAKEAKKQqaimaAEEKRKQKEQMKiiKQQEKIKRIQQIRMEKELRAQQILE 946
Cdd:PTZ00121  1417 KKKADEAKKKAEEKKKADEAKKKaEEAKKADEAKKK-----AEEAKKAEEAKK--KAEEAKKADEAKKKAEEAKKADEAK 1489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  947 AKKKKKEEAANAKLLEAEKRTKEKELRR----QQAVLLKHQElERHRLDMVWERERRRQHVMLMKAMEARKkAEEKERLK 1022
Cdd:PTZ00121  1490 KKAEEAKKKADEAKKAAEAKKKADEAKKaeeaKKADEAKKAE-EAKKADEAKKAEEKKKADELKKAEELKK-AEEKKKAE 1567
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761240 1023 QEKRDEKRLNKE-------RKLEQRRLELEM----------AKELKKPKEDMCLADQ 1062
Cdd:PTZ00121  1568 EAKKAEEDKNMAlrkaeeaKKAEEARIEEVMklyeeekkmkAEEAKKAEEAKIKAEE 1624
PHD_UHRF1_2 cd15525
PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein UHRF1 and ...
1963-2007 9.38e-12

PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein UHRF1 and UHRF2; UHRF1 is a unique chromatin effector protein that integrates the recognition of both histone PTMs and DNA methylation. It is essential for cell proliferation and plays a critical role in the development and progression of many human carcinomas, such as laryngeal squamous cell carcinoma (LSCC), gastric cancer (GC), esophageal squamous cell carcinoma (ESCC), colorectal cancer, prostate cancer, and breast cancer. UHRF1 acts as a transcriptional repressor through its binding to histone H3 when it is unmodified at Arg2. Its overexpression in human lung fibroblasts results in downregulation of expression of the tumour suppressor pRB. It also plays a role in transcriptional repression of the cell cycle regulator p21. Moreover, UHRF1-dependent repression of transcription factors can facilitate the G1-S transition. It interacts with Tat-interacting protein of 60 kDa (TIP60) and induces degradation-independent ubiquitination of TIP60. It is also an N-methylpurine DNA glycosylase (MPG)-interacting protein that binds MPG in a p53 status-independent manner in the DNA base excision repair (BER) pathway. In addition, UHRF1 functions as an epigenetic regulator that is important for multiple aspects of epigenetic regulation, including maintenance of DNA methylation patterns and recognition of various histone modifications. UHRF2 was originally identified as a ubiquitin ligase acting as a small ubiquitin-like modifier (SUMO) E3 ligase that enhances zinc finger protein 131 (ZNF131) SUMOylation but does not enhance ZNF131 ubiquitination. It also ubiquitinates PCNP, a PEST-containing nuclear protein. Moreover, UHRF2 functions as a nuclear protein involved in cell-cycle regulation and has been implicated in tumorigenesis. It interacts with cyclins, CDKs, p53, pRB, PCNA, HDAC1, DNMTs, G9a, methylated histone H3 lysine 9, and methylated DNA. It interacts with the cyclin E-CDK2 complex, ubiquitinates cyclins D1 and E1, induces G1 arrest, and is involved in the G1/S transition regulation. Furthermore, UHRF2 is a direct transcriptional target of the transcription factor E2F-1 in the induction of apoptosis. It recruits HDAC1 and binds to methyl-CpG. UHRF2 also participates in the maturation of Hepatitis B virus (HBV) by interacting with the HBV core protein and promoting its degradation. Both UHRF1 and UHRF2 contain an N-terminal ubiquitin-like domain (UBL), a tandem Tudor domain (TTD), a plant homeodomain (PHD) finger, a SET- and RING-associated (SRA) domain, and a C-terminal RING finger.


Pssm-ID: 277000  Cd Length: 47  Bit Score: 61.62  E-value: 9.38e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGD-WFCPAC 2007
Cdd:cd15525      2 CHVCGGKQDPEKQLLCDECDMAYHLYCLDPPLTSLPDDDeWYCPDC 47
PTZ00121 PTZ00121
MAEBL; Provisional
819-1055 1.00e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 70.94  E-value: 1.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  819 RAMDGRRGRPP-NPDRPRAREESRMKRRKGRPPNVGSAEfldntDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAK 897
Cdd:PTZ00121  1147 KAEDAKRVEIArKAEDARKAEEARKAEDAKKAEAARKAE-----EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAE 1221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  898 EAKKQQAIMAAEEKRKQKEQMkiiKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAK--LLEAEKRTKEKELRRQ 975
Cdd:PTZ00121  1222 DAKKAEAVKKAEEAKKDAEEA---KKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKAdeLKKAEEKKKADEAKKA 1298
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  976 QAVlLKHQELERHrldmvwERERRRQHVMLMKAMEARKKAEEKERlKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKE 1055
Cdd:PTZ00121  1299 EEK-KKADEAKKK------AEEAKKADEAKKKAEEAKKKADAAKK-KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
PTZ00121 PTZ00121
MAEBL; Provisional
835-1062 1.84e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 70.17  E-value: 1.84e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKGRPPNVGSAEflDNTDAKLLRKLQAQEIARQAAQIKLLRKLQ---KQEQARVAKEAKKQQAIMAAEEK 911
Cdd:PTZ00121  1110 KAEEARKAEEAKKKAEDARKAE--EARKAEDARKAEEARKAEDAKRVEIARKAEdarKAEEARKAEDAKKAEAARKAEEV 1187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  912 RKQKEqmkiIKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKrtKEKELRRQQAVllKHQELERHRLD 991
Cdd:PTZ00121  1188 RKAEE----LRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKK--DAEEAKKAEEE--RNNEEIRKFEE 1259
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240  992 MVWERERRRQHVMlmKAMEARKKAEEKERLKQEKRDEKRLNKERKleqrrleleMAKELKKPKEDMCLADQ 1062
Cdd:PTZ00121  1260 ARMAHFARRQAAI--KAEEARKADELKKAEEKKKADEAKKAEEKK---------KADEAKKKAEEAKKADE 1319
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
865-1057 1.86e-11

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 68.02  E-value: 1.86e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  865 LLRKLQAQEIAR-QAAQI---KLLRKLQKQEQARVAKEAKKQ-----QAIMAAEEKRKQK--EQMKIIKQQ--EKIKRIQ 931
Cdd:pfam13868   11 LNSKLLAAKCNKeRDAQIaekKRIKAEEKEEERRLDEMMEEEreralEEEEEKEEERKEErkRYRQELEEQieEREQKRQ 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  932 QIRMEKELRAQQILeakkkkkeeAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVwERERRRQHVMLMKAMEA 1011
Cdd:pfam13868   91 EEYEEKLQEREQMD---------EIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWK-ELEKEEEREEDERILEY 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1012 RKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDM 1057
Cdd:pfam13868  161 LKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDEL 206
Bromo_WDR9_I_like cd05529
Bromodomain; WDR9 repeat I_like subfamily. WDR9 is a human gene located in the Down Syndrome ...
2078-2177 2.36e-11

Bromodomain; WDR9 repeat I_like subfamily. WDR9 is a human gene located in the Down Syndrome critical region-2 of chromosome 21. It encodes for a nuclear protein containing WD40 repeats and two bromodomains, which may function as a transcriptional regulator involved in chromatin remodeling and play a role in embryonic development. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99958  Cd Length: 128  Bit Score: 63.12  E-value: 2.36e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2078 AESTTSIKKP-----KKDESRDLALCSMILTE----METHEDSW--PFLLPVNLKL-VPGYKKVIKKPMDFSTIREKLNN 2145
Cdd:cd05529      1 LYNPLSSEWElfdpgWEQPHIRDEERERLISGldklLLSLQLEIaeYFEYPVDLRAwYPDYWNRVPVPMDLETIRSRLEN 80
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1039761240 2146 GQYPNFETFALDVRLVFDNCETFNEDDSDIGR 2177
Cdd:cd05529     81 RYYRSLEALRHDVRLILSNAETFNEPNSEIAK 112
PHD1_MTF2_PHF19_like cd15499
PHD finger 1 found in polycomb repressive complex 2 (PRC2)-associated polycomb-like (PCL) ...
1963-2008 2.40e-11

PHD finger 1 found in polycomb repressive complex 2 (PRC2)-associated polycomb-like (PCL) family proteins MTF2, PHF19, and similar proteins; The family includes two PCL family proteins, metal-response element-binding transcription factor 2 (MTF2/PCL2) and PHF19/PCL3, which are homologs of PHD finger protein1 (PHF1). PCL family proteins are accessory components of the polycomb repressive complex 2 (PRC2) core complex and all contain an N-terminal Tudor domain followed by two PHD fingers, and a C-terminal MTF2 domain. They specifically recognize tri-methylated H3K36 (H3K36me3) through their N-terminal Tudor domains. The interaction between their Tudor domains and H3K36me3 is critical for both the targeting and spreading of PRC2 into active chromatin regions and for the maintenance of optimal repression of poised developmental genes where PCL proteins, H3K36me3, and H3K27me3 coexist. Moreover, unlike other PHD finger-containing proteins, the first PHD fingers of PCL proteins do not display histone H3K4 binding affinity and they do not affect the Tudor domain binding to histones. This model corresponds to the first PHD finger.


Pssm-ID: 276974  Cd Length: 53  Bit Score: 60.59  E-value: 2.40e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1963 CQICRKGDNEEL--LLLCDGCDKGCHTYCHRPKITTIP---DGDWFCPACI 2008
Cdd:cd15499      2 CSICGGAEARDGneILICDKCDKGYHQLCHSPKVRTSPlegDEKWFCSRCV 52
PTZ00121 PTZ00121
MAEBL; Provisional
833-1056 2.44e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 69.78  E-value: 2.44e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  833 RPRAREESRMKRRKGRPPNVGSAEfldNTDAKLLRKLQAQEIARQAAQIKLLRKLQK--QEQARVAKEAKK-QQAIMAAE 909
Cdd:PTZ00121  1437 KKKAEEAKKADEAKKKAEEAKKAE---EAKKKAEEAKKADEAKKKAEEAKKADEAKKkaEEAKKKADEAKKaAEAKKKAD 1513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  910 EKRKQKEQMKI--IKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELER 987
Cdd:PTZ00121  1514 EAKKAEEAKKAdeAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEAR 1593
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  988 HRLDM-VWERERRRQHVMLMKAMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRlelemAKELKKPKED 1056
Cdd:PTZ00121  1594 IEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK-----AEELKKAEEE 1658
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
866-1046 2.73e-11

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 69.00  E-value: 2.73e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  866 LRKLQAQEIARQAAQIKLLRKLQ-----KQEQARVAKEAKKQQAIMAAEEKRKQKEQM----KIIKQQEKIKRIQQIRME 936
Cdd:pfam17380  362 LERIRQEEIAMEISRMRELERLQmerqqKNERVRQELEAARKVKILEEERQRKIQQQKvemeQIRAEQEEARQREVRRLE 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  937 KElRAQQIleakkkkkeeaanaklleaeKRTKEKELRRQQavllkhqELERHRLDmvwERERRRQHVMLMKamEARKKAE 1016
Cdd:pfam17380  442 EE-RAREM--------------------ERVRLEEQERQQ-------QVERLRQQ---EEERKRKKLELEK--EKRDRKR 488
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1039761240 1017 EKER----LKQEKRDEKRLNKERKLEQRRLELEM 1046
Cdd:pfam17380  489 AEEQrrkiLEKELEERKQAMIEEERKRKLLEKEM 522
Bromo_brd8_like cd05507
Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with ...
2101-2183 2.75e-11

Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99939  Cd Length: 104  Bit Score: 62.00  E-value: 2.75e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2101 ILTEMETHEDSWPFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGH 2180
Cdd:cd05507     11 VYRTLASHRYASVFLKPVTEDIAPGYHSVVYRPMDLSTIKKNIENGTIRSTAEFQRDVLLMFQNAIMYNSSDHDVYLMAV 90

                   ...
gi 1039761240 2181 SMR 2183
Cdd:cd05507     91 EMQ 93
PHD2_d4 cd15530
PHD finger 2 found in d4 gene family proteins; The family includes proteins coded by three ...
1962-2007 5.27e-11

PHD finger 2 found in d4 gene family proteins; The family includes proteins coded by three members of the d4 gene family, DPF1 (neuro-d4), DPF2 (ubi-d4/Requiem), and DPF3 (cer-d4), which function as transcription factors and are involved in transcriptional regulation of genes by changing the condensed/decondensed state of chromatin in the nucleus. DPF2 is ubiquitously expressed and it acts as a transcription factor that may participate in developmentally programmed cell death. DPF1 and DPF3 are expressed predominantly in neural tissues, and they may be involved in the transcription regulation of neuro-specific gene clusters. The d4 family proteins show distinct domain organization with domain 2/3 in the N-terminal region, a Cys2His2 (C2H2) zinc finger or Kruppel-type zinc finger in the central part and two adjacent plant homeodomain (PHD) fingers (d4-domain) in the C-terminal part of the molecule. This model corresponds to the second PHD finger.


Pssm-ID: 277005  Cd Length: 46  Bit Score: 59.32  E-value: 5.27e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15530      1 SCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSCHLC 46
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
868-1049 6.88e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 68.04  E-value: 6.88e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  868 KLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIK----RIQQIRMEKELRAQQ 943
Cdd:COG1196    252 EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRReleeRLEELEEELAELEEE 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  944 ILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKhQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQ 1023
Cdd:COG1196    332 LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE-AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEE 410
                          170       180
                   ....*....|....*....|....*...
gi 1039761240 1024 --EKRDEKRLNKERKLEQRRLELEMAKE 1049
Cdd:COG1196    411 alLERLERLEEELEELEEALAELEEEEE 438
PHD1_AIRE cd15539
PHD finger 1 found in autoimmune regulator (AIRE); AIRE, also termed autoimmune ...
1962-2007 7.27e-11

PHD finger 1 found in autoimmune regulator (AIRE); AIRE, also termed autoimmune polyendocrinopathy candidiasis ectodermal dystrophy (APECED) protein, functions as a regulator of gene transcription in the thymus. It is essential for prevention of autoimmunity. AIRE plays a critical role in the induction of central tolerance. It promotes self-tolerance through tissue-specific antigen (TSA) expression. It also acts as an active regulator of chondrocyte differentiation. AIRE contains a homogeneously-staining region (HSR) or caspase-recruitment domain (CARD), a nuclear localization signal (NLS), a SAND (for Sp100, AIRE, nuclear phosphoprotein 41/75 or NucP41/75, and deformed epidermal auto regulatory factor 1 or Deaf1) domain, two plant homeodomain (PHD) fingers, and four LXXLL (where L stands for leucine) motifs. This model corresponds to the first PHD finger that recognizes the unmethylated tail of histone H3 and targets AIRE-dependent genes.


Pssm-ID: 277014 [Multi-domain]  Cd Length: 43  Bit Score: 59.00  E-value: 7.27e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15539      1 ECAVCGDGGE---LLCCDGCPRAFHLACLVPPLTLIPSGTWRCSSC 43
MBD pfam01429
Methyl-CpG binding domain; The Methyl-CpG binding domain (MBD) binds to DNA that contains one ...
753-802 7.96e-11

Methyl-CpG binding domain; The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase.


Pssm-ID: 396147 [Multi-domain]  Cd Length: 76  Bit Score: 59.68  E-value: 7.96e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240  753 VADDQELRIPLDYGWQRETRVRNFGG-RLPGEVAYYAPCGKKLRQCPDMVK 802
Cdd:pfam01429    1 IERKREDRLPLPPGWRREERQRKSGSkAGKVDVFYYSPTGKKLRSKSEVAR 51
PTZ00121 PTZ00121
MAEBL; Provisional
819-1056 8.27e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.86  E-value: 8.27e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  819 RAMDGRRGRPP-NPDRPRAREESRMKRRKGRPPNVGSAEflDNTDAKLLRKLQAQ---EIARQAAQIKLLRKLQKQEQAR 894
Cdd:PTZ00121  1123 KAEDARKAEEArKAEDARKAEEARKAEDAKRVEIARKAE--DARKAEEARKAEDAkkaEAARKAEEVRKAEELRKAEDAR 1200
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  895 VAKEAKKQQAIMAAEEKRKQKEQMKI---------IKQQEKIKRIQQIRMEKELRAQQileaKKKKKEEAANAKLLEAEK 965
Cdd:PTZ00121  1201 KAEAARKAEEERKAEEARKAEDAKKAeavkkaeeaKKDAEEAKKAEEERNNEEIRKFE----EARMAHFARRQAAIKAEE 1276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  966 RTKEKELRRQQAVLlKHQEL----ERHRLDMVWER--ERRRQHVMLMKAMEARKKAEEKERLKQEKR------------- 1026
Cdd:PTZ00121  1277 ARKADELKKAEEKK-KADEAkkaeEKKKADEAKKKaeEAKKADEAKKKAEEAKKKADAAKKKAEEAKkaaeaakaeaeaa 1355
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1027 -------------DEKRLNKERK----LEQRRLELEMAKELKKPKED 1056
Cdd:PTZ00121  1356 adeaeaaeekaeaAEKKKEEAKKkadaAKKKAEEKKKADEAKKKAEE 1402
Bromo_polybromo_IV cd05518
Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which ...
2121-2189 2.37e-10

Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.


Pssm-ID: 99949 [Multi-domain]  Cd Length: 103  Bit Score: 59.38  E-value: 2.37e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240 2121 KLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05518     34 KDYPDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILEKVLKEK 102
Bromo_SNF2L2 cd05516
Bromodomain, SNF2L2-like subfamily, specific to animals. SNF2L2 (SNF2-alpha) or SWI ...
2113-2175 2.54e-10

Bromodomain, SNF2L2-like subfamily, specific to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 2 is a global transcriptional activator, which cooperates with nuclear hormone receptors to boost transcriptional activation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99947  Cd Length: 107  Bit Score: 59.36  E-value: 2.54e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240 2113 PFL-LPVNlKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDI 2175
Cdd:cd05516     27 VFIqLPSR-KELPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNLEGSLI 89
PHD_UHRF1 cd15616
PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein 1 (UHRF1); ...
1963-2007 4.31e-10

PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein 1 (UHRF1); UHRF1 (also termed inverted CCAAT box-binding protein of 90 kDa, nuclear protein 95, nuclear zinc finger protein Np95 (Np95), RING finger protein 106, transcription factor ICBP90, or E3 ubiquitin-protein ligase UHRF1) is a unique chromatin effector protein that integrates the recognition of both histone PTMs and DNA methylation. It is essential for cell proliferation and plays a critical role in the development and progression of many human carcinomas, such as laryngeal squamous cell carcinoma (LSCC), gastric cancer (GC), esophageal squamous cell carcinoma (ESCC), colorectal cancer, prostate cancer, and breast cancer. UHRF1 acts as a transcriptional repressor through its binding to histone H3 when it is unmodified at Arg2. Its overexpression in human lung fibroblasts results in downregulation of expression of the tumour suppressor pRB. It also plays a role in transcriptional repression of the cell cycle regulator p21. Moreover, UHRF1-dependent repression of transcription factors can facilitate the G1-S transition. It interacts with Tat-interacting protein of 60 kDa (TIP60) and induces degradation-independent ubiquitination of TIP60. It is also an N-methylpurine DNA glycosylase (MPG)-interacting protein that binds MPG in a p53 status-independent manner in the DNA base excision repair (BER) pathway. In addition, UHRF1 functions as an epigenetic regulator that is important for multiple aspects of epigenetic regulation, including maintenance of DNA methylation patterns and recognition of various histone modifications. UHRF1 contains an N-terminal ubiquitin-like domain (UBL), a tandem Tudor domain (TTD), a plant homeodomain (PHD) finger, a SET and RING finger associated (SRA) domain, and a C-terminal RING-finger domain. It specifically binds to hemimethylated DNA, double-stranded CpG dinucleotides, and recruits the maintenance methyltransferase DNMT1 to its hemimethylated DNA substrate through its SRA domain. UHRF1-dependent H3K23 ubiquitylation has an essential role in maintaining DNA methylation and replication. The tandem Tudor domain directs UHRF1 binding to the heterochromatin mark histone H3K9me3 and the PHD finger targets UHRF1 to unmodified histone H3 in euchromatic regions. The RING-finger domain exhibit both autocatalytic E3 ubiquitin (Ub) ligase activity and activity against histone H3 and DNMT1.


Pssm-ID: 277088  Cd Length: 47  Bit Score: 56.90  E-value: 4.31e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIP-DGDWFCPAC 2007
Cdd:cd15616      2 CHVCGGKQDPDKQLMCDECDMAFHIYCLNPPLSSIPdDEDWYCPEC 47
PHD2_PHF10 cd15529
PHD finger 2 found in PHD finger protein 10 (PHF10) and similar proteins; PHF10, also termed ...
1963-2007 4.38e-10

PHD finger 2 found in PHD finger protein 10 (PHF10) and similar proteins; PHF10, also termed BRG1-associated factor 45a (BAF45a), or XAP135, is a ubiquitously expressed transcriptional regulator that is required for maintaining the undifferentiated status of neuroblasts. It contains a SAY (supporter of activation of yellow) domain and two adjacent plant homeodomain (PHD) fingers. This model corresponds to the second PHD finger.


Pssm-ID: 277004  Cd Length: 44  Bit Score: 56.55  E-value: 4.38e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYChrPKITTIPDGDWFCPAC 2007
Cdd:cd15529      2 CTKCGDPHDEDKMMFCDQCDRGYHTFC--VGLRSIPDGRWICPLC 44
PHD5_KMT2C_like cd15513
PHD finger 5 found in Histone-lysine N-methyltransferase 2C (KMT2C) and PHD finger 4 found in ...
1963-2008 4.61e-10

PHD finger 5 found in Histone-lysine N-methyltransferase 2C (KMT2C) and PHD finger 4 found in KMT2D; KMT2C, also termed myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3), or homologous to ALR protein, is a histone H3 lysine 4 (H3K4) lysine methyltransferase that functions as a circadian factor contributing to genome-scale circadian transcription. It is a component of a large complex that acts as a coactivator of multiple transcription factors, including the bile acid (BA)-activated nuclear receptor, farnesoid X receptor (FXR), a critical player in BA homeostasis. The MLL3 complex is essential for p53 transactivation of small heterodimer partner (SHP). KMT2C is also a part of activating signal cointegrator-2 (ASC-2)-containing complex (ASCOM) that contains the transcriptional coactivator nuclear receptor coactivator 6 (NCOA6), KMT2C and its paralog MLL4. The ASCOM complex is critical for nuclear receptor (NR) activation of bile acid transporter genes and is down regulated in cholestasis. KMT2D, also termed ALL1-related protein (ALR), is encoded by the gene that was named MLL4, a fourth human homolog of Drosophila trithorax, located on chromosome 12. It enzymatically generates trimethylated histone H3 Lysine 4 (H3K4me3). It plays an essential role in differentiating the human pluripotent embryonal carcinoma cell line NTERA-2 clone D1 (NT2/D1) stem cells by activating differentiation-specific genes, such as HOXA1-3 and NESTIN. KMT2D is also a part of ASCOM. Both KMT2C and KMT2D contain the catalytic domain SET, several plant homeodomain (PHD) fingers, extended PHD (ePHD) fingers, Cys2HisCys5HisCys2His, a RING finger, an HMG (high-mobility group)-binding motif, and two FY-rich regions. This model corresponds to the fifth PHD finger of KMT2C and the fourth PHD finger of KMT2D.


Pssm-ID: 276988  Cd Length: 47  Bit Score: 56.72  E-value: 4.61e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPACI 2008
Cdd:cd15513      2 CEGCGKASDESRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCV 47
Bromo_ASH1 cd05525
Bromodomain; ASH1_like sub-family. ASH1 (absent, small, or homeotic 1) is a member of the ...
2101-2187 5.22e-10

Bromodomain; ASH1_like sub-family. ASH1 (absent, small, or homeotic 1) is a member of the trithorax-group in Drosophila melanogaster, an epigenetic transcriptional regulator of HOX genes. Drosophila ASH1 has been shown to methylate specific lysines in histones H3 and H4. Mammalian ASH1 has been shown to methylate histone H3. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99955 [Multi-domain]  Cd Length: 106  Bit Score: 58.56  E-value: 5.22e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2101 ILTEMETHEDSW------PFL-LPVNLKlVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05525     10 ICDAIITYKDSNgqslaiPFInLPSKKK-NPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYYGRKS 88
                           90
                   ....*....|....
gi 1039761240 2174 DIGRAGHSMRKYFE 2187
Cdd:cd05525     89 PIGRDVCRLRKAYY 102
PTZ00121 PTZ00121
MAEBL; Provisional
835-1062 1.47e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.01  E-value: 1.47e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKGRPpnvgSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLRKlQKQEQARVAKEAKKQQAIMAAEEKRKQ 914
Cdd:PTZ00121  1230 KKAEEAKKDAEEAKK----AEEERNNEEIRKFEEARMAHFARRQAAIKAEEA-RKADELKKAEEKKKADEAKKAEEKKKA 1304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  915 KEQMKIIKQQEKIKRIQQIRMEKELRAQQiLEAKKKKKEEAANAKLLEAEKRTKEKEL--RRQQAVLLKHQELERHRLDM 992
Cdd:PTZ00121  1305 DEAKKKAEEAKKADEAKKKAEEAKKKADA-AKKKAEEAKKAAEAAKAEAEAAADEAEAaeEKAEAAEKKKEEAKKKADAA 1383
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240  993 VWERERRRqhvmlmKAMEARKKAEE-KERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDMCLADQ 1062
Cdd:PTZ00121  1384 KKKAEEKK------KADEAKKKAEEdKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE 1448
PHD_BRPF_JADE_like cd15492
PHD finger found in BRPF proteins, Jade proteins, protein AF-10, AF-17, and similar proteins; ...
1963-2007 1.50e-09

PHD finger found in BRPF proteins, Jade proteins, protein AF-10, AF-17, and similar proteins; The family includes BRPF proteins, Jade proteins, protein AF-10 and AF-17. BRPF proteins are scaffold proteins that form monocytic leukemic zinc-finger protein (MOZ)/MOZ-related factor (MORF) H3 histone acetyltransferase (HAT) complexes with other regulatory subunits, such as inhibitor of growth 5 (ING5) and Esa1-associated factor 6 ortholog (EAF6). BRPF proteins have multiple domains, including a canonical Cys4HisCys3 plant homeodomain (PHD) zinc finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, a bromodomain and a proline-tryptophan-tryptophan-proline (PWWP) domain. PHD and ePHD fingers both bind to lysine 4 of histone H3 (K4H3), bromodomains interact with acetylated lysines on N-terminal tails of histones and other proteins, and PWWP domains show histone-binding and chromatin association properties. Jade proteins are required for ING4 and ING5 to associate with histone acetyltransferase (HAT) HBO1 and EAF6, to form a HBO1 complex that has a histone H4-specific acetyltransferase activity, a reduced activity toward histone H3, and is responsible for the bulk of histone H4 acetylation in vivo. AF-10, also termed ALL1 (acute lymphoblastic leukemia)-fused gene from chromosome 10 protein, is a transcription factor that has been implicated in the development of leukemia following chromosomal rearrangements between the AF10 gene and one of at least two other genes, MLL and CALM. AF-17, also termed ALL1-fused gene from chromosome 17 protein, is a putative transcription factor that may play a role in multiple signaling pathways. All Jade proteins, AF-10, and AF-17 contain a canonical PHD finger followed by a non-canonical ePHD finger. This model corresponds to the canonical PHD finger.


Pssm-ID: 276967 [Multi-domain]  Cd Length: 46  Bit Score: 55.32  E-value: 1.50e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15492      2 CDVCLDGESEDdnEIVFCDGCNVAVHQSCY--GIPLIPEGDWFCRKC 46
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
858-1052 2.01e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.03  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  858 LDNTDAKLLRkLQA--QEIARQaaqiklLRKLQKQeqARVAKEAkkqqaimaaeekRKQKEQMKIIKQQEKIKRIQQIRM 935
Cdd:COG1196    181 LEATEENLER-LEDilGELERQ------LEPLERQ--AEKAERY------------RELKEELKELEAELLLLKLRELEA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  936 EKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKA 1015
Cdd:COG1196    240 ELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLE 319
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039761240 1016 EEKERLKQEKRDEKRLNKERK-LEQRRLELEMAKELKK 1052
Cdd:COG1196    320 ELEEELAELEEELEELEEELEeLEEELEEAEEELEEAE 357
PHD_Phf1p_Phf2p_like cd15502
PHD finger found in Schizosaccharomyces pombe SWM histone demethylase complex subunits Phf1 ...
1963-2007 2.26e-09

PHD finger found in Schizosaccharomyces pombe SWM histone demethylase complex subunits Phf1 (Phf1p) and Phf2 (Phf2p); Phf1p and Phf2p are components of the SWM histone demethylase complex that specifically demethylates histone H3 at lysine 9 (H3K9me2), a specific tag for epigenetic transcriptional activation. They function as corepressors and play roles in regulating heterochromatin propagation and euchromatic transcription. Both Phf1p and Phf2p contain a plant homeodomain (PHD) finger.


Pssm-ID: 276977  Cd Length: 52  Bit Score: 55.13  E-value: 2.26e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHRPKITT----IPDGDWFCPAC 2007
Cdd:cd15502      2 CIVCQRGHSPKsnRIVFCDGCNTPYHQLCHDPSIDDevveDPDAEWFCKKC 52
Bromo_polybromo_III cd05520
Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which ...
2113-2189 2.28e-09

Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.


Pssm-ID: 99951  Cd Length: 103  Bit Score: 56.58  E-value: 2.28e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761240 2113 PFL-LPVNlKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05520     26 PFLkLPSK-RKYPDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQKLMQAK 102
PTZ00121 PTZ00121
MAEBL; Provisional
836-1056 2.78e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 62.85  E-value: 2.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  836 AREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEI--ARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRK 913
Cdd:PTZ00121  1092 ATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAeeARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARK 1171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  914 QKEQMKIikqqEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEK---ELRRQQAVLLKHQEL----- 985
Cdd:PTZ00121  1172 AEDAKKA----EAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKkaeAVKKAEEAKKDAEEAkkaee 1247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  986 ERHRLDMVWERERRRQHV----MLMKAMEARK-----KAEEKERLKQEKRDEKRlNKERKLEQRRLELEMAKELKKPKED 1056
Cdd:PTZ00121  1248 ERNNEEIRKFEEARMAHFarrqAAIKAEEARKadelkKAEEKKKADEAKKAEEK-KKADEAKKKAEEAKKADEAKKKAEE 1326
MBD smart00391
Methyl-CpG binding domain; Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, ...
758-802 4.04e-09

Methyl-CpG binding domain; Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain


Pssm-ID: 128673  Cd Length: 77  Bit Score: 55.07  E-value: 4.04e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1039761240   758 ELRIPLDYGWQRETRVRNFG-GRLPGEVAYYAPCGKKLRQCPDMVK 802
Cdd:smart00391    3 PLRLPLPCGWRRETKQRKSGrSAGKFDVYYISPCGKKLRSKSELAR 48
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
872-1052 4.26e-09

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 61.51  E-value: 4.26e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  872 QEIARQAAQIKLLRKLQKQEQAR-VAKEAKKQQAIMAAEEKRKQKEQMKIIkQQEKIKRIQQirMEKELRaqqileakkk 950
Cdd:pfam15709  313 EERSEEDPSKALLEKREQEKASRdRLRAERAEMRRLEVERKRREQEEQRRL-QQEQLERAEK--MREELE---------- 379
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  951 kkeeaanaklLEAEKRTKEKELRRQQavllkhQELERHRldmvWERERRRQHVMLMKAME-ARKKAEEKERLKQE---KR 1026
Cdd:pfam15709  380 ----------LEQQRRFEEIRLRKQR------LEEERQR----QEEEERKQRLQLQAAQErARQQQEEFRRKLQElqrKK 439
                          170       180
                   ....*....|....*....|....*...
gi 1039761240 1027 DEKRLNKERKLEQRRLELEM--AKELKK 1052
Cdd:pfam15709  440 QQEEAERAEAEKQRQKELEMqlAEEQKR 467
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
866-1052 5.78e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.49  E-value: 5.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  866 LRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQM--KIIKQQEKIKRIQQIRMEKELRAQQ 943
Cdd:COG1196    276 LEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELeeELEELEEELEELEEELEEAEEELEE 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  944 ILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQ 1023
Cdd:COG1196    356 AEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEE 435
                          170       180
                   ....*....|....*....|....*....
gi 1039761240 1024 EKRDEKRLNKERKLEQRRLELEMAKELKK 1052
Cdd:COG1196    436 EEEEEEEALEEAAEEEAELEEEEEALLEL 464
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
869-1052 5.78e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.49  E-value: 5.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  869 LQAQEIARQAAQIKLLRKLQKQEQARvakEAKKQQAIMAAEEKRKQKEQmKIIKQQEKIKRIQQIRMEKELRAQQILEAK 948
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEER---LEELEEELAELEEELEELEE-ELEELEEELEEAEEELEEAEAELAEAEEAL 367
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  949 KKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQEKRDE 1028
Cdd:COG1196    368 LEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEA 447
                          170       180
                   ....*....|....*....|....
gi 1039761240 1029 KRLNKERKLEQRRLELEMAKELKK 1052
Cdd:COG1196    448 AEEEAELEEEEEALLELLAELLEE 471
PHD_UHRF2 cd15617
PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein 2 (UHRF2); ...
1963-2007 7.50e-09

PHD finger found in ubiquitin-like PHD and RING finger domain-containing protein 2 (UHRF2); UHRF2 (also termed Np95/ICBP90-like RING finger protein (NIRF), Np95-like RING finger protein, nuclear protein 97, nuclear zinc finger protein Np97, RING finger protein 107, or E3 ubiquitin-protein ligase UHRF2) was originally identified as a ubiquitin ligase acting as a small ubiquitin-like modifier (SUMO) E3 ligase that enhances zinc finger protein 131 (ZNF131) SUMOylation but does not enhance ZNF131 ubiquitination. It also ubiquitinates PCNP, a PEST-containing nuclear protein. Moreover, UHRF2 functions as a nuclear protein involved in cell-cycle regulation and has been implicated in tumorigenesis. It interacts with cyclins, CDKs,p53, pRB, PCNA, HDAC1, DNMTs, G9a, methylated histone H3 lysine 9, and methylated DNA. It interacts with the cyclin E-CDK2 complex, ubiquitinates cyclins D1 and E1, induces G1 arrest, and is involved in the G1/S transition regulation. Furthermore, UHRF2 is a direct transcriptional target of the transcription factor E2F-1 in the induction of apoptosis. It recruits HDAC1 and binds to methyl-CpG. UHRF2 also participates in the maturation of Hepatitis B virus (HBV) by interacting with the HBV core protein and promoting its degradation. UHRF2 contains an N-terminal ubiquitin-like domain (UBL), a tandem Tudor domain (TTD), a plant homeodomain (PHD) finger, a SET- and RING-associated (SRA) domain, and a C-terminal RING finger.


Pssm-ID: 277089  Cd Length: 47  Bit Score: 53.42  E-value: 7.50e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIP-DGDWFCPAC 2007
Cdd:cd15617      2 CYVCGGKQDAHMQLLCDECNMAYHIYCLNPPLDKIPeDEDWYCPSC 47
PHD_TIF1_like cd15541
PHD finger found in the transcriptional intermediary factor 1 (TIF1) family and similar ...
1962-2007 8.49e-09

PHD finger found in the transcriptional intermediary factor 1 (TIF1) family and similar proteins; The TIF1 family of transcriptional cofactors includes TIF1alpha (TRIM24), TIF1beta (TRIM28), TIF1gamma (TRIM33), and TIF1delta (TRIM66), which are characterized by an N-terminal RING-finger B-box coiled-coil (RBCC/TRIM) motif and plant homeodomain (PHD) finger followed by a bromodomain in the C-terminal region. TIF1 proteins couple chromatin modifications to transcriptional regulation, signaling, and tumor suppression. They exert a deacetylase-dependent silencing effect when tethered to a promoter region. TIF1alpha, TIF1beta, and TIF1delta can homodimerize and contain a PXVXL motif necessary and sufficient for heterochromatin protein 1(HP1) binding. TIF1alpha and TIF1beta bind nuclear receptors and Kruppel-associated boxes (KRAB) specifically and respectively. In contrast, TIF1delta appears to lack nuclear receptor- and KRAB-binding activity. Moreover, TIF1delta is specifically involved in heterochromatin-mediated gene silencing during postmeiotic phases of spermatogenesis. TIF1gamma is structurally closely related to TIF1alpha and TIF1beta, but has very little functional features in common with them. It does not interact with the KRAB silencing domain of KOX1 or the heterochromatinic proteins HP1alpha, beta, and gamma. It cannot bind to nuclear receptors (NRs). This family also includes Sp100/Sp140 family proteins, the nuclear body SP100 and SP140. Sp110 is a leukocyte-specific component of the nuclear body. It may function as a nuclear hormone receptor transcriptional coactivator that may play a role in inducing differentiation of myeloid cells. It is also involved in resisting intracellular pathogens and functions as an important drug target for preventing intracellular pathogen diseases, such as tuberculosis, hepatic veno-occlusive disease, and intracellular cancers. SP140 is an interferon inducible nuclear leukocyte-specific protein involved in primary biliary cirrhosis and a risk factor in chronic lymphocytic leukemia. It is also implicated in innate immune response to human immunodeficiency virus type 1 (HIV-1) by binding to the virus viral infectivity factor (Vif) protein. Both Sp110 and Sp140 contain a SAND domain, a plant homeodomain (PHD) finger, and a bromodomain (BRD).


Pssm-ID: 277016 [Multi-domain]  Cd Length: 43  Bit Score: 53.12  E-value: 8.49e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15541      1 WCAVCQNGGE---LLCCDKCPRVFHLDCHIPPIPEFPSGEWSCSLC 43
DDT pfam02791
DDT domain; The DDT domain is named after (DNA binding homeobox and Different Transcription ...
1080-1140 1.25e-08

DDT domain; The DDT domain is named after (DNA binding homeobox and Different Transcription factors) and is approximately 60 residues in length. Along with the WHIM motifs, it comprises an entirely alpha helical module found in diverse eukaryotic chromatin proteins. Based on the structure of Ioc3, this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. In particular, the DDT domain, in combination with the WHIM1 and WHIM2 motifs form the SLIDE domain binding pocket.


Pssm-ID: 460696  Cd Length: 58  Bit Score: 52.89  E-value: 1.25e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1080 TFSDCLMVVQFLRNFGKVLGfdvnIDVPNLSVLQEGLLNIGDSMGEVQDLLVRLLSAAVCD 1140
Cdd:pfam02791    2 AFGDLLMVWEFLNSFGEVLG----LSPFTLDDFEEALLCTEEPSELLDEIHCALLKALVRD 58
COG5076 COG5076
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin ...
2052-2189 1.40e-08

Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227408 [Multi-domain]  Cd Length: 371  Bit Score: 59.43  E-value: 1.40e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 2052 SSSLKRGSKELKK------RKMEETTSLNLSKAEsttsikkPKKDESRDlalcSMILTEMETHEDSWPFLLPVNLKLVPG 2125
Cdd:COG5076    227 DSSVYVDAKELEKyflkliEEIPEEMLELSIKPG-------REEREERE----SVLITNSQAHVGAWPFLRPVSDEEVPD 295
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240 2126 YKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:COG5076    296 YYKDIRDPMDLSTKELKLRNNYYRPEETFVRDAKLFFDNCVMYNGEVTDYYKNANVLEDFVIKK 359
PTZ00121 PTZ00121
MAEBL; Provisional
819-1056 1.52e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.54  E-value: 1.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  819 RAMDGRRGRPPNPDRPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEI-ARQAAQIKLLRKLQKQEQARVAK 897
Cdd:PTZ00121  1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIkAEELKKAEEEKKKVEQLKKKEAE 1644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  898 EAKKqqaimaAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQ-----ILEAKKKKKEEAANAKLL---EAEKRTKE 969
Cdd:PTZ00121  1645 EKKK------AEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEedekkAAEALKKEAEEAKKAEELkkkEAEEKKKA 1718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  970 KELRRQQAV-LLKHQELERHRldmvwERERRrqhvmlmKAMEARKKAEEKERLKQEKRDEkrlnkERKLEQRRLELEMAK 1048
Cdd:PTZ00121  1719 EELKKAEEEnKIKAEEAKKEA-----EEDKK-------KAEEAKKDEEEKKKIAHLKKEE-----EKKAEEIRKEKEAVI 1781

                   ....*...
gi 1039761240 1049 ELKKPKED 1056
Cdd:PTZ00121  1782 EEELDEED 1789
PTZ00121 PTZ00121
MAEBL; Provisional
833-1055 1.65e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.54  E-value: 1.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  833 RPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKR 912
Cdd:PTZ00121  1073 KPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAK 1152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  913 KqkeqMKIIKQQEKIKRIQQIRMEKELRAQQileakkkkkEEAANAKLLEAEKRTKEKELRRQQAVlLKHQELERHRLDM 992
Cdd:PTZ00121  1153 R----VEIARKAEDARKAEEARKAEDAKKAE---------AARKAEEVRKAEELRKAEDARKAEAA-RKAEEERKAEEAR 1218
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761240  993 VWERERRRQHVmlMKAMEARKKAEEKERLKQEKRDEK-------------RLNKERKLEQRRL--ELEMAKELKKPKE 1055
Cdd:PTZ00121  1219 KAEDAKKAEAV--KKAEEAKKDAEEAKKAEEERNNEEirkfeearmahfaRRQAAIKAEEARKadELKKAEEKKKADE 1294
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
870-1056 1.76e-08

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 58.70  E-value: 1.76e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  870 QAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAimaaEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQILEAKK 949
Cdd:TIGR02794   51 QANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRA----AEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKA 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  950 KKkeeaanakllEAE-KRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQEKRDE 1028
Cdd:TIGR02794  127 KQ----------AAEaKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKA 196
                          170       180
                   ....*....|....*....|....*....
gi 1039761240 1029 KRLNKERKLEQRRLEL-EMAKELKKPKED 1056
Cdd:TIGR02794  197 KAEAAKAKAAAEAAAKaEAEAAAAAAAEA 225
PTZ00121 PTZ00121
MAEBL; Provisional
868-1063 2.47e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 59.77  E-value: 2.47e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  868 KLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKiikQQEKIKRIQQIRMEKELRAQQILEA 947
Cdd:PTZ00121  1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK---ADAAKKKAEEKKKADEAKKKAEEDK 1404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  948 KKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQEKRD 1027
Cdd:PTZ00121  1405 KKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1039761240 1028 EKRLNKERKLEQRRL-ELEMAKELKKPKEDMCLADQK 1063
Cdd:PTZ00121  1485 ADEAKKKAEEAKKKAdEAKKAAEAKKKADEAKKAEEA 1521
PHD1_CHD_II cd15531
PHD finger 1 found in class II Chromodomain-Helicase-DNA binding (CHD) proteins; Class II CHD ...
1962-2007 3.94e-08

PHD finger 1 found in class II Chromodomain-Helicase-DNA binding (CHD) proteins; Class II CHD proteins includes chromodomain-helicase-DNA-binding protein CHD3, CHD4, and CHD5, which are nuclear and ubiquitously expressed chromatin remodelling ATPases generally associated with histone deacetylases (HDACs). They are involved in DNA Double Strand Break (DSB) signaling, DSB repair and/or p53-dependent pathways such as apoptosis and senescence, as well as in the maintenance of genomic stability, and/or cancer prevention. They function as subunits of the Nucleosome Remodelling and Deacetylase (NuRD) complex, which is generally associated with gene repression, heterochromatin formation, and overall chromatin compaction. In contrast to the class I CHD enzymes (CHD1 and CHD2), class II CHD proteins lack identifiable DNA-binding domains, but possess a C-terminal coiled-coil region. Moreover, in addition to the tandem chromodomains and a helicase domain, they all harbor tandem plant homeodomain (PHD) zinc fingers involved in the recognition of methylated histone tails. This model corresponds to the first PHD finger.


Pssm-ID: 277006 [Multi-domain]  Cd Length: 43  Bit Score: 51.06  E-value: 3.94e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15531      1 YCEVCQQGGE---IILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 43
PHD_PRHA_like cd15504
PHD finger found in Arabidopsis thaliana pathogenesis-related homeodomain protein (PRHA) and ...
1962-2007 5.34e-08

PHD finger found in Arabidopsis thaliana pathogenesis-related homeodomain protein (PRHA) and similar proteins; PRHA is a homeodomain protein encoded by a single-copy Arabidopsis thaliana homeobox gene, prha. It shows the capacity to bind to TAATTG core sequence elements but requires additional adjacent bases for high-affinity binding. PRHA contains a plant homeodomain (PHD) finger, a homeodomain, peptide repeats and a putative leucine zipper dimerization domain.


Pssm-ID: 276979  Cd Length: 53  Bit Score: 51.28  E-value: 5.34e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039761240 1962 YCQICRKG--DNEELLLLCDG-CDKGCHTYCHRPKITT--IPDGD--WFCPAC 2007
Cdd:cd15504      1 FCAKCQSGeaSPDNDILLCDGgCNRAYHQKCLEPPLLTedIPPEDegWLCPLC 53
PTZ00121 PTZ00121
MAEBL; Provisional
867-1055 6.66e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 58.23  E-value: 6.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQ--------QAIMAAEEKRKQKEQMKiiKQQEKIKRIQQIRMEKE 938
Cdd:PTZ00121  1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKaeedkkkaDELKKAAAAKKKADEAK--KKAEEKKKADEAKKKAE 1441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEK 1018
Cdd:PTZ00121  1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1019 ERLKQEKRDE--KRLNKERKLEQRRL--ELEMAKELKKPKE 1055
Cdd:PTZ00121  1522 KKADEAKKAEeaKKADEAKKAEEKKKadELKKAEELKKAEE 1562
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
867-1049 7.37e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 58.02  E-value: 7.37e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQirmekELRAQQile 946
Cdd:COG1196    216 RELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELEL-----ELEEAQ--- 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  947 akkkkkeeaANAKLLEAEKRTKEKELRRQQAvllKHQELERHRLDMVWERERRRQHVmlmKAMEARKKAEEKERLKQEKR 1026
Cdd:COG1196    288 ---------AEEYELLAELARLEQDIARLEE---RRRELEERLEELEEELAELEEEL---EELEEELEELEEELEEAEEE 352
                          170       180
                   ....*....|....*....|...
gi 1039761240 1027 DEKRLNKERKLEQRRLELEMAKE 1049
Cdd:COG1196    353 LEEAEAELAEAEEALLEAEAELA 375
PHD_PHF21A cd15523
PHD finger found in PHD finger protein 21A (PHF21A); PHF21A (also termed BHC80a or BRAF35-HDAC ...
1962-2007 1.55e-07

PHD finger found in PHD finger protein 21A (PHF21A); PHF21A (also termed BHC80a or BRAF35-HDAC complex protein BHC80) along with HDAC1/2, CtBP1, CoREST, and BRAF35, is associated with LSD1, a lysine (K)-specific histone demethylase. It inhibits LSD1-mediated histone demethylation in vitro. PHF21A is predominantly present in the central nervous system and spermatogenic cells and is one of the six components of BRAF-HDAC complex (BHC) involved in REST-dependent transcriptional repression of neuron-specific genes in non-neuronal cells. It acts as a scaffold protein in BHC in neuronal as well as non-neuronal cells and also plays a role in spermatogenesis. PHF21A contains a C-terminal plant homeodomain (PHD) finger that is responsible for the binding directly to each of five other components of BHC, and of organizing BHC mediating transcriptional repression.


Pssm-ID: 276998 [Multi-domain]  Cd Length: 43  Bit Score: 49.32  E-value: 1.55e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15523      1 FCSVCRKSGE---LLMCDTCSLVYHLDCLDPPLKTIPKGMWICPKC 43
PHD1_MTF2 cd15578
PHD finger 1 found in metal-response element-binding transcription factor 2 (MTF2); MTF2, also ...
1963-2009 1.55e-07

PHD finger 1 found in metal-response element-binding transcription factor 2 (MTF2); MTF2, also termed metal regulatory transcription factor 2, or metal-response element DNA-binding protein M96, or polycomb-like protein 2 (PCL2), complexes with the polycomb repressive complex-2 (PRC2) in embryonic stem cells and regulates the transcriptional networks during embryonic stem cell self-renewal and differentiation. It recruits the PRC2 complex to the inactive X chromosome and target loci in embryonic stem cells. Moreover, MTF2 is required for PRC2-mediated Hox cluster repression. It activates the Cdkn2a gene and promotes cellular senescence, thus suppressing the catalytic activity of PRC2 locally. MTF2 consists of an N-terminal Tudor domain followed by two PHD fingers, and a C-terminal MTF2 domain. This model corresponds to the first PHD finger.


Pssm-ID: 277053  Cd Length: 53  Bit Score: 49.70  E-value: 1.55e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHRPKITTI---PDGDWFCPACIS 2009
Cdd:cd15578      2 CTVCQDGSSESpnEIVLCDKCGQGYHQLCHNPKIDSSvldPDVPWLCRQCVF 53
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
851-1055 2.08e-07

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 55.65  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  851 NVGSAEFLDNTDAKLLRKLQAQEIAR--QAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIK 928
Cdd:COG2268    202 RIAEAEAERETEIAIAQANREAEEAEleQEREIETARIAEAEAELAKKKAEERREAETARAEAEAAYEIAEANAEREVQR 281
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  929 RIQQIRMEKELRAQQIleakkkkkeeaanaklleaekrTKEKELRRQQAVLLKHQELERHRldmVWERERRRQHVMLMKA 1008
Cdd:COG2268    282 QLEIAEREREIELQEK----------------------EAEREEAELEADVRKPAEAEKQA---AEAEAEAEAEAIRAKG 336
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1009 M-EA---RKKAEEKERLKQEKRDEKRLNkerKLEqrrlelEMAKELKKPKE 1055
Cdd:COG2268    337 LaEAegkRALAEAWNKLGDAAILLMLIE---KLP------EIAEAAAKPLE 378
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
870-1062 2.22e-07

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 55.82  E-value: 2.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  870 QAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQmkiiKQQEKIKRIQQIRMEKELRAQQIleakk 949
Cdd:COG3064      1 AQEALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELE----AKRQAEEEAREAKAEAEQRAAEL----- 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  950 kkkEEAANAKLLEAEKRTK--EKELRRQQAVLLKHQELERhrldmvwERERRRQHVMLMKAMEARKKAEEKERLK-QEKR 1026
Cdd:COG3064     72 ---AAEAAKKLAEAEKAAAeaEKKAAAEKAKAAKEAEAAA-------AAEKAAAAAEKEKAEEAKRKAEEEAKRKaEEER 141
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1039761240 1027 DEKRLNKERKLEQRRLELEMAKELKKPKEDMCLADQ 1062
Cdd:COG3064    142 KAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAG 177
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
872-1055 3.04e-07

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 55.90  E-value: 3.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  872 QEIARQAAQIKLLRKLQKQEQARVAkEAKKQQAIMAAEEK--RKQKEQMKIIKQQEKIKRIQQIRMEK---ELRAQQILE 946
Cdd:pfam17380  303 QEKEEKAREVERRRKLEEAEKARQA-EMDRQAAIYAEQERmaMERERELERIRQEERKRELERIRQEEiamEISRMRELE 381
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  947 AkkkkkeeaanaklLEAEKRTKEKELRRQ--QAVLLKHQELERHR--LDMVWERERRRQHVMLMKAMEARKKAEEKER-- 1020
Cdd:pfam17380  382 R-------------LQMERQQKNERVRQEleAARKVKILEEERQRkiQQQKVEMEQIRAEQEEARQREVRRLEEERARem 448
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1039761240 1021 -----LKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKE 1055
Cdd:pfam17380  449 ervrlEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKR 488
CCDC66 pfam15236
Coiled-coil domain-containing protein 66; This protein family, named Coiled-coil ...
972-1052 3.56e-07

Coiled-coil domain-containing protein 66; This protein family, named Coiled-coil domain-containing protein 66 (CCDC) refers to a protein domain found in eukaryotes, and is approximately 160 amino acids in length. CCDC66 protein is detected mainly in the inner segments of photoreceptors in many vertebrates including mice and humans. It has been found in dogs, that a mutation in the CCDC66 gene causes generalized progressive retinal atrophy (gPRA). This shows that the protein encoded for by this gene is vital for healthy vision and guards against photoreceptor cell degeneration. The structure of CCDC66 proteins includes a heptad repeat pattern which contains at least one coiled-coil domain. There are at least two or more alpha-helices which form a cable-like structure.


Pssm-ID: 434558 [Multi-domain]  Cd Length: 154  Bit Score: 51.72  E-value: 3.56e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  972 LRRQQAVLLKHQeLErhrldmvwERERRRQ-----HVMLMKAMEAR--KKAEEKERLKQEK-RDEKRLNKERKLEQRRLE 1043
Cdd:pfam15236   32 LRGQNALLDPAQ-LE--------ERERKRQkalehQNAIKKQLEEKerQKKLEEERRRQEEqEEEERLRREREEEQKQFE 102

                   ....*....
gi 1039761240 1044 LEMAKELKK 1052
Cdd:pfam15236  103 EERRKQKEK 111
Bromo_polybromo_I cd05524
Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which ...
2124-2174 4.08e-07

Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.


Pssm-ID: 99954 [Multi-domain]  Cd Length: 113  Bit Score: 50.41  E-value: 4.08e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 2124 PGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSD 2174
Cdd:cd05524     39 PEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKAYYKPDSP 89
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
884-1055 5.63e-07

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 54.15  E-value: 5.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  884 LRKL-QKQEQARVAKEAKKQQaimaaEEKRKQKEQMKiikQQEKikRIQQIrMEKELraqqileakkkkkeeaanaklLE 962
Cdd:pfam13868    8 LRELnSKLLAAKCNKERDAQI-----AEKKRIKAEEK---EEER--RLDEM-MEEER---------------------ER 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  963 AEKRTKEKELRRQQAVLLKHQEL----ERHRLDMVWERERRRQHVMLMKAM------EARKKAEEKERLKQEKRDE---- 1028
Cdd:pfam13868   56 ALEEEEEKEEERKEERKRYRQELeeqiEEREQKRQEEYEEKLQEREQMDEIveriqeEDQAEAEEKLEKQRQLREEidef 135
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039761240 1029 ---KRLNKERKLEQRRLELEMAKELKKPKE 1055
Cdd:pfam13868  136 neeQAEWKELEKEEEREEDERILEYLKEKA 165
PTZ00121 PTZ00121
MAEBL; Provisional
867-1055 5.68e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.15  E-value: 5.68e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAqikllrKLQKQEQARVAKEAKKQqaimaAEEKRKQKEQMKiiKQQEKIKRIQQIRMEKELR-----A 941
Cdd:PTZ00121  1357 DEAEAAEEKAEAA------EKKKEEAKKKADAAKKK-----AEEKKKADEAKK--KAEEDKKKADELKKAAAAKkkadeA 1423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  942 QQILEAKKKKKEEAAnakllEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEK--- 1018
Cdd:PTZ00121  1424 KKKAEEKKKADEAKK-----KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAkkk 1498
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1039761240 1019 -ERLKQEKRDEKRLNKERKLEQRRL--ELEMAKELKKPKE 1055
Cdd:PTZ00121  1499 aDEAKKAAEAKKKADEAKKAEEAKKadEAKKAEEAKKADE 1538
Bromo_ZMYND11 cd05492
Bromodomain; ZMYND11_like sub-family. ZMYND11 or BS69 is a ubiquitously expressed nuclear ...
2124-2184 6.11e-07

Bromodomain; ZMYND11_like sub-family. ZMYND11 or BS69 is a ubiquitously expressed nuclear protein that has been shown to associate with chromatin. It interacts with chromatin remodeling factors and might play a role in chromatin remodeling and gene expression. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99924  Cd Length: 109  Bit Score: 50.07  E-value: 6.11e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 2124 PGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRK 2184
Cdd:cd05492     37 PKRRRLIHTHLDVADIQEKINSEKYTSLEEFKADALLLLHNTAIFHGADSEQYDAARWLYR 97
PHD1_PHF12 cd15533
PHD finger 1 found in PHD finger protein 12 (PHF12); PHF12, also termed PHD factor 1 (Pf1), is ...
1962-2007 1.32e-06

PHD finger 1 found in PHD finger protein 12 (PHF12); PHF12, also termed PHD factor 1 (Pf1), is a plant homeodomain (PHD) zinc finger-containing protein that bridges the transducin-like enhancer of split (TLE) corepressor to the mSin3A-histone deacetylase (HDAC)-complex, and further represses transcription at targeted genes. PHF12 also interacts with MRG15 (mortality factor-related genes on chromosome 15), a member of the mortality factor (MORF) family of proteins implicated in regulating cellular senescence. PHF12 contains two plant-homeodomain (PHD) zinc fingers followed by a polybasic region. The PHD fingers function downstream of phosphoinositide signaling triggered by the interaction between polybasic regions and phosphoinositides. This model corresponds to the first PHD finger.


Pssm-ID: 277008 [Multi-domain]  Cd Length: 45  Bit Score: 46.97  E-value: 1.32e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKI--TTIPDGDWFCPAC 2007
Cdd:cd15533      1 YCDSCGEGGD---LLCCDRCPASFHLQCCNPPLdeEDLPPGEWLCHRC 45
PHD_TAF3 cd15522
PHD finger found in transcription initiation factor TFIID subunit 3 (TAF3); TAF3 (also termed ...
1962-2007 1.63e-06

PHD finger found in transcription initiation factor TFIID subunit 3 (TAF3); TAF3 (also termed 140 kDa TATA box-binding protein-associated factor, TBP-associated factor 3, transcription initiation factor TFIID 140 kDa subunit (TAF140), or TAFII-140, is an integral component of TFIID) is a general initiation factor (GTF) that plays a key role in preinitiation complex (PIC) assembly through core promoter recognition. The interaction of H3K4me3 with TAF3 directs global TFIID recruitment to active genes, which regulates gene-selective functions of p53 in response to genotoxic stress. TAF3 is highly enriched in embryonic stem cells and is required for endoderm lineage differentiation and prevents premature specification of neuroectoderm and mesoderm. Moreover, TAF3, along with TRF3, forms a complex that is essential for myogenic differentiation. TAF3 contains a plant homeodomain (PHD) finger. This family also includes Drosophila melanogaster BIP2 (Bric-a-brac interacting protein 2) protein, which functions as an interacting partner of D. melanogaster p53 (Dmp53).


Pssm-ID: 276997 [Multi-domain]  Cd Length: 46  Bit Score: 46.51  E-value: 1.63e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15522      1 ICPICKKPDDGSPMIGCDECDDWYHWECVGITDEPPEEDDWFCPKC 46
Bromo_Rsc1_2_II cd05522
Bromodomain, repeat II in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are ...
2124-2186 2.11e-06

Bromodomain, repeat II in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are components of the RSC complex (remodeling the structure of chromatin), are essential for transcriptional control, and have a specific domain architecture including two bromodomains. The RSC complex has also been linked to homologous recombination and nonhomologous end-joining repair of DNA double strand breaks. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99953 [Multi-domain]  Cd Length: 104  Bit Score: 48.01  E-value: 2.11e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039761240 2124 PGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYF 2186
Cdd:cd05522     38 PEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLLEKEA 100
PHD_TIF1alpha cd15622
PHD finger found in transcription intermediary factor 1-alpha (TIF1-alpha); TIF1-alpha, also ...
1962-2007 2.35e-06

PHD finger found in transcription intermediary factor 1-alpha (TIF1-alpha); TIF1-alpha, also termed tripartite motif-containing protein 24 (TRIM24), or E3 ubiquitin-protein ligase TRIM24, or RING finger protein 82, belongs to the TRIM/RBCC protein family. It interacts specifically and in a ligand-dependent manner with the ligand binding domain (LBD) of several nuclear receptors (NRs), including retinoid X (RXR), retinoic acid (RAR), vitamin D3 (VDR), estrogen (ER), and progesterone (PR) receptors. It also associates with heterochromatin-associated factors HP1alpha, MOD1 (HP1beta) and MOD2 (HP1gamma), as well as vertebrate Kruppel-type (C2H2) zinc finger proteins that contain transcriptional silencing domain KRAB. TIF1-alpha is a ligand-dependent co-repressor of retinoic acid receptor (RAR) that interacts with multiple nuclear receptors in vitro via an LXXLL motif, and further acts as a gatekeeper of liver carcinogenesis. It also functions as an E3-ubiquitin ligase targeting p53 and is broadly associated with chromatin silencing. Moreover, it is a chromatin regulator that recognizes specific, combinatorial histone modifications through its C-terminal plant homeodomain (PHD)-Bromodomain (Bromo) region. In addition, it interacts with chromatin and estrogen receptor to activate estrogen-dependent genes associated with cellular proliferation and tumor development. TIF1-alpha contains an N-terminal RBCC (RING finger, B-box zinc-fingers, coiled-coil), a plant homeodomain (PHD) finger, followed by a bromodomain in the C-terminal region.


Pssm-ID: 277092  Cd Length: 43  Bit Score: 46.21  E-value: 2.35e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15622      1 WCAVCQNGGE---LLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFC 43
PTZ00121 PTZ00121
MAEBL; Provisional
819-1026 2.35e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.22  E-value: 2.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  819 RAMDGRRGRPPNPDRPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLLRK------LQAQEIARQAAQIKllrklQKQEQ 892
Cdd:PTZ00121  1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKaeeenkIKAAEEAKKAEEDK-----KKAEE 1679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  893 ARVAKEAKK---QQAIMAAEEKRKQKEQMKiiKQQEKIKRIQQIRMEKELRAQQILEAKKKkkeeaanakllEAEKRTKE 969
Cdd:PTZ00121  1680 AKKAEEDEKkaaEALKKEAEEAKKAEELKK--KEAEEKKKAEELKKAEEENKIKAEEAKKE-----------AEEDKKKA 1746
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761240  970 KELRRQQAVLLKHQELERHRLDMVWERERRRQHVMlmkaMEARKKAEEKERLKQEKR 1026
Cdd:PTZ00121  1747 EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVI----EEELDEEDEKRRMEVDKK 1799
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
863-1063 3.00e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 52.67  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQAQEIARQAAQIKLLRKLQKQEQ--ARVAKEAKKQQAIMAaEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELR 940
Cdd:pfam02463  236 EERIDLLQELLRDEQEEIESSKQEIEKEEEklAQVLKENKEEEKEKK-LQEEELKLLAKEEEELKSELLKLERRKVDDEE 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  941 AQQILEAKKKKKEEAANAKLLEAEKRTKEKEL--RRQQAVLLKHQELERHRLDmvwERERRRQHVMLMKAMEARKKAEEK 1018
Cdd:pfam02463  315 KLKESEKEKKKAEKELKKEKEEIEELEKELKEleIKREAEEEEEEELEKLQEK---LEQLEEELLAKKKLESERLSSAAK 391
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1019 ER-LKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDMCLADQK 1063
Cdd:pfam02463  392 LKeEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEE 437
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
867-1046 3.37e-06

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 51.58  E-value: 3.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIAR-QAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQiL 945
Cdd:pfam15558    7 RKIAALMLARhKEEQRMRELQQQAALAWEELRRRDQKRQETLERERRLLLQQSQEQWQAEKEQRKARLGREERRRADR-R 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  946 EAKKKKKEEAANAKLLEAEKRTKEK-ELRRQQAVLLKHQELERHRldmVWERERR----RQHVMLMKAME-ARKKAEEKE 1019
Cdd:pfam15558   86 EKQVIEKESRWREQAEDQENQRQEKlERARQEAEQRKQCQEQRLK---EKEEELQalreQNSLQLQERLEeACHKRQLKE 162
                          170       180
                   ....*....|....*....|....*..
gi 1039761240 1020 RLKQEKRDEKRLNKERKLEQRRLELEM 1046
Cdd:pfam15558  163 REEQKKVQENNLSELLNHQARKVLVDC 189
Bromo_SNF2 cd05519
Bromodomain, SNF2-like subfamily, specific to fungi. SNF2 is a yeast protein involved in ...
2114-2189 4.10e-06

Bromodomain, SNF2-like subfamily, specific to fungi. SNF2 is a yeast protein involved in transcriptional activation, it is the catalytic component of the SWI/SNF ATP-dependent chromatin remodeling complex. The protein is essential for the regulation of gene expression (both positive and negative) of a large number of genes. The SWI/SNF complex changes chromatin structure by altering DNA-histone contacts within the nucleosome, which results in a re-positioning of the nucleosome and facilitates or represses the binding of gene-specific transcription factors. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99950  Cd Length: 103  Bit Score: 47.33  E-value: 4.10e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039761240 2114 FLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05519     27 FLEKPSKKLYPDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYNQEGSIVYEDAVEMEKAFKKK 102
PHD1_Rco1 cd15535
PHD finger 1 found in Saccharomyces cerevisiae transcriptional regulatory protein Rco1 and ...
1962-2007 4.99e-06

PHD finger 1 found in Saccharomyces cerevisiae transcriptional regulatory protein Rco1 and similar proteins; Rco1 is a component of the Rpd3S histone deacetylase complex that plays an important role at actively transcribed genes. Rco1 contains two plant homeodomain (PHD) fingers, which are required for the methylation of histone H3 lysine 36 (H3K36) nucleosome recognition by Rpd3S. This model corresponds to the first PHD finger.


Pssm-ID: 277010 [Multi-domain]  Cd Length: 45  Bit Score: 45.10  E-value: 4.99e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1962 YCQIC-RKGdneeLLLLCDGCDKGCHTYCHRPKIT--TIPDGDWFCPAC 2007
Cdd:cd15535      1 FCSACgGYG----SFLCCDGCPRSFHFSCLDPPLEedNLPDDEWFCNEC 45
PHD_JADE cd15573
PHD finger found in proteins Jade-1, Jade-2, Jade-3, and similar proteins; This family ...
1963-2007 5.10e-06

PHD finger found in proteins Jade-1, Jade-2, Jade-3, and similar proteins; This family includes proteins Jade-1 (PHF17), Jade-2 (PHF15), and Jade-3 (PHF16), each of which is required for ING4 and ING5 to associate with histone acetyltransferase (HAT) HBO1 and EAF6 to form a HBO1 complex that has a histone H4-specific acetyltransferase activity, a reduced activity toward histone H3, and is responsible for the bulk of histone H4 acetylation in vivo. This family also contains Drosophila melanogaster PHD finger protein rhinoceros (RNO). It is a novel plant homeodomain (PHD)-containing nuclear protein that may function as a transcription factor that antagonizes Ras signaling by regulating transcription of key EGFR/Ras pathway regulators in the Drosophila eye. All Jade proteins contain a canonical Cys4HisCys3 PHD finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, both of which are zinc-binding motifs. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277048 [Multi-domain]  Cd Length: 46  Bit Score: 45.09  E-value: 5.10e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15573      2 CDVCRSPDSEEgnEMVFCDKCNICVHQACY--GIQKIPEGSWLCRTC 46
PHD4_NSD cd15567
PHD finger 4 found in nuclear receptor-binding SET domain-containing (NSD) proteins; The ...
1962-2007 5.48e-06

PHD finger 4 found in nuclear receptor-binding SET domain-containing (NSD) proteins; The nuclear receptor binding SET domain (NSD) protein is a family of three HMTases, NSD1, NSD2/MMSET/WHSC1, and NSD3/WHSC1L1, that are critical in maintaining chromatin integrity. Reducing NSD activity through specific lysine-HMTase inhibitors appears promising to help suppress cancer growth. NSD proteins have specific mono- and dimethylase activities for H3K36, and they play non-redundant roles during development. NSD1 plays a role in several pathologies, including but not limited to Sotos and Weaver syndromes, acute myeloid leukemia, breast cancer, neuroblastoma, and glioblastoma formation. NSD2 is involved in cancer cell proliferation, survival, and tumor growth, by mediating constitutive NF-kappaB signaling via the cytokine autocrine loop. NSD3 is amplified in human breast cancer cell lines. Moreover, translocation resulting in NUP98 fusion to NSD3 leads to development of acute myeloid leukemia. NSD proteins contain a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-proline (PWWP) domains, five plant homeodomain (PHD) fingers, and an NSD-specific Cys-His rich domain (Cys5HisCysHis). This model corresponds to the fourth PHD finger.


Pssm-ID: 277042 [Multi-domain]  Cd Length: 41  Bit Score: 44.93  E-value: 5.48e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15567      1 WCFICSEGGS---LICCESCPASFHPECL--GLEPPPEGKFYCEDC 41
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
876-1049 5.68e-06

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 51.41  E-value: 5.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  876 RQAAQIKllrKLQKQEQARVAKEAKKQQAimaaeEKRKQKEQMKIiKQQEKIKRIQQIRMEKELRAQQileakkkkkeea 955
Cdd:COG2268    192 RKIAEII---RDARIAEAEAERETEIAIA-----QANREAEEAEL-EQEREIETARIAEAEAELAKKK------------ 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  956 anaklLEAEKRTKEKELRRQQAVLLKHQELERhrldmvwERERRRQHVMLMKAMEARKKAEEKERLKQEKRDEKRL--NK 1033
Cdd:COG2268    251 -----AEERREAETARAEAEAAYEIAEANAER-------EVQRQLEIAEREREIELQEKEAEREEAELEADVRKPAeaEK 318
                          170
                   ....*....|....*.
gi 1039761240 1034 ERKLEQRRLELEMAKE 1049
Cdd:COG2268    319 QAAEAEAEAEAEAIRA 334
WSD pfam15613
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1363-1395 5.98e-06

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 45.60  E-value: 5.98e-06
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1039761240 1363 LRSMMFGQDRYRRRYWIL-PQCGGIFVEGMESGE 1395
Cdd:pfam15613    1 IRSLPLGRDRRYNRYWWFdPGTGRLFVESPSDGE 34
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
827-1043 6.32e-06

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 50.61  E-value: 6.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  827 RPPNPDRPRAREESRMKRRKGRPPNvgsaeflDNTDAKLLRKLQAQEIARQAAQikllRKLQKQEQARVAKEAKKQQAI- 905
Cdd:TIGR02794   63 AKKEQERQKKLEQQAEEAEKQRAAE-------QARQKELEQRAAAEKAAKQAEQ----AAKQAEEKQKQAEEAKAKQAAe 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  906 --MAAEEKRKQKEQMKIIKQQE---KIKRIQQIRMEKELRAQQileakkkkkeeaanaklleAEKRTKEKELRRQQAvll 980
Cdd:TIGR02794  132 akAKAEAEAERKAKEEAAKQAEeeaKAKAAAEAKKKAEEAKKK-------------------AEAEAKAKAEAEAKA--- 189
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039761240  981 khqelerhrldmvwERERRRQhvmlmKAMEARKKA--EEKERLKQEKRDEKRLNKERKLEQRRLE 1043
Cdd:TIGR02794  190 --------------KAEEAKA-----KAEAAKAKAaaEAAAKAEAEAAAAAAAEAERKADEAELG 235
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
837-1051 6.42e-06

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 50.81  E-value: 6.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  837 REESRMKRRKGRPpnvgsaefldNTDAKLLRKLQAQEIARQAAQIKLLrkLQKQEQARvakEAKKQQAIMAAEEKRKQKE 916
Cdd:pfam15558   62 QWQAEKEQRKARL----------GREERRRADRREKQVIEKESRWREQ--AEDQENQR---QEKLERARQEAEQRKQCQE 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  917 QMKIIK--------------QQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKH 982
Cdd:pfam15558  127 QRLKEKeeelqalreqnslqLQERLEEACHKRQLKEREEQKKVQENNLSELLNHQARKVLVDCQAKAEELLRRLSLEQSL 206
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240  983 QELERHRLDMVWERERR-RQHVMLMKAM--EARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELK 1051
Cdd:pfam15558  207 QRSQENYEQLVEERHRElREKAQKEEEQfqRAKWRAEEKEEERQEHKEALAELADRKIQQARQVAHKTVQDK 278
CCDC34 pfam13904
Coiled-coil domain-containing protein 3; This family is found in eukaryotes; it has several ...
874-1066 6.45e-06

Coiled-coil domain-containing protein 3; This family is found in eukaryotes; it has several conserved tryptophan residues. The function is not known.


Pssm-ID: 464032 [Multi-domain]  Cd Length: 221  Bit Score: 49.32  E-value: 6.45e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  874 IARQAAQIKLLRKLQKQ-EQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQ-IRMEKELRAQQILEAKKKK 951
Cdd:pfam13904   40 YARKLEGLKLERQPLEAyENWLAAKQRQRQKELQAQKEEREKEEQEAELRKRLAKEKYQEwLQRKARQQTKKREESHKQK 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  952 KEEAANAKLLEAEKRTKEKELRRqqavllKHQElerhrldmvWERERRRQhvmlmkamearkkaEEKERLKQEkrdEKRL 1031
Cdd:pfam13904  120 AAESASKSLAKPERKVSQEEAKE------VLQE---------WERKKLEQ--------------QQRKREEEQ---REQL 167
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1039761240 1032 NKERKLEQRRLELEMA--KELKKpkedmclADQKPLP 1066
Cdd:pfam13904  168 KKEEEEQERKQLAEKAwqKWMKN-------VKNKPKP 197
PHD_TIF1gamma cd15624
PHD finger found in transcriptional intermediary factor 1 gamma (TIF1gamma); TIF1gamma, also ...
1962-2007 7.59e-06

PHD finger found in transcriptional intermediary factor 1 gamma (TIF1gamma); TIF1gamma, also termed tripartite motif-containing 33 (trim33), or ectodermin, or RFG7, or PTC7, is an E3-ubiquitin ligase that functions as a regulator of transforming growth factor beta (TGFbeta) signaling; it inhibits the Smad4-mediated TGFbeta response by interaction with Smad2/3 or ubiquitylation of Smad4. Moreover, TIF1gamma is an important regulator of transcription during hematopoiesis, as well as a key factor of tumorigenesis. Like other TIF1 family members, TIF1gamma also contains an intrinsic transcriptional silencing function. It can control erythroid cell fate by regulating transcription elongation. It can bind to the anaphase-promoting complex/cyclosome (APC/C) and promotes mitosis. TIF1gamma contains an N-terminal RBCC (RING finger, B-box zinc-fingers, coiled-coil), a plant homeodomain (PHD) finger, followed by a bromodomain in the C-terminal region.


Pssm-ID: 277094  Cd Length: 46  Bit Score: 44.65  E-value: 7.59e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15624      1 WCAVCQNGGD---LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 43
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
859-1036 8.85e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 49.15  E-value: 8.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  859 DNTDAKLLRKLQA--QEIARQAAQIK-LLRKLQKQEQARVAKEA---KKQQAIMAAEEKRKQKEQmKIIKQQEKIKRIQ- 931
Cdd:COG1579      2 MPEDLRALLDLQEldSELDRLEHRLKeLPAELAELEDELAALEArleAAKTELEDLEKEIKRLEL-EIEEVEARIKKYEe 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  932 ---QIRMEKELRA--QQIleakkkkkeeaanaKLLEAEKRTKEKELRRqqaVLLKHQELERHRLDMVWERERRRQHVmlm 1006
Cdd:COG1579     81 qlgNVRNNKEYEAlqKEI--------------ESLKRRISDLEDEILE---LMERIEELEEELAELEAELAELEAEL--- 140
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039761240 1007 kameARKKAEEKERLKQEKRDEKRLNKERK 1036
Cdd:COG1579    141 ----EEKKAELDEELAELEAELEELEAERE 166
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
871-1063 9.58e-06

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 50.19  E-value: 9.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  871 AQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKR-KQKEQMKIIKQQEKIKRIQQIRMEKELRAQqileakk 949
Cdd:PRK09510    61 VEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERlKQLEKERLAAQEQKKQAEEAAKQAALKQKQ------- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  950 kkkEEAANAKLLEAEKRTKEKELRRQQAVLLKHQElerhrldmvwERERRRQHVMLMKA-MEARKKAEEKERLKQEKRDE 1028
Cdd:PRK09510   134 ---AEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA----------EAKKKAEAEAAKKAaAEAKKKAEAEAAAKAAAEAK 200
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1039761240 1029 KRLNKERKLEQRRLELEMAKELKKPKEDMCLADQK 1063
Cdd:PRK09510   201 KKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAK 235
Caldesmon pfam02029
Caldesmon;
828-1064 1.08e-05

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 50.64  E-value: 1.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  828 PPNPDRPRAREESRMKRRKGRPPNVGSAEFLDNTDAKLlrKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMA 907
Cdd:pfam02029   91 PTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRY--KEEETEIREKEYQENKWSTEVRQAEEEGEEEEDKSEEAEE 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  908 AEEKRKQKEQMKIIKQQEKIKR-------IQQIRMEKELRAQQ---------------ILEAKKKKKEEAANAKLLEAEK 965
Cdd:pfam02029  169 VPTENFAKEEVKDEKIKKEKKVkyeskvfLDQKRGHPEVKSQNgeeevtklkvttkrrQGGLSQSQEREEEAEVFLEAEQ 248
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  966 RTKEKELRRQQavlLKHQELE--RHRldmvwERERRRQHVMLMKAMEARKKA-EEKERLKQEKRDEKRLNKE---RKL-- 1037
Cdd:pfam02029  249 KLEELRRRRQE---KESEEFEklRQK-----QQEAELELEELKKKREERRKLlEEEEQRRKQEEAERKLREEeekRRMke 320
                          250       260
                   ....*....|....*....|....*....
gi 1039761240 1038 --EQRRleLEMAKELKKPKEDMCLADQKP 1064
Cdd:pfam02029  321 eiERRR--AEAAEKRQKLPEDSSSEGKKP 347
PHD1_BPTF cd15559
PHD finger 1 found in bromodomain and PHD finger-containing transcription factor (BPTF); BPTF, ...
1962-2007 1.09e-05

PHD finger 1 found in bromodomain and PHD finger-containing transcription factor (BPTF); BPTF, also termed nucleosome-remodeling factor subunit BPTF, or fetal Alz-50 clone 1 protein (FAC1), or fetal Alzheimer antigen, functions as a transcriptional regulator that exhibits altered expression and subcellular localization during neuronal development and neurodegenerative diseases such as Alzheimer's disease. It interacts with the human orthologue of the Kelch-like Ech-associated protein (Keap1). Its function and subcellular localization can be regulated by Keap1. Moreover, BPTF is a novel DNA-binding protein that recognizes the DNA sequence CACAACAC and represses transcription through this site in a phosphorylation-dependent manner. Furthermore, BPTF interacts with the Myc-associated zinc finger protein (ZF87/MAZ) and alters its transcriptional activity, which has been implicated in gene regulation in neurodegeneration. Some family members contain two or three plant homeodomain (PHD) fingers, which may be involved in complex formation with histone H3 trimethylated at K4 (H3K4me3). This family corresponds to the first PHD finger.


Pssm-ID: 277034 [Multi-domain]  Cd Length: 43  Bit Score: 44.33  E-value: 1.09e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1962 YCQICRK-GDneelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15559      1 HCRVCHKlGD----LLCCETCSAVYHLECVDPPLEEVPEEDWQCEVC 43
PTZ00121 PTZ00121
MAEBL; Provisional
890-1070 1.60e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.52  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  890 QEQARVAKEAKKQQAiMAAEEKRKQKEQMKiikQQEKIKRIQQIRMEKELRaqqileakkkkkeeaanakllEAEKRTKE 969
Cdd:PTZ00121  1094 EEAFGKAEEAKKTET-GKAEEARKAEEAKK---KAEDARKAEEARKAEDAR---------------------KAEEARKA 1148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  970 KELRRqqaVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARK-----KAEEKERLKQEKR--DEKRLNKERKLEQRRL 1042
Cdd:PTZ00121  1149 EDAKR---VEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKaeelrKAEDARKAEAARKaeEERKAEEARKAEDAKK 1225
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039761240 1043 --ELEMAKELKKPKEDMCLADQKPLPEWPR 1070
Cdd:PTZ00121  1226 aeAVKKAEEAKKDAEEAKKAEEERNNEEIR 1255
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
862-1055 1.60e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 50.15  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  862 DAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQmkiiKQQEKIKRIQQIRMEKELRA 941
Cdd:COG4717     54 EADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEA----ELEELREELEKLEKLLQLLP 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  942 QQILEAKKKKKEEAANAKLLEAEKRTKE-KELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKER 1020
Cdd:COG4717    130 LYQELEALEAELAELPERLEELEERLEElRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLA 209
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1039761240 1021 LKQEKRDEKRLNKERkLEQRRLELEMAKELKKPKE 1055
Cdd:COG4717    210 ELEEELEEAQEELEE-LEEELEQLENELEAAALEE 243
PHD_JADE3 cd15681
PHD finger found in protein Jade-3 and similar proteins; Jade-3, also termed PHD finger ...
1963-2008 1.67e-05

PHD finger found in protein Jade-3 and similar proteins; Jade-3, also termed PHD finger protein 16 (PHF16), is a plant homeodomain (PHD) zinc finger protein that is closely related to Jade-1, which functions as an essential regulator of multiple cell signaling pathways. Like Jade-1, Jade-3 is required for ING4 and ING5 to associate with histone acetyl transferase (HAT) HBO1 and Eaf6 to form a HBO1 complex that has a histone H4-specific acetyltransferase activity, a reduced activity toward histone H3, and is responsible for the bulk of histone H4 acetylation in vivo. Jade-3 contains a canonical Cys4HisCys3 PHD domain followed by a non-canonical extended PHD (ePHD) domain, Cys2HisCys5HisCys2His, both of which are zinc-binding motifs. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277151 [Multi-domain]  Cd Length: 50  Bit Score: 43.81  E-value: 1.67e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACI 2008
Cdd:cd15681      2 CDVCRSPDSEEgnDMVFCDKCNICVHQACY--GILKVPEGSWLCRTCV 47
PHD_SPP1 cd16039
PHD finger found in Set1 complex component SPP1; Set1C component SPP1, also called COMPASS ...
1962-2007 1.78e-05

PHD finger found in Set1 complex component SPP1; Set1C component SPP1, also called COMPASS component Spp1, or Complex proteins associated with set1 protein Spp1, or Suppressor of PRP protein 1, is a component of the COMPASS complex that links histone methylation to initiation of meiotic recombination. It induces double-strand break (DSB) formation by tethering to recombinationally cold regions. SPP1 interacts with H3K4me3 and Mer2, a protein required for DSB formation, to promote recruitment of potential meiotic DSB sites to the chromosomal axis. SPP1 contains a PHD finger, a zinc binding motif.


Pssm-ID: 277186  Cd Length: 46  Bit Score: 43.62  E-value: 1.78e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1962 YCqICRKGDNEELLLLCDGCDKGCHTYChrpkiTTIPDGD------WFCPAC 2007
Cdd:cd16039      1 YC-ICQKPDDGRWMIACDGCDEWYHFTC-----VNIPEADvelvdsFFCPPC 46
PHD_TIF1beta cd15623
PHD finger found in transcription intermediary factor 1-beta (TIF1-beta); TIF1-beta, also ...
1963-2007 1.89e-05

PHD finger found in transcription intermediary factor 1-beta (TIF1-beta); TIF1-beta, also termed Kruppel-associated Box (KRAB)-associated protein 1 (KAP-1), or KRAB-interacting protein 1 (KRIP-1), or nuclear co-repressor KAP-1, or RING finger protein 96, or tripartite motif-containing protein 28 (TRIM28), or E3 SUMO-protein ligase TRIM28, acts as a nuclear co-repressor that plays a role in transcription and in DNA damage response. Upon DNA damage, the phosphorylation of KAP-1 on serine 824 by the ataxia telangiectasia-mutated (ATM) kinase enhances cell survival and facilitates chromatin relaxation and heterochromatic DNA repair. It also regulates CHD3 nucleosome remodeling during DNA double-strand break (DSB) response. Meanwhile, KAP-1 can be dephosphorylated by protein phosphatase PP4C in the DNA damage response. In addition, KAP-1 is a co-activator of the orphan nuclear receptor NGFI-B (or Nur77) and is involved in NGFI-B-dependent transcription. It is also a coiled-coil binding partner, substrate and activator of the c-Fes protein tyrosine kinase. TIF1-beta contains an N-terminal RBCC (RING finger, B-box zinc-fingers, coiled-coil), which can interact with KRAB zinc finger proteins (KRAB-ZFPs), MDM2, MM1, C/EBPbeta, and mediates homo- and heterodimerization, a plant homeodomain (PHD) finger followed by a bromodomain in the C-terminal region, which interact with SETDB1, Mi-2alpha and other proteins to form complexes with histone deacetylase or methyltransferase activity.


Pssm-ID: 277093  Cd Length: 43  Bit Score: 43.64  E-value: 1.89e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1963 CQICRKgdnEELLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15623      2 CRVCQK---AGALVMCDQCEFCFHLDCHLPALQEVPGEDWKCLLC 43
PHD_TIF1delta cd15625
PHD finger found in transcriptional intermediary factor 1 delta (TIF1delta); TIF1delta, also ...
1962-2007 1.89e-05

PHD finger found in transcriptional intermediary factor 1 delta (TIF1delta); TIF1delta, also termed tripartite motif-containing protein 66 (TRIM66), is a novel heterochromatin protein 1 (HP1)-interacting member of the transcriptional intermediary factor1 (TIF1) family expressed by elongating spermatids. Like other TIF1 proteins, TIF1delta displays a potent trichostatin A (TSA)-sensitive repression function; TSA is a specific inhibitor of histone deacetylases. Moreover, TIF1delta plays an important role in heterochromatin-mediated gene silencing during postmeiotic phases of spermatogenesis. It functions as a negative regulator of postmeiotic genes acting through HP1 isotype gamma (HP1gamma) complex formation and centromere association. TIF1delta contains an N-terminal RBCC (RING finger, B-box zinc-fingers, coiled-coil), a plant homeodomain (PHD) finger, followed by a bromodomain in the C-terminal region.


Pssm-ID: 277095 [Multi-domain]  Cd Length: 49  Bit Score: 43.79  E-value: 1.89e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15625      4 FCAVCLNGGE---LLCCDRCPKVFHLSCHVPALLSFPVGEWVCTLC 46
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
870-1048 1.97e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.38  E-value: 1.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  870 QAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAImaaEEKRKQKEQmKIIKQQEKIKRIQQ--IRMEKELRAqqilea 947
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKAL---LKQLAALER-RIAALARRIRALEQelAALEAELAE------ 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  948 kkkkkeeaanaklLEAEKRTKEKELRRQQAVLLKH----QELERHRLDMV------WERERRRQHVM---------LMKA 1008
Cdd:COG4942     88 -------------LEKEIAELRAELEAQKEELAELlralYRLGRQPPLALllspedFLDAVRRLQYLkylaparreQAEE 154
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039761240 1009 MEARKK--AEEKERLKQEKRDEKRLNKERKLEQRRLELEMAK 1048
Cdd:COG4942    155 LRADLAelAALRAELEAERAELEALLAELEEERAALEALKAE 196
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
863-1052 2.06e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.38  E-value: 2.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQAQ--EIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRI--QQIRMEKE 938
Cdd:COG4942     19 ADAAAEAEAEleQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELekEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQ-----ILEAKKKKKEEAANAKLLEAEkrTKEKELRRQQAVllkhQELERHRLDMVWERERRRQHVMLMKAmEARK 1013
Cdd:COG4942     99 LEAQKeelaeLLRALYRLGRQPPLALLLSPE--DFLDAVRRLQYL----KYLAPARREQAEELRADLAELAALRA-ELEA 171
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1039761240 1014 KAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELKK 1052
Cdd:COG4942    172 ERAELEALLAELEEERAALEALKAERQKLLARLEKELAE 210
Bromodomain_1 cd05494
Bromodomain; uncharacterized subfamily. Bromodomains are found in many chromatin-associated ...
2094-2141 2.24e-05

Bromodomain; uncharacterized subfamily. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.


Pssm-ID: 99926 [Multi-domain]  Cd Length: 114  Bit Score: 45.51  E-value: 2.24e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039761240 2094 DLALCSMILTEMETH---EDSWPFLLPVN--LKLVPGYKKVIKKPMDFSTIRE 2141
Cdd:cd05494      1 DYEALERVLRELKRHrrnEDAWPFLEPVNppRRGAPDYRDVIKRPMSFGTKVN 53
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
863-1078 2.75e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.99  E-value: 2.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQaQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEK----IKRIQQIRMEKE 938
Cdd:COG4942     68 ARRIRALE-QELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDfldaVRRLQYLKYLAP 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAvllKHQELERhrldmvwERERRRQHVMLMKAMEARKKAEEK 1018
Cdd:COG4942    147 ARREQAEELRADLAELAALRAELEAERAELEALLAELEE---ERAALEA-------LKAERQKLLARLEKELAELAAELA 216
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240 1019 ERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDMcladqkplpEWPrIPGLVLSG 1078
Cdd:COG4942    217 ELQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGKL---------PWP-VSGRVVRR 266
WSD pfam15613
Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined ...
1706-1745 3.04e-05

Williams-Beuren syndrome DDT (WSD), D-TOX E motif; This family represents the combined alpha-helical module found in diverse eukaryotic chromatin proteins. Based on the Ioc3 structure, the N-terminus of this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The acidic residue from the GxD signature at the N-terminus is a major determinant of the interaction between the ISWI and WHIM motifs. The N-terminal portion also contacts the inter-nucleosomal linker DNA. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognizing and DNA binding domains, some of which discriminate methylated DNA. The WSD module constitutes the inter-nucleosomal linker DNA binding site in the major groove of DNA, and was first identified as WSD, the D-TOX E motif of plant homeodomains homologous with the mutant transcription factor causing Williams-Beuren syndrome in association with the DDT-domain.


Pssm-ID: 464775 [Multi-domain]  Cd Length: 69  Bit Score: 43.68  E-value: 3.04e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1706 EEMQFGWWRIID-PEDLKTLLKVLHLRGIREKALQKQIQKH 1745
Cdd:pfam15613   28 ESPSDGEWGVYSsKEQLDALIASLNPRGVRESALKEALEKI 68
Bromo_RACK7 cd05508
Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) ...
2113-2173 3.45e-05

Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99940  Cd Length: 99  Bit Score: 44.68  E-value: 3.45e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 2113 PFLLPVNLKLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDS 2173
Cdd:cd05508     22 PFLKPVDLEQFPDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNAIIYNGGDH 82
Bromo_polybromo_II cd05517
Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which ...
2121-2189 3.59e-05

Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.


Pssm-ID: 99948  Cd Length: 103  Bit Score: 44.74  E-value: 3.59e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240 2121 KLVPGYKKVIKKPMDFSTIREKLNNGQYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05517     34 VLYPDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANAIKKIFTAK 102
PHD_PRKCBP1 cd15538
PHD finger found in protein kinase C-binding protein 1 (PRKCBP1); PRKCBP1, also termed ...
1962-2007 3.73e-05

PHD finger found in protein kinase C-binding protein 1 (PRKCBP1); PRKCBP1, also termed cutaneous T-cell lymphoma-associated antigen se14-3 (CTCL-associated antigen se14-3), or Rack7, or zinc finger MYND domain-containing protein 8 (ZMYND8), is a novel receptor for activated C-kinase (RACK)-like protein that may play an important role in the activation and regulation of PKC-beta I, and the PKC signaling cascade. It also has been identified as a formin homology-2-domain containing protein 1 (FHOD1)-binding protein that may be involved in FHOD1-regulated actin polymerization and transcription. Moreover, PRKCBP1 may function as a REST co-repressor 2 (RCOR2) interacting factor; the RCOR2/ZMYND8 complex which might be involved in the regulation of neural differentiation. PRKCBP1 contains a plant homeodomain (PHD) finger, a bromodomain, and a proline-tryptophan-tryptophan-proline (PWWP) domain.


Pssm-ID: 277013  Cd Length: 41  Bit Score: 42.70  E-value: 3.73e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITtiPDGDWFCPAC 2007
Cdd:cd15538      1 FCWRCHKEGQ---VLCCSLCPRVYHKKCLKLTSE--PDEDWVCPEC 41
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
901-1056 3.79e-05

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 48.49  E-value: 3.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  901 KQQAIMAAEEKRKQKEQ-MKIIKQQ-----EKIKRIQQIRMEKELRAQQILEAKKKKKEeaanakllEAEKRTKEKELRR 974
Cdd:pfam15558    5 RDRKIAALMLARHKEEQrMRELQQQaalawEELRRRDQKRQETLERERRLLLQQSQEQW--------QAEKEQRKARLGR 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  975 QQAVLLKHQELERHRLDMVW-----ERERRRQHVMLMKAMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMA-- 1047
Cdd:pfam15558   77 EERRRADRREKQVIEKESRWreqaeDQENQRQEKLERARQEAEQRKQCQEQRLKEKEEELQALREQNSLQLQERLEEAch 156

                   ....*....
gi 1039761240 1048 KELKKPKED 1056
Cdd:pfam15558  157 KRQLKEREE 165
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
963-1055 4.15e-05

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 45.80  E-value: 4.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  963 AEKR-----TKEKELRRQQAVllkhQELERHRLDmvwERERRRQHVMLMKAMEARKKAEEKerlKQEKRDEKRLNKERKL 1037
Cdd:pfam05672   17 AEKRrqareQREREEQERLEK----EEEERLRKE---ELRRRAEEERARREEEARRLEEER---RREEEERQRKAEEEAE 86
                           90
                   ....*....|....*...
gi 1039761240 1038 EQRRLELEMAKELKKPKE 1055
Cdd:pfam05672   87 EREQREQEEQERLQKQKE 104
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
856-1043 4.76e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 48.81  E-value: 4.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  856 EFLDNTDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQ----------QE 925
Cdd:pfam02463  814 AELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEqklkdeleskEE 893
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  926 KIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLK----HQELERHRLDMVwerERRRQ 1001
Cdd:pfam02463  894 KEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEkeenNKEEEEERNKRL---LLAKE 970
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1002 HVMLMKAM---------EARKKAEEKERLKQEKRDEKRLNKERKLEQRRLE 1043
Cdd:pfam02463  971 ELGKVNLMaieefeekeERYNKDELEKERLEEEKKKLIRAIIEETCQRLKE 1021
PHD_PHF21B cd15524
PHD finger found in PHD finger protein 21B (PHF21B); PHF21B is a plant homeodomain (PHD) ...
1962-2007 6.43e-05

PHD finger found in PHD finger protein 21B (PHF21B); PHF21B is a plant homeodomain (PHD) finger-containing protein whose biological function remains unclear. It shows high sequence similarity with PHF21A, which is associated with LSD1, a lysine (K)-specific histone demethylase and inhibits LSD1-mediated histone demethylation in vitro. PHD fingers can recognize the unmodified and modified histone H3 tail, and some have been found to interact with non-histone proteins.


Pssm-ID: 276999 [Multi-domain]  Cd Length: 43  Bit Score: 42.19  E-value: 6.43e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGDNeelLLLCDGCDKGCHTYCHRPKITTIPDGDWFCPAC 2007
Cdd:cd15524      1 HCAACKRGGN---LQPCGTCPRAYHLDCLDPPLKTAPKGVWVCPKC 43
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
869-1049 6.90e-05

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 47.72  E-value: 6.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  869 LQAQEIARQAAQIKLLR--KLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQ--------MKIIKQQEKIKRIQQIRmEKE 938
Cdd:pfam15558  145 LQLQERLEEACHKRQLKerEEQKKVQENNLSELLNHQARKVLVDCQAKAEEllrrlsleQSLQRSQENYEQLVEER-HRE 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRA------QQILEAKkkkkeeaanaklLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEAR 1012
Cdd:pfam15558  224 LREkaqkeeEQFQRAK------------WRAEEKEEERQEHKEALAELADRKIQQARQVAHKTVQDKAQRARELNLEREK 291
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1039761240 1013 ------KKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKE 1049
Cdd:pfam15558  292 nhhilkLKVEKEEKCHREGIKEAIKKKEQRSEQISREKEATLE 334
PHD_BRPF cd15572
PHD finger found in bromodomain and PHD finger-containing (BRPF) proteins; The family of BRPF ...
1963-2012 9.50e-05

PHD finger found in bromodomain and PHD finger-containing (BRPF) proteins; The family of BRPF proteins includes BRPF1, BRD1/BRPF2, and BRPF3. They are scaffold proteins that form monocytic leukemic zinc-finger protein (MOZ)/MOZ-related factor (MORF) H3 histone acetyltransferase (HAT) complexes with other regulatory subunits, such as inhibitor of growth 5 (ING5) and Esa1-associated factor 6 ortholog (EAF6). BRPF proteins have multiple domains, including a canonical Cys4HisCys3 plant homeodomain (PHD) zinc finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, a bromodomain and a proline-tryptophan-tryptophan-proline (PWWP) domain. PHD and ePHD fingers both bind to lysine 4 of histone H3 (K4H3), bromodomains interact with acetylated lysines on N-terminal tails of histones and other proteins, and PWWP domains show histone-binding and chromatin association properties. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277047 [Multi-domain]  Cd Length: 54  Bit Score: 41.83  E-value: 9.50e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1963 CQICRKGD--NEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACISKAS 2012
Cdd:cd15572      4 CCICLDGEcqNSNVILFCDMCNLAVHQECY--GVPYIPEGQWLCRRCLQSPS 53
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
855-1043 1.12e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 47.11  E-value: 1.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  855 AEFLDNTDAKLLRKLQAQEIARQAAQIKllrKLQKQEQARVAKEAKKQqaimAAEEKRKQKEQMKIiKQQEKIKRIQQIR 934
Cdd:PRK09510    89 AEELQQKQAAEQERLKQLEKERLAAQEQ---KKQAEEAAKQAALKQKQ----AEEAAAKAAAAAKA-KAEAEAKRAAAAA 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  935 MEKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELerhrldmvwERERRRQHVMLMKAMEARKK 1014
Cdd:PRK09510   161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKA---------AAEAKKKAAAEAKAAAAKAA 231
                          170       180
                   ....*....|....*....|....*....
gi 1039761240 1015 AEEKERLKQEKRDEKRLNKERKLEQRRLE 1043
Cdd:PRK09510   232 AEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
PHD_BRPF2 cd15677
PHD finger found in bromodomain and PHD finger-containing protein 2 (BRPF2) and similar ...
1963-2012 1.18e-04

PHD finger found in bromodomain and PHD finger-containing protein 2 (BRPF2) and similar proteins; BRPF2, also termed bromodomain-containing protein 1 (BRD1), or BR140-like protein, is encoded by BRL (BR140 Like gene). It is responsible for the bulk of the acetylation of H3K14 and forms a novel monocytic leukemic zinc-finger protein (MOZ)/MOZ-related factor (MORF) H3 histone acetyltransferase (HAT) complex with HBO1 and ING4. The complex is required for full transcriptional activation of the erythroid-specific regulator genes essential for terminal differentiation and survival of erythroblasts in fetal liver. BRPF2 shows widespread expression and localizes to the nucleus within spermatocytes. It contains a cysteine rich region harboring a canonical Cys4HisCys3 plant homeodomain (PHD) finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, a bromodomain, and a proline-tryptophan-tryptophan-proline (PWWP) domain. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277147 [Multi-domain]  Cd Length: 54  Bit Score: 41.54  E-value: 1.18e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1963 CQICRKGD--NEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACISKAS 2012
Cdd:cd15677      4 CCICMDGEcqNSNVILFCDMCNLAVHQECY--GVPYIPEGQWLCRHCLQSRS 53
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
910-1057 1.39e-04

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 43.37  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  910 EKRKQKEQMKIIKQQEKIKriqqiRMEKELRaqqileakkkkkeeaanakllEAEKRTKEKELRRQQAvllkhqELERHR 989
Cdd:pfam20492    5 EREKQELEERLKQYEEETK-----KAQEELE---------------------ESEETAEELEEERRQA------EEEAER 52
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240  990 LdmvwERERRRQHVMLMK-AMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAK---ELKKPKEDM 1057
Cdd:pfam20492   53 L----EQKRQEAEEEKERlEESAEMEAEEKEQLEAELAEAQEEIARLEEEVERKEEEARRlqeELEEAREEE 120
PHD_TCF19_like cd15517
PHD finger found in Transcription factor 19 (TCF-19), Lysine-specific demethylase KDM5A and ...
1963-2007 1.49e-04

PHD finger found in Transcription factor 19 (TCF-19), Lysine-specific demethylase KDM5A and KDM5B, and other similar proteins; TCF-19 was identified as a putative trans-activating factor with expression beginning at the late G1-S boundary in dividing cells. It functions as a novel islet factor necessary for proliferation and survival in the INS-1 beta cell line. It plays an important role in susceptibility to both Type 1 Diabetes Mellitus (T1DM) and Type 2 Diabetes Mellitus (T2DM); it has been suggested that it may positively impact beta cell mass under conditions of beta cell stress and increased insulin demand. KDM5A was originally identified as a retinoblastoma protein (Rb)-binding partner and its inactivation may be important for Rb to promote differentiation. It is involved in transcription through interaction with TBP, p107, nuclear receptors, Myc, Sin3/HDAC, Mad1, RBP-J, CLOCK, and BMAL1. KDM5B has a restricted expression pattern in the testis, ovary, and transiently in the mammary gland of the pregnant female and has been shown to be upregulated in breast cancer, prostate cancer, and lung cancer, suggesting a potential role in tumorigenesis. Both KDM5A and KDM5B function as trimethylated histone H3 lysine 4 (H3K4me3) demethylases. This family also includes Caenorhabditis elegans Lysine-specific demethylase 7 homolog (ceKDM7A). ceKDM7A (also termed JmjC domain-containing protein 1.2, PHD finger protein 8 homolog, or PHF8 homolog) is a plant homeodomain (PHD)- and JmjC domain-containing protein that functions as a histone demethylase specific for H3K9me2 and H3K27me2. The binding of the PHD finger to H3K4me3 guides H3K9me2- and H3K27me2-specific demethylation by its catalytic JmjC domain in a trans-histone regulation mechanism. In addition, this family includes plant protein OBERON 1 and OBERON 2, Alfin1-like (AL) proteins, histone acetyltransferases (HATs) HAC, and AT-rich interactive domain-containing protein 4 (ARID4).


Pssm-ID: 276992 [Multi-domain]  Cd Length: 49  Bit Score: 41.38  E-value: 1.49e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1963 CQICRK--GDNEELLLLCDGCDKGCHTYCHRPKITTIPDGD-WFCPAC 2007
Cdd:cd15517      2 CGICNLetAAVDELWVQCDGCDKWFHQFCLGLSNERYADEDkFKCPNC 49
PRK12704 PRK12704
phosphodiesterase; Provisional
863-1045 1.59e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 46.69  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQAQEIARQAaqikllrklqKQEQARVAKEAKKQqaimaAEEKRKQKeqmkIIKQQEKIKRiqqirmekeLRAQ 942
Cdd:PRK12704    21 GYFVRKKIAEAKIKEA----------EEEAKRILEEAKKE-----AEAIKKEA----LLEAKEEIHK---------LRNE 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  943 qileakkkkkeeaanaklLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRrqhvmlmkaMEARKKAEEKerlK 1022
Cdd:PRK12704    73 ------------------FEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE---------LEKKEKELEQ---K 122
                          170       180
                   ....*....|....*....|...
gi 1039761240 1023 QEKRDEKRLNKERKLEQRRLELE 1045
Cdd:PRK12704   123 QQELEKKEEELEELIEEQLQELE 145
PHD_BRPF1 cd15676
PHD finger found in bromodomain and PHD finger-containing protein 1 (BRPF1) and similar ...
1963-2012 1.89e-04

PHD finger found in bromodomain and PHD finger-containing protein 1 (BRPF1) and similar proteins; BRPF1, also termed peregrin or protein Br140, is a multi-domain protein that binds histones, mediates monocytic leukemic zinc-finger protein (MOZ)-dependent histone acetylation, and is required for Hox gene expression and segmental identity. It is a close partner of the MOZ histone acetyltransferase (HAT) complex and a novel Trithorax group (TrxG) member with a central role during development. BRPF1 is primarily a nuclear protein that has a broad tissue distribution and is abundant in testes and spermatogonia. It contains a canonical Cys4HisCys3 plant homeodomain (PHD) zinc finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, a bromodomain and a proline-tryptophan-tryptophan-proline (PWWP) domain. PHD and ePHD fingers both bind to lysine 4 of histone H3 (K4H3), bromodomains interact with acetylated lysines on N-terminal tails of histones and other proteins, and PWWP domains show histone-binding and chromatin association properties. BRPF1 may be involved in chromatin remodeling. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277146 [Multi-domain]  Cd Length: 62  Bit Score: 41.58  E-value: 1.89e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1963 CQICRKGD--NEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACISKAS 2012
Cdd:cd15676     10 CCICNDGEcqNSNVILFCDMCNLAVHQECY--GVPYIPEGQWLCRRCLQSPS 59
BAH_plant_2 cd04718
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH ...
1984-2016 1.89e-04

BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240069  Cd Length: 148  Bit Score: 43.73  E-value: 1.89e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1039761240 1984 GCHTYCHRPKITTIPDGDWFCPACISKASGQSI 2016
Cdd:cd04718      1 GFHLCCLRPPLKEVPEGDWICPFCEVEKSGQSA 33
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
863-1057 2.03e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.59  E-value: 2.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQ--AQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAI-MAAEEKRKQKEQMK----IIKQQEKIKRIQQIRM 935
Cdd:TIGR02168  326 EELESKLDelAEELAELEEKLEELKEELESLEAELEELEAELEELeSRLEELEEQLETLRskvaQLELQIASLNNEIERL 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  936 EKELRAQQileakkkKKEEAANAKLLEAEKRTKEKELRRQQAVLlkhQELERHRLDMVWERERRRQHvmLMKAMEARKKA 1015
Cdd:TIGR02168  406 EARLERLE-------DRRERLQQEIEELLKKLEEAELKELQAEL---EELEEELEELQEELERLEEA--LEELREELEEA 473
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1016 EEKerLKQEKRDEKRLNKERKLEQRRLEL-----EMAKELKKPKEDM 1057
Cdd:TIGR02168  474 EQA--LDAAERELAQLQARLDSLERLQENlegfsEGVKALLKNQSGL 518
CAF-1_p150 pfam11600
Chromatin assembly factor 1 complex p150 subunit, N-terminal; CAF-1_p150 is a polypeptide ...
902-1054 3.28e-04

Chromatin assembly factor 1 complex p150 subunit, N-terminal; CAF-1_p150 is a polypeptide subunit of CAF-1, which functions in depositing newly synthesized and acetylated histones H3/H4 into chromatin during DNA replication and repair. CAF-1_p150 includes the HP1 interaction site, the PEST, KER and ED interacting sites. CAF-1_p150 interacts directly with newly synthesized and acetylated histones through the acidic KER and ED domains. The PEST domain is associated with proteins that undergo rapid proteolysis.


Pssm-ID: 402959 [Multi-domain]  Cd Length: 164  Bit Score: 43.52  E-value: 3.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  902 QQAIMAAEEKRKQKEQmKIIKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKllEAEKRTKEKELRRQQAvllK 981
Cdd:pfam11600    4 QKSVQSQEEKEKQRLE-KDKERLRRQLKLEAEKEEKERLKEEAKAEKERAKEEARRKK--EEEKELKEKERREKKE---K 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039761240  982 HQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKeRLKQEkrdEKRLNKERKleqrrlelEMAKELKKPK 1054
Cdd:pfam11600   78 DEKEKAEKLRLKEEKRKEKQEALEAKLEEKRKKEEEK-RLKEE---EKRIKAEKA--------EITRFLQKPK 138
Activator_LAG-3 pfam11498
Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of ...
872-1004 3.94e-04

Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Its N-terminal region adopts an elongated kinked helix that is required for complex assembly.


Pssm-ID: 151935 [Multi-domain]  Cd Length: 476  Bit Score: 45.34  E-value: 3.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  872 QEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKE---QMKIIKQQEKIKRIQQirmekelrAQQILEAK 948
Cdd:pfam11498  320 QHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEhqqQQMLLQQQQQMHQLQQ--------HHQMNGGG 391
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039761240  949 KKKKEEAANAKLLEAEKRTKEKELRRQQavllkHQELERHRLDMVWERERRRQHVM 1004
Cdd:pfam11498  392 QFATQAHQHAAYLQQMQHMRLQEQIQHQ-----QQQAQHHQQAQQQHQQPAQHGQM 442
Bromo_Rsc1_2_I cd05521
Bromodomain, repeat I in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are ...
2124-2189 4.43e-04

Bromodomain, repeat I in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are components of the RSC complex (remodeling the structure of chromatin), are essential for transcriptional control, and have a specific domain architecture including two bromodomains. The RSC complex has also been linked to homologous recombination and nonhomologous end-joining repair of DNA double strand breaks. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.


Pssm-ID: 99952  Cd Length: 106  Bit Score: 41.54  E-value: 4.43e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039761240 2124 PGYKKVIKKPMDFSTIREKLNNgqYPNFETFALDVRLVFDNCETFNEDDSDIGRAGHSMRKYFEKK 2189
Cdd:cd05521     38 PDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYALILEKYINDV 101
PDCD7 pfam16021
Programmed cell death protein 7;
835-1024 4.72e-04

Programmed cell death protein 7;


Pssm-ID: 464979 [Multi-domain]  Cd Length: 305  Bit Score: 44.72  E-value: 4.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  835 RAREESRMKRRKgrppnvgsaefldnTDAKLLRKLQAQEIARQAAQIK--LLRKLQKQEQARVAKEAKKQQAIMAAEEKR 912
Cdd:pfam16021   74 RQKKRLRRKRRK--------------EERKEEKKEEQERRAEREAKIDkwRRKQIQEVEEKKRERELKLAADAVLSEVRK 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  913 KQKEQMKIIKQQEKIKRIQQIRMEKELRAQQILEAKKKKKEEAANAKLLEA-EKRTK-----EKELRrqqaVLLKHQELE 986
Cdd:pfam16021  140 KQADAKRMLDILRSLEKLRKLRKEAARRKGIKPESECDEAFESHLEKLRSVwKKRTEeysaeEKALK----VMLEGEQEE 215
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039761240  987 rhrldmvwERERRRqhvmlmkamEARKKAEEKERLKQE 1024
Cdd:pfam16021  216 --------ERKRRR---------EKRQKKEREEFLQKK 236
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
858-1056 5.36e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.43  E-value: 5.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  858 LDNTDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAK---EAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIR 934
Cdd:TIGR02168  754 KELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQlkeELKALREALDELRAELTLLNEEAANLRERLESLERRI 833
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  935 MEKELRAQQILEAKKKKKEEAANAKL----LEAEKRTKEKELRRQQAVLLKHQE-LERHRLDMvwERERRRQHVMLMKAM 1009
Cdd:TIGR02168  834 AATERRLEDLEEQIEELSEDIESLAAeieeLEELIEELESELEALLNERASLEEaLALLRSEL--EELSEELRELESKRS 911
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1039761240 1010 EARKKAEE-KERLKQEKRDEKRL------NKERKLEQRRLELEMAKELKKPKED 1056
Cdd:TIGR02168  912 ELRRELEElREKLAQLELRLEGLevridnLQERLSEEYSLTLEEAEALENKIED 965
COG5141 COG5141
PHD zinc finger-containing protein [General function prediction only];
1962-2008 5.53e-04

PHD zinc finger-containing protein [General function prediction only];


Pssm-ID: 227470 [Multi-domain]  Cd Length: 669  Bit Score: 44.97  E-value: 5.53e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1962 YCQICRK-----GDNEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACI 2008
Cdd:COG5141    192 FDDICTKctsthNENSNAIVFCDGCEICVHQSCY--GIQFLPEGFWLCRKCI 241
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
863-1050 5.94e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.51  E-value: 5.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQ 942
Cdd:COG4372     48 EQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDL 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  943 QILEAKKKKKEEAANAKLLEAEKRTkeKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLK 1022
Cdd:COG4372    128 EQQRKQLEAQIAELQSEIAEREEEL--KELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEA 205
                          170       180
                   ....*....|....*....|....*...
gi 1039761240 1023 QEKRDEKRLNKERKLEQRRLELEMAKEL 1050
Cdd:COG4372    206 EKLIESLPRELAEELLEAKDSLEAKLGL 233
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
865-1078 5.95e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.14  E-value: 5.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  865 LLRKLQ--AQEIARQAAQIKL--LRKLQKQEQARvaKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQI--RMEKE 938
Cdd:COG4717     47 LLERLEkeADELFKPQGRKPElnLKELKELEEEL--KEAEEKEEEYAELQEELEELEEELEELEAELEELREEleKLEKL 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQILEAKKKkkeeaanaklLEAEKRTKE---KELRRQQAVL------LKHQELERHRLDMvwERERRRQHVMLMKAM 1009
Cdd:COG4717    125 LQLLPLYQELEA----------LEAELAELPerlEELEERLEELreleeeLEELEAELAELQE--ELEELLEQLSLATEE 192
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240 1010 EARKKAEEKERLKQEKRdekRLNKERKLEQRRLElEMAKELKKPKEDMCLADQKPLPEWPRIPGLVLSG 1078
Cdd:COG4717    193 ELQDLAEELEELQQRLA---ELEEELEEAQEELE-ELEEELEQLENELEAAALEERLKEARLLLLIAAA 257
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
877-1039 6.55e-04

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 44.75  E-value: 6.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  877 QAAQIKLLRkLQKQEQARVAKEAKKQQA---------IMAAEEKRKQKEQMKIIKQQEKIKRIQQiRMEKELRAQQilea 947
Cdd:pfam09731  297 DQLSKKLAE-LKKREEKHIERALEKQKEeldklaeelSARLEEVRAADEAQLRLEFEREREEIRE-SYEEKLRTEL---- 370
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  948 kkkkkeeaanaklleaEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKAEEKERLKQEKRD 1027
Cdd:pfam09731  371 ----------------ERQAEAHEEHLKDVLVEQEIELQREFLQDIKEKVEEERAGRLLKLNELLANLKGLEKATSSHSE 434
                          170
                   ....*....|..
gi 1039761240 1028 EKRLNkeRKLEQ 1039
Cdd:pfam09731  435 VEDEN--RKAQQ 444
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
863-1020 6.75e-04

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 44.56  E-value: 6.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQ--QAIMAAEEKRKQKE-------QMKIIKQQEKIKRIQ-Q 932
Cdd:pfam15709  371 AEKMREELELEQQRRFEEIRLRKQRLEEERQRQEEEERKQrlQLQAAQERARQQQEefrrklqELQRKKQQEEAERAEaE 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  933 IRMEKELRAQqileakkkkkeeaanakLLEAEKRTKE-KELRRQQAVLLKHQELERHRLDMVwERERRRQHVMLMKAMEA 1011
Cdd:pfam15709  451 KQRQKELEMQ-----------------LAEEQKRLMEmAEEERLEYQRQKQEAEEKARLEAE-ERRQKEEEAARLALEEA 512

                   ....*....
gi 1039761240 1012 RKKAEEKER 1020
Cdd:pfam15709  513 MKQAQEQAR 521
PHD_ATX3_4_5_like cd15495
PHD finger found in Arabidopsis thaliana histone-lysine N-methyltransferase arabidopsis ...
1963-2007 6.86e-04

PHD finger found in Arabidopsis thaliana histone-lysine N-methyltransferase arabidopsis trithorax-like protein ATX3, ATX4, ATX5, and similar proteins; The family includes A. thaliana ATX3 (also termed protein SET domain group 14, or trithorax-homolog protein 3), ATX4 (also termed protein SET domain group 16, or trithorax-homolog protein 4) and ATX5 (also termed protein SET domain group 29, or trithorax-homolog protein 5), which belong to the histone-lysine methyltransferase family. They show distinct phylogenetic origins from the ATX1 and ATX2 family. They are multi-domain containing proteins that consist of an N-terminal PWWP domain, a canonical Cys4HisCys3 plant homeodomain (PHD) finger, a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, and a C-terminal SET domain; this model corresponds to the Cys4HisCys3 canonical PHD finger.


Pssm-ID: 276970 [Multi-domain]  Cd Length: 47  Bit Score: 39.28  E-value: 6.86e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1039761240 1963 CQICRKGDN--EELLLLCDGCDKGCHTYCHrpKITTI-PDGDWFCPAC 2007
Cdd:cd15495      2 CAVCNEGEDddNNPLITCNRCQISVHQKCY--GIREVdPDGSWVCRAC 47
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
864-1065 7.25e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.96  E-value: 7.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  864 KLLRKLQAQEiarQAAQIKLLRKLQKQEQA----RVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEK------IKRIQQI 933
Cdd:TIGR00618  229 KHLREALQQT---QQSHAYLTQKREAQEEQlkkqQLLKQLRARIEELRAQEAVLEETQERINRARKAaplaahIKAVTQI 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  934 RMEKELRAQQileakkkkkeeaanaklLEAEKRTKEKELRRQQAVLLKHQELERHRLdmvWERERRRQHVMLmkAMEARK 1013
Cdd:TIGR00618  306 EQQAQRIHTE-----------------LQSKMRSRAKLLMKRAAHVKQQSSIEEQRR---LLQTLHSQEIHI--RDAHEV 363
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039761240 1014 KAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKPKEDMCLADQKPL 1065
Cdd:TIGR00618  364 ATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTR 415
DDRGK pfam09756
DDRGK domain; This is a family of proteins of approximately 300 residues, found in plants and ...
965-1052 7.32e-04

DDRGK domain; This is a family of proteins of approximately 300 residues, found in plants and vertebrates. They contain a highly conserved DDRGK motif.


Pssm-ID: 370664 [Multi-domain]  Cd Length: 188  Bit Score: 42.72  E-value: 7.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  965 KRTKEKELRRQQavllkhQELERHrldmvwERERRRQhvMLMKAMEARKKAEEKERLKQEKRDEKRlnKERKLEQRRLEL 1044
Cdd:pfam09756    9 AKLELKEAKRQQ------REAEEE------EREEREK--LEEKREEEYKEREEREEEAEKEKEEEE--RKQEEEQERKEQ 72

                   ....*...
gi 1039761240 1045 EMAKELKK 1052
Cdd:pfam09756   73 EEYEKLKS 80
PHD2_PHF14 cd15562
PHD finger 2 found in PHD finger protein 14 (PHF14) and similar proteins; PHF14 is a novel ...
1963-2007 1.01e-03

PHD finger 2 found in PHD finger protein 14 (PHF14) and similar proteins; PHF14 is a novel nuclear transcription factor that controls the proliferation of mesenchymal cells by directly repressing platelet-derived growth factor receptor-alpha (PDGFRalpha) expression. It also acts as an epigenetic regulator and plays an important role in the development of multiple organs in mammals. PHF14 contains three canonical plant homeodomain (PHD) fingers and a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His. It can interact with histones through its PHD fingers. The model corresponds to the second PHD finger.


Pssm-ID: 277037  Cd Length: 50  Bit Score: 38.93  E-value: 1.01e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1963 CQICRKGDNEELLLLCDGCDKGCHTYCHRPKITTIPDGD----WFCPAC 2007
Cdd:cd15562      2 CGICKKSNDQHLLALCDTCKLYYHLGCLDPPLTRMPKKTknsgWQCSEC 50
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
870-932 1.24e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 41.03  E-value: 1.24e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761240   870 QAQEIARQAAQIK-LLRKLQKQ-----EQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQ 932
Cdd:smart00935   30 RQAELEKLEKELQkLKEKLQKDaatlsEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILD 98
ARGLU pfam15346
Arginine and glutamate-rich 1; ARGLU, arginine and glutamate-rich 1 protein family, is ...
961-1063 1.25e-03

Arginine and glutamate-rich 1; ARGLU, arginine and glutamate-rich 1 protein family, is required for the oestrogen-dependent expression of ESR1 target genes. It functions in cooperation with MED1. The family of proteins is found in eukaryotes.


Pssm-ID: 405931 [Multi-domain]  Cd Length: 151  Bit Score: 41.58  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  961 LEAEKRTKEKELRRqqavllkhqELERHRLDMVWERERRRQhvmlmkamEARKKAEEKERLKQEKRDE-----KRLNKER 1035
Cdd:pfam15346   43 VEEARKIMEKQVLE---------ELEREREAELEEERRKEE--------EERKKREELERILEENNRKieeaqRKEAEER 105
                           90       100       110
                   ....*....|....*....|....*....|
gi 1039761240 1036 --KLEQRRLELEMAKELKKPKEDMCLADQK 1063
Cdd:pfam15346  106 laMLEEQRRMKEERQRREKEEEEREKREQQ 135
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
866-1055 1.26e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.90  E-value: 1.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  866 LRKLQAQEIARQAAQIKLLRKLQKQEQARVA------KEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKEL 939
Cdd:PRK03918   194 LIKEKEKELEEVLREINEISSELPELREELEklekevKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKK 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  940 RAQQileakkkkkeeaanakLLEAEKRTKE-----KELRRQQAVLLKHQELERhRLDMVWERERRRQHVMLMKAMEARKK 1014
Cdd:PRK03918   274 EIEE----------------LEEKVKELKElkekaEEYIKLSEFYEEYLDELR-EIEKRLSRLEEEINGIEERIKELEEK 336
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039761240 1015 AEEKERLKQEKRD-EKRLNkerKLEQRRLELEMAKELKKPKE 1055
Cdd:PRK03918   337 EERLEELKKKLKElEKRLE---ELEERHELYEEAKAKKEELE 375
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
898-1049 1.27e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.99  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  898 EAKK---QQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMeKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRR 974
Cdd:COG4717     33 EAGKstlLAFIRAMLLERLEKEADELFKPQGRKPELNLKEL-KELEEELKEAEEKEEEYAELQEELEELEEELEELEAEL 111
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761240  975 QQAvllkhqeleRHRLDMVwerERRRQHVMLMKAMEA--RKKAEEKERLKQ-EKRDEKRLNKERKLEQRRLELEMAKE 1049
Cdd:COG4717    112 EEL---------REELEKL---EKLLQLLPLYQELEAleAELAELPERLEElEERLEELRELEEELEELEAELAELQE 177
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
867-1048 1.27e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 43.79  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAQIKLLRKLQ-----KQEQARVAKEAK--KQQ----AIMAAEEKRKQKEQMKIIK-----QQEKIKR- 929
Cdd:pfam15709  352 RKRREQEEQRRLQQEQLERAEKmreelELEQQRRFEEIRlrKQRleeeRQRQEEEERKQRLQLQAAQerarqQQEEFRRk 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  930 ---IQQIRMEKEL-RAqqileakkkkkeeaanakllEAEKRtKEKELRRQQAvllkhqelERHRLDMVWERERRRQHVml 1005
Cdd:pfam15709  432 lqeLQRKKQQEEAeRA--------------------EAEKQ-RQKELEMQLA--------EEQKRLMEMAEEERLEYQ-- 480
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1039761240 1006 mkameaRKKAEEKERLKQEKrDEKRLNKErklEQRRLELEMAK 1048
Cdd:pfam15709  481 ------RQKQEAEEKARLEA-EERRQKEE---EAARLALEEAM 513
PRK12704 PRK12704
phosphodiesterase; Provisional
962-1052 1.55e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.61  E-value: 1.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  962 EAEKRTKEKELRRQQAVLLKHQELERhrldmvwERERRRQHVmlmKAMEARKKAEEkERLKQ-----EKRDEKRLNKERK 1036
Cdd:PRK12704    50 EAEAIKKEALLEAKEEIHKLRNEFEK-------ELRERRNEL---QKLEKRLLQKE-ENLDRklellEKREEELEKKEKE 118
                           90
                   ....*....|....*..
gi 1039761240 1037 LEQRRLELE-MAKELKK 1052
Cdd:PRK12704   119 LEQKQQELEkKEEELEE 135
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
889-1039 1.57e-03

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 41.18  E-value: 1.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  889 KQEQARVAKEAKKQQAIMAAEEKRKQKEQMK------IIKQQEKIKRIQQIRMEKELRAQQileakkkkkeeaanakllE 962
Cdd:pfam05672   19 KRRQAREQREREEQERLEKEEEERLRKEELRrraeeeRARREEEARRLEEERRREEEERQR------------------K 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761240  963 AEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHvmlmkaMEARKKAEEKERLKQEKRDEKRLNKERKLEQ 1039
Cdd:pfam05672   81 AEEEAEEREQREQEEQERLQKQKEEAEAKAREEAERQRQE------REKIMQQEEQERLERKKRIEEIMKRTRKSDQ 151
PHD_JADE2 cd15680
PHD finger found in protein Jade-2 and similar proteins; Jade-2, also termed PHD finger ...
1963-2007 1.73e-03

PHD finger found in protein Jade-2 and similar proteins; Jade-2, also termed PHD finger protein 15 (PHF15), is a plant homeodomain (PHD) zinc finger protein that is closely related to Jade-1, which functions as an essential regulator of multiple cell signaling pathways. Like Jade-1, Jade-2 is required for ING4 and ING5 to associate with histone acetyltransferase (HAT) HBO1 and Eaf6 to form a HBO1 complex that has a histone H4-specific acetyltransferase activity, a reduced activity toward histone H3, and is responsible for the bulk of histone H4 acetylation in vivo. Jade-2 contains a canonical Cys4HisCys3 PHD finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, both of which are zinc-binding motifs. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277150 [Multi-domain]  Cd Length: 46  Bit Score: 38.06  E-value: 1.73e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1963 CQICR--KGDNEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15680      2 CDVCRspEGEDGNEMVFCDKCNVCVHQACY--GILKVPTGSWLCRTC 46
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
848-1049 1.80e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.60  E-value: 1.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  848 RPPNVGSAEFLDNTDAKLLRKLQAQEIARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMAaeEKRKQKEQMKIIKQQEKI 927
Cdd:COG4717    295 REKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLR--EAEELEEELQLEELEQEI 372
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  928 KRI---QQIRMEKELRAQqileakkkkkeEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVM 1004
Cdd:COG4717    373 AALlaeAGVEDEEELRAA-----------LEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEEL 441
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240 1005 LMKAMEARKKAEEKERLKQEKrdeKRLNKERKLEQRRLELEMAKE 1049
Cdd:COG4717    442 EELEEELEELREELAELEAEL---EQLEEDGELAELLQELEELKA 483
PHD5_NSD cd15568
PHD finger 5 found in nuclear receptor-binding SET domain-containing (NSD) proteins; The ...
1962-2005 1.82e-03

PHD finger 5 found in nuclear receptor-binding SET domain-containing (NSD) proteins; The nuclear receptor binding SET domain (NSD) protein is a family of three HMTases, NSD1, NSD2/MMSET/WHSC1, and NSD3/WHSC1L1, that are critical in maintaining chromatin integrity. Reducing NSD activity through specific lysine-HMTase inhibitors appears promising to help suppress cancer growth. NSD proteins have specific mono- and dimethylase activities for H3K36, and they play non-redundant roles during development. NSD1 plays a role in several pathologies, including but not limited to Sotos and Weaver syndromes, acute myeloid leukemia, breast cancer, neuroblastoma, and glioblastoma formation. NSD2 is involved in cancer cell proliferation, survival, and tumor growth, by mediating constitutive NF-kappaB signaling via the cytokine autocrine loop. NSD3 is amplified in human breast cancer cell lines. Moreover, translocation resulting in NUP98 fusion to NSD3 leads to the development of acute myeloid leukemia. NSD proteins contain a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-proline (PWWP) domains, five plant homeodomain (PHD) fingers, and an NSD-specific Cys-His rich domain (Cys5HisCysHis). This model corresponds to the fifth PHD finger.


Pssm-ID: 277043 [Multi-domain]  Cd Length: 43  Bit Score: 38.08  E-value: 1.82e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1962 YCQICRKGdneELLLLCDGcdKGCHTYCHRP--KITTIPDGDWFCP 2005
Cdd:cd15568      1 ECFRCGDG---GDLVLCDF--KGCPKVYHLSclGLEKPPGGKWICP 41
Casc1_N pfam15927
Cancer susceptibility candidate 1 N-terminus; This presumed domain is functionally ...
1011-1053 1.90e-03

Cancer susceptibility candidate 1 N-terminus; This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is approximately 200 amino acids in length. The family is found in association with pfam12366. There are two completely conserved residues (N and W) that may be functionally important.


Pssm-ID: 464947 [Multi-domain]  Cd Length: 201  Bit Score: 41.96  E-value: 1.90e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1039761240 1011 ARKKAEEKERLKQEKRDEKRLNKERKLEQRRLELEMAKELKKP 1053
Cdd:pfam15927    1 ARLREEEEERLRAEEEEAERLEEERREEEEEERLAAEQDRRAE 43
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
896-1043 2.00e-03

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 40.79  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  896 AKEAKKqqaimAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELRAQQileakkkkkeeaanakllEAEKRTKEKELRRQ 975
Cdd:pfam05672    9 AEEAAR-----ILAEKRRQAREQREREEQERLEKEEEERLRKEELRRR------------------AEEERARREEEARR 65
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039761240  976 QAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEA-RKKA-----EEKERLKQEKrdEKRLNKErklEQRRLE 1043
Cdd:pfam05672   66 LEEERRREEEERQRKAEEEAEEREQREQEEQERLQKqKEEAeakarEEAERQRQER--EKIMQQE---EQERLE 134
PHD_BRPF3 cd15678
PHD finger found in bromodomain and PHD finger-containing protein 3 (BRPF3) and similar ...
1962-2012 2.09e-03

PHD finger found in bromodomain and PHD finger-containing protein 3 (BRPF3) and similar proteins; BRPF3 is a homolog of BRPF1 and BRPF2. It is a scaffold protein that forms a novel monocytic leukemic zinc finger protein (MOZ)/MOZ-related factor (MORF) H3 histone acetyltransferase (HAT) complex with other regulatory subunits. BRPF3 contains a canonical Cys4HisCys3 plant homeodomain (PHD) finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, a bromodomain, and a proline-tryptophan-tryptophan-proline (PWWP) domain. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277148 [Multi-domain]  Cd Length: 55  Bit Score: 38.08  E-value: 2.09e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039761240 1962 YCQICRKGD--NEELLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPACISKAS 2012
Cdd:cd15678      3 FCCVCLDDEchNSNVILFCDICNLAVHQECY--GVPYIPEGQWLCRCCLQSPS 53
GBP_C pfam02841
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ...
856-981 2.10e-03

Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.


Pssm-ID: 460721 [Multi-domain]  Cd Length: 297  Bit Score: 42.66  E-value: 2.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  856 EFLDNTDAKLLRKLQAQeiarQAAQIKllRKLQKQEQARvAKEAKKQQAIMaaEEKRKQKEQM---KIIKQQEKIKRIQQ 932
Cdd:pfam02841  180 EFLQSKEAVEEAILQTD----QALTAK--EKAIEAERAK-AEAAEAEQELL--REKQKEEEQMmeaQERSYQEHVKQLIE 250
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240  933 iRMEKElRAQqileakkkkkeeaanakLLEAEKRTKEKELRRQQAVLLK 981
Cdd:pfam02841  251 -KMEAE-REQ-----------------LLAEQERMLEHKLQEQEELLKE 280
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
867-1052 2.25e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 43.10  E-value: 2.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAQ-------IKLLRKLQKQEQARVAK-EAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKE 938
Cdd:COG3064     24 EKRAAAEAEQKAKEeaeeerlAELEAKRQAEEEAREAKaEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAKAAKE 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQILEAKKKkkeeaanakllEAEKRTKEKElrRQQAvllkhqelerhrldmvwERERRRqhvmlmKAMEARKKAEEK 1018
Cdd:COG3064    104 AEAAAAAEKAAA-----------AAEKEKAEEA--KRKA-----------------EEEAKR------KAEEERKAAEAE 147
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1039761240 1019 ERLKQEKRDEKRLNKERKLEQRRLELEMAKELKK 1052
Cdd:COG3064    148 AAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAA 181
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
997-1052 2.63e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.17  E-value: 2.63e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240  997 ERRRQHVM--LMKAMEARKK---AEEKERLKQEKRDEKrLNKERKLEQRR-LELEMAKELKK 1052
Cdd:pfam07946  263 KKTREEEIekIKKAAEEERAeeaQEKKEEAKKKEREEK-LAKLSPEEQRKyEEKERKKEQRK 323
WHIM1 pfam15612
WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and ...
1183-1223 2.64e-03

WSTF, HB1, Itc1p, MBD9 motif 1; A conserved alpha helical motif that along with the WHIM2 and WHIM3 motifs, and the DDT domain comprise an alpha helical module found in diverse eukaryotic chromatin proteins.Based on the Ioc3 structure, this module is inferred to interact with nucleosomal linker DNA and the SLIDE domain of ISWI proteins. The resulting complex forms a protein ruler that measures out the spacing between two adjacent nucleosomes. The conserved basic residue in WHIM1 is involved in packing with the DDT motif. The module shows a great domain architectural diversity and is often combined with other modified histone peptide recognising and DNA binding domains, some of which discriminate methylated DNA.


Pssm-ID: 464774 [Multi-domain]  Cd Length: 46  Bit Score: 37.48  E-value: 2.64e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1039761240 1183 ELTESLKTKAFQAHTPAQKASILAFLVNELACSKSVVSEID 1223
Cdd:pfam15612    6 GLLETLKKGGYYELSPEEKLKILKALCDLLLSSSAIRDEIE 46
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
870-1048 2.68e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.12  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  870 QAQEIARQAAQIKLL-RKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQM--KIIKQQEKIKRIQQIRMEKELRAQQILE 946
Cdd:TIGR02168  675 RRREIEELEEKIEELeEKIAELEKALAELRKELEELEEELEQLRKELEELsrQISALRKDLARLEAEVEQLEERIAQLSK 754
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  947 AKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLlkHQELERHRLDMVWERER----RRQHVMLmkAMEARKKAEEKERLK 1022
Cdd:TIGR02168  755 ELTELEAEIEELEERLEEAEEELAEAEAEIEEL--EAQIEQLKEELKALREAldelRAELTLL--NEEAANLRERLESLE 830
                          170       180
                   ....*....|....*....|....*.
gi 1039761240 1023 QEKRDEKRLNKERKLEQRRLELEMAK 1048
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEELSEDIES 856
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
960-1051 2.78e-03

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 42.18  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  960 LLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQH----VMLMKAM--EARKKAEEKERLKQEK-RDEKRLN 1032
Cdd:cd16269    193 LTEKEKEIEAERAKAEAAEQERKLLEEQQRELEQKLEDQERSYeehlRQLKEKMeeERENLLKEQERALESKlKEQEALL 272
                           90
                   ....*....|....*....
gi 1039761240 1033 KERKLEQRRLELEMAKELK 1051
Cdd:cd16269    273 EEGFKEQAELLQEEIRSLK 291
PHD_PHF3_like cd15552
PHD finger found in PHD finger protein 3 (PHF3), and death-inducer obliterator variants Dido1, ...
1962-2007 3.00e-03

PHD finger found in PHD finger protein 3 (PHF3), and death-inducer obliterator variants Dido1, Dido2, and Dido3; PHF3 is a human homolog of yeast protein bypass of Ess1 (Bye1), a nuclear protein with a domain resembling the central domain in the transcription elongation factor TFIIS. It is ubiquitously expressed in normal tissues including brain, but its expression is significantly reduced or lost in glioblastomas. PHF3 contains an N-terminal plant homeodomain (PHD) finger, a central RNA polymerase II (Pol II)-binding TFIIS-like domain (TLD) domain, and a C-terminal Spen paralogue and orthologue C-terminal (SPOC) domain. This family also includes Dido gene encoding three alternative splicing variants (Dido1, 2, and 3), which have been implicated in a number of cellular processes such as apoptosis and chromosomal segregation, particularly in the hematopoietic system. Dido1 is important for maintaining embryonic stem (ES) cells and directly regulates the expression of pluripotency factors. It is the shortest isoform that contains only a highly conserved PHD finger responsible for the binding of histone H3 with a higher affinity for trimethylated lysine4 (H3K4me3). Gene Dido1 is a Bone morphogenetic protein (BMP) target gene and promotes BMP-induced melanoma progression. It also triggers apoptosis after nuclear translocation and caspase upregulation. Dido3 is the largest isoform and is ubiquitously expressed in all human tissues. It is dispensable for ES cell self-renewal and pluripotency, but is involved in the maintenance of stem cell genomic stability and tumorigenesis. Dido3 contains a PHD finger, a transcription elongation factor S-II subunit M (TFSIIM) domain, a SPOC module, and a long C-terminal region (CT) of unknown homology.


Pssm-ID: 277027  Cd Length: 50  Bit Score: 37.76  E-value: 3.00e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1962 YCqICRKGDNEELLLLCDGCDKGCHTYC-----HRPKITTIPDGDWFCPAC 2007
Cdd:cd15552      1 YC-ICRKPHNNRFMICCDRCEEWFHGDCvgiteAQGKEMEENIEEYVCPKC 50
FliJ COG2882
Flagellar biosynthesis chaperone FliJ [Cell motility];
882-1047 3.04e-03

Flagellar biosynthesis chaperone FliJ [Cell motility];


Pssm-ID: 442129 [Multi-domain]  Cd Length: 142  Bit Score: 40.27  E-value: 3.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  882 KLLRKLQKQEQARVAKEAKKQQAIMAAEEKRKQKEQmkiiKQQEKIKRIQQiRMEKELRAQQILEakkkkkeeaanaklL 961
Cdd:COG2882      9 TLLDLAEKEEDEAARELGQAQQALEQAEEQLEQLEQ----YREEYEQRLQQ-KLQQGLSAAQLRN--------------Y 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  962 EAEKRTKEKELRRQQAVLLKHQElerhrldmvwERERRRQHvmLMKAMeARKKAEEKerLKqEKRDEKRLNKERKLEQRR 1041
Cdd:COG2882     70 QQFIARLDEAIEQQQQQVAQAEQ----------QVEQARQA--WLEAR-QERKALEK--LK-ERRREEERQEENRREQKE 133

                   ....*.
gi 1039761240 1042 LElEMA 1047
Cdd:COG2882    134 LD-ELA 138
PHD_AF10_AF17 cd15574
PHD finger found in protein AF-10 and AF-17; This family includes protein AF-10 and AF-17. ...
1963-2007 3.09e-03

PHD finger found in protein AF-10 and AF-17; This family includes protein AF-10 and AF-17. AF-10, also termed ALL1 (acute lymphoblastic leukemia)-fused gene from chromosome 10 protein, is a transcription factor encoded by gene AF10, a translocation partner of the MLL (mixed-lineage leukemia) oncogene in leukemia. AF-10 has been implicated in the development of leukemia following chromosomal rearrangements between the AF10 gene and one of at least two other genes, MLL and CALM. It plays a key role in the survival of uncommitted hematopoietic cells. Moreover, AF-10 functions as a follistatin-related gene (FLRG)-interacting protein. The interaction with FLRG enhances AF10-dependent transcription. It interacts with the human counterpart of the yeast Dot1, hDOT1L, and may act as a bridge for the recruitment of hDOT1L to the genes targeted by MLL-AF10. It also interacts with the synovial sarcoma associated SYT protein and may play a role in synovial sarcomas and acute leukemias. AF-17, also termed ALL1-fused gene from chromosome 17 protein, is encoded by gene AF17 that has been identified in hematological malignancies as translocation partners of the mixed lineage leukemia gene MLL. It is a putative transcription factor that may play a role in multiple signaling pathways. It is involved in chromatin-mediated gene regulation mechanisms. It functions as a component of the multi-subunit Dot1 complex (Dotcom) and plays a role in the Wnt/Wingless signaling pathway. It also seems to be a downstream target of the beta-catenin/T-cell factor pathway, and participates in G2-M progression. Moreover, it may function as an important regulator of ENaC-mediated Na+ transport and thus blood pressure. Both AF-10 and AF-17 contain an N-terminal canonical Cys4HisCys3 plant homeodomain (PHD) finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His. The PHD finger is involved in their homo-oligomerization. In the C-terminal region, they possess a leucine zipper domain and a glutamine-rich region. This family also includes ZFP-1, the Caenorhabditis elegans AF10 homolog. It was originally identified as a factor promoting RNAi interference in C. elegans. It also acts as a Dot1-interacting protein that opposes H2B ubiquitination to reduce polymerase II (Pol II) transcription. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277049 [Multi-domain]  Cd Length: 48  Bit Score: 37.49  E-value: 3.09e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039761240 1963 CQIC--RKGDNEELLLLCDG--CDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15574      2 CCVCsdERGWAENPLVYCDGhgCNVAVHQACY--GIVQVPTGPWFCRKC 48
PHD_JADE1 cd15679
PHD finger found in protein Jade-1 and similar proteins; Jade-1, also termed PHD finger ...
1963-2007 3.81e-03

PHD finger found in protein Jade-1 and similar proteins; Jade-1, also termed PHD finger protein 17 (PHF17), is a novel binding partner of von Hippel-Lindau (VHL) tumor suppressor Pvhl, a key regulator of the cellular oxygen sensing pathway. It is highly expressed in renal proximal tubules. Jade-1 functions as an essential regulator of multiple cell signaling pathways. It may be involved in the serine/threonine kinase AKT/AKT1 pathway during renal cancer pathogenesis and normally prevents renal epithelial cell proliferation and transformation. It also acts as a pro-apoptotic and growth suppressive ubiquitin ligase to inhibit canonical Wnt downstream effector beta-catenin for proteasomal degradation, and as a transcription factor associated with histone acetyltransferase activity and with increased abundance of cyclin-dependent kinase inhibitor p21. Moreover, Jade-1 is required for ING4 and ING5 to associate with histone acetyltransferase (HAT) HBO1 and Eaf6 to form a HBO1 complex, and plays a role in epithelial cell regeneration. It has also been identified as a novel component of the nephrocystin protein (NPHP) complex and interacts with the ciliary protein nephrocystin-4 (NPHP4). Jade-1 contains a canonical Cys4HisCys3 plant homeodomain (PHD) finger followed by a non-canonical extended PHD (ePHD) finger, Cys2HisCys5HisCys2His, both of which are zinc-binding motifs. This model corresponds to the canonical Cys4HisCys3 PHD finger.


Pssm-ID: 277149 [Multi-domain]  Cd Length: 46  Bit Score: 37.36  E-value: 3.81e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1039761240 1963 CQICRKGDNEE--LLLLCDGCDKGCHTYCHrpKITTIPDGDWFCPAC 2007
Cdd:cd15679      2 CDVCQSPDGEDgnEMVFCDKCNICVHQACY--GILKVPEGSWLCRTC 46
ATP-synt_Fo_b cd06503
F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex ...
856-942 4.01e-03

F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex of FoF1-ATP synthase. The F-type ATP synthases (FoF1-ATPase) consist of two structural domains: the F1 (assembly factor one) complex containing the soluble catalytic core, and the Fo (oligomycin sensitive factor) complex containing the membrane proton channel, linked together by a central stalk and a peripheral stalk. F1 is composed of alpha (or A), beta (B), gamma (C), delta (D) and epsilon (E) subunits with a stoichiometry of 3:3:1:1:1, while Fo consists of the three subunits a, b, and c (1:2:10-14). An oligomeric ring of 10-14 c subunits (c-ring) make up the Fo rotor. The flux of protons through the ATPase channel (Fo) drives the rotation of the c-ring, which in turn is coupled to the rotation of the F1 complex gamma subunit rotor due to the permanent binding between the gamma and epsilon subunits of F1 and the c-ring of Fo. The F-ATP synthases are primarily found in the inner membranes of eukaryotic mitochondria, in the thylakoid membranes of chloroplasts or in the plasma membranes of bacteria. The F-ATP synthases are the primary producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). Alternatively, under conditions of low driving force, ATP synthases function as ATPases, thus generating a transmembrane proton or Na(+) gradient at the expense of energy derived from ATP hydrolysis. This group also includes F-ATP synthase that has also been found in the archaea Candidatus Methanoperedens.


Pssm-ID: 349951 [Multi-domain]  Cd Length: 132  Bit Score: 39.34  E-value: 4.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  856 EFLDNTDAKLLRKLQAQEIARQAAQikllrKLQKQEQARVAkEAKKQ-QAIMAAEEKRKQKEQMKIIKQ-QEKIKRI--- 930
Cdd:cd06503     26 KALDEREEKIAESLEEAEKAKEEAE-----ELLAEYEEKLA-EARAEaQEIIEEARKEAEKIKEEILAEaKEEAERIleq 99
                           90
                   ....*....|....*....
gi 1039761240  931 --QQIRMEK-----ELRAQ 942
Cdd:cd06503    100 akAEIEQEKekalaELRKE 118
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
887-1050 4.11e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 42.15  E-value: 4.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  887 LQKQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRiqqirmekelraqqileakkkkkeeaanaklLEAEKR 966
Cdd:COG2433    382 LEELIEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVER-------------------------------LEAEVE 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  967 TKEKELRRQQAVLlkhQELERhRLdmvwERERRRQHVMLMKAMEARKKAEEKERLKQEKRDEKRLNKERKLEQRRLElEM 1046
Cdd:COG2433    431 ELEAELEEKDERI---ERLER-EL----SEARSEERREIRKDREISRLDREIERLERELEEERERIEELKRKLERLK-EL 501

                   ....
gi 1039761240 1047 AKEL 1050
Cdd:COG2433    502 WKLE 505
CCDC66 pfam15236
Coiled-coil domain-containing protein 66; This protein family, named Coiled-coil ...
887-1026 4.72e-03

Coiled-coil domain-containing protein 66; This protein family, named Coiled-coil domain-containing protein 66 (CCDC) refers to a protein domain found in eukaryotes, and is approximately 160 amino acids in length. CCDC66 protein is detected mainly in the inner segments of photoreceptors in many vertebrates including mice and humans. It has been found in dogs, that a mutation in the CCDC66 gene causes generalized progressive retinal atrophy (gPRA). This shows that the protein encoded for by this gene is vital for healthy vision and guards against photoreceptor cell degeneration. The structure of CCDC66 proteins includes a heptad repeat pattern which contains at least one coiled-coil domain. There are at least two or more alpha-helices which form a cable-like structure.


Pssm-ID: 434558 [Multi-domain]  Cd Length: 154  Bit Score: 39.78  E-value: 4.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  887 LQKQEQARvAKEAKKQQAIMAA-EEKRKQKeqmkiikqQEKIKRIQQIRMEKELRAQQileakkkkkeeaanakLLEAEK 965
Cdd:pfam15236   44 LEERERKR-QKALEHQNAIKKQlEEKERQK--------KLEEERRRQEEQEEEERLRR----------------EREEEQ 98
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039761240  966 RTKEKELRRQQavllkhqelERHrldmvwERERRRQHVMLmkamEARKKAEE-KERLKQEKR 1026
Cdd:pfam15236   99 KQFEEERRKQK---------EKE------EAMTRKTQALL----QAMQKAQElAQRLKQEQR 141
ClassIIa_HDAC4_Gln-rich-N cd10162
Glutamine-rich N-terminal helical domain of HDAC4, a Class IIa histone deacetylase; This ...
964-1054 4.95e-03

Glutamine-rich N-terminal helical domain of HDAC4, a Class IIa histone deacetylase; This family consists of the glutamine-rich domain of histone deacetylase 4 (HDAC4). It belongs to a superfamily that consists of the glutamine-rich N-terminal helical extension to certain Class IIa histone deacetylases (HDACs), including HDAC4, HDAC5 and HDCA9; it is missing from HDAC7. This domain confers responsiveness to calcium signals and mediates interactions with transcription factors and cofactors, and it is able to repress transcription independently of the HDAC C-terminal, zinc-dependent catalytic domain. It has many intra- and inter-helical interactions which are possibly involved in reversible assembly and disassembly of proteins. HDACs regulate diverse cellular processes through enzymatic deacetylation of histone as well as non-histone proteins, in particular deacetylating N(6)-acetyl-lysine residues.


Pssm-ID: 197398 [Multi-domain]  Cd Length: 90  Bit Score: 38.25  E-value: 4.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  964 EKRTKEKELRRQQAVLLKHQELERHRLDmvweRERRRQHVMLMKAMEAR--KKAEEKERLKQEKRDEKRLNKERKLEQRR 1041
Cdd:cd10162      1 EPALREQQLQQELLALKQKQQIQRQLLI----AEFQRQHEQLSRQHEAQlhEHIKQQQELLAMKHQQELLEHQRKLERHR 76
                           90
                   ....*....|...
gi 1039761240 1042 LELEMAKELKKPK 1054
Cdd:cd10162     77 QEQEMEKQQREQK 89
PRK11637 PRK11637
AmiB activator; Provisional
872-1041 5.07e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 41.60  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  872 QEIARQAAQIKLLRKLQKQEQARVAKE---AKKQ------QAIMAAEE-KRKQKEQMKI----IKQQEKIKRIQQIRmeK 937
Cdd:PRK11637   103 KQIDELNASIAKLEQQQAAQERLLAAQldaAFRQgehtglQLILSGEEsQRGERILAYFgylnQARQETIAELKQTR--E 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  938 ELRAQQileakkkkkeeaanaKLLEaEKRTKEKEL---RRQQAVLLKHQELERHRLDMVWER--ERRRQHVMLMKAME-- 1010
Cdd:PRK11637   181 ELAAQK---------------AELE-EKQSQQKTLlyeQQAQQQKLEQARNERKKTLTGLESslQKDQQQLSELRANEsr 244
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1039761240 1011 -----ARKKAEEKERLKQEKRDEKRLNKERKLEQRR 1041
Cdd:PRK11637   245 lrdsiARAEREAKARAEREAREAARVRDKQKQAKRK 280
DDRGK pfam09756
DDRGK domain; This is a family of proteins of approximately 300 residues, found in plants and ...
885-981 5.18e-03

DDRGK domain; This is a family of proteins of approximately 300 residues, found in plants and vertebrates. They contain a highly conserved DDRGK motif.


Pssm-ID: 370664 [Multi-domain]  Cd Length: 188  Bit Score: 40.41  E-value: 5.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  885 RKLQKQEQARVA-KEAKKQQ--AIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKElraqqileakkkkkeeaanakll 961
Cdd:pfam09756    1 KKLGAKKRAKLElKEAKRQQreAEEEEREEREKLEEKREEEYKEREEREEEAEKEKE----------------------- 57
                           90       100
                   ....*....|....*....|
gi 1039761240  962 EAEKRTKEKELRRQQAVLLK 981
Cdd:pfam09756   58 EEERKQEEEQERKEQEEYEK 77
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
860-942 5.19e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.67  E-value: 5.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  860 NTDAKLLRKLQAQEIARQAAQIKLLRKLQkQEQARVAKEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQI--RMEK 937
Cdd:COG4942    156 RADLAELAALRAELEAERAELEALLAELE-EERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALiaRLEA 234

                   ....*
gi 1039761240  938 ELRAQ 942
Cdd:COG4942    235 EAAAA 239
DUF5384 pfam17358
Family of unknown function (DUF5384); This is a family of unknown function found in ...
962-1062 5.34e-03

Family of unknown function (DUF5384); This is a family of unknown function found in Proteobacteria.


Pssm-ID: 407453 [Multi-domain]  Cd Length: 145  Bit Score: 39.59  E-value: 5.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  962 EAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQhvmlmkAMEARKKAEEKERLKQE-----KRDEKRLNKERK 1036
Cdd:pfam17358    1 EQQGKEDRVAAERQAAYEREWEEEQARAEAAAAAARRARA------AAAAAAAAAAKERAKAEaladkKRDQSYEDELRA 74
                           90       100
                   ....*....|....*....|....*.
gi 1039761240 1037 LEQRRLELEMAKELKKPKEDMCLADQ 1062
Cdd:pfam17358   75 LEIEERKLALAAQKARAKRENDFIDQ 100
growth_prot_Scy NF041483
polarized growth protein Scy;
863-1056 5.97e-03

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 42.12  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  863 AKLLRKlQAQE-IARQAAQIKLLRKlQKQEQArvakEAKKQQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRMEKELR- 940
Cdd:NF041483   515 ATTLRR-QAEEtLERTRAEAERLRA-EAEEQA----EEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERl 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  941 --AQQILEAKKKKKEEAANAKLLEAEK-RTKEKELRR---QQAvllkHQELERHRLDMVWERERRR---QHVMLMKAMEA 1011
Cdd:NF041483   589 taAEEALADARAEAERIRREAAEETERlRTEAAERIRtlqAQA----EQEAERLRTEAAADASAARaegENVAVRLRSEA 664
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1039761240 1012 rkkAEEKERLKQEKRDE-KRLNKERKLEQRRLELEMAKELKKPKED 1056
Cdd:NF041483   665 ---AAEAERLKSEAQESaDRVRAEAAAAAERVGTEAAEALAAAQEE 707
ClassIIa_HDAC_Gln-rich-N cd10149
Glutamine-rich N-terminal helical domain of various Class IIa histone deacetylases (HDAC4, ...
895-1003 6.00e-03

Glutamine-rich N-terminal helical domain of various Class IIa histone deacetylases (HDAC4, HDAC5 and HDCA9); This superfamily consists of a glutamine-rich N-terminal helical extension to certain Class IIa histone deacetylases (HDACs), including HDAC4, HDAC5 and HDAC9; it is missing in HDAC7. It is referred to as the glutamine-rich domain, and confers responsiveness to calcium signals and mediates interactions with transcription factors and cofactors. This domain is able to repress transcription independently of the HDAC's C-terminal, zinc-dependent catalytic domain. It has many intra- and inter-helical interactions which are possibly involved in reversible assembly and disassembly of proteins. HDACs regulate diverse cellular processes through enzymatic deacetylation of histone as well as non-histone proteins, in particular deacetylating N(6)-acetyl-lysine residues.


Pssm-ID: 197397 [Multi-domain]  Cd Length: 90  Bit Score: 38.13  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  895 VAKEAKKQQAIMAAEEKRKQKEQMKII---KQQEKIKRIQQIRMEKELRAQQILEAkkkkkeeaanaklleaekrtkeke 971
Cdd:cd10149      3 VLREQQLQQELLALKQQQQIQKQLLIAefqKQHENLTRQHEAQLQEHIKQQQEMLA------------------------ 58
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1039761240  972 LRRQQAVLLKHQELERHRLDMVWERERRRQHV 1003
Cdd:cd10149     59 IKQQQELLEKQRKLEQQRQEQELEKQRREQQL 90
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
867-989 6.88e-03

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 41.02  E-value: 6.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  867 RKLQAQEIARQAAQIKllRKLQKQEQARVAKEAKKQQAIMAAEEKRKQK--EQMKIIKQQEKIKRIQQIRMEKELRAQqi 944
Cdd:cd16269    191 QALTEKEKEIEAERAK--AEAAEQERKLLEEQQRELEQKLEDQERSYEEhlRQLKEKMEEERENLLKEQERALESKLK-- 266
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1039761240  945 leakkkkkeeaanaklleaEKRTKEKELRRQQAVLLKhQELERHR 989
Cdd:cd16269    267 -------------------EQEALLEEGFKEQAELLQ-EEIRSLK 291
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
875-999 6.93e-03

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 39.25  E-value: 6.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  875 ARQAAQIKLLRKLQKQEQARVAKEAKKQQAIMA----------AEEKRKQKE-----------QMKIIKQQEKIKRIQQI 933
Cdd:pfam05672   23 AREQREREEQERLEKEEEERLRKEELRRRAEEErarreeearrLEEERRREEeerqrkaeeeaEEREQREQEEQERLQKQ 102
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761240  934 RMEKELRAQQileakkkkkeeaanakllEAEKRTKEKELRRQQAVllkhQE-LER-HRLDMVWERERR 999
Cdd:pfam05672  103 KEEAEAKARE------------------EAERQRQEREKIMQQEE----QErLERkKRIEEIMKRTRK 148
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
862-1046 7.40e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 41.18  E-value: 7.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  862 DAKLLRKLQAQEIARQAAQIKLLRKLQKQE---QARVAKEAKK---QQAIMAAEEKRKQKEQMKIIKQQEKIKRIQQIRM 935
Cdd:COG3064     85 AAAEAEKKAAAEKAKAAKEAEAAAAAEKAAaaaEKEKAEEAKRkaeEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAA 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  936 EKELRAQQILEAKKKKKEEAANAKLLEAEKRTKEKELRRQQAVLLKHQELERHRLDMVWERERRRQHVMLMKAMEARKKA 1015
Cdd:COG3064    165 AAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAA 244
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1039761240 1016 EEKERLKQEKRDEKRLNKERKLEQRRLELEM 1046
Cdd:COG3064    245 LGGAEEAADLAAVGVLGAALAAAAAGAAALS 275
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
858-1067 7.44e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 41.58  E-value: 7.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  858 LDNTDAKLLRkLQ--AQEIARQaaqiklLRKLQKQeqarvakeAKKqqaimaAEEKRKQKEQMKIIKQQEKIKRIQQIRM 935
Cdd:TIGR02168  181 LERTRENLDR-LEdiLNELERQ------LKSLERQ--------AEK------AERYKELKAELRELELALLVLRLEELRE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  936 EKELRAQQIleakkkkkeeaanaklleAEKRTKEKELRRQQAVLlkHQELERHRLDMVwERERRRQ--------HVMLMK 1007
Cdd:TIGR02168  240 ELEELQEEL------------------KEAEEELEELTAELQEL--EEKLEELRLEVS-ELEEEIEelqkelyaLANEIS 298
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1008 AMEARKKaEEKERLKQEKRDEKRLNKER-KLEQRRLELEMAKELKKPKEDMCLADQKPLPE 1067
Cdd:TIGR02168  299 RLEQQKQ-ILRERLANLERQLEELEAQLeELESKLDELAEELAELEEKLEELKEELESLEA 358
PRK12704 PRK12704
phosphodiesterase; Provisional
879-1038 7.83e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 41.30  E-value: 7.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  879 AQIKLLRKLQKQ--EQARVAKEAKKQQAIMAAEEKRKQ---------KEQMKIIKQQEKIKRIQQIRMEKELRaqqilea 947
Cdd:PRK12704    31 AKIKEAEEEAKRilEEAKKEAEAIKKEALLEAKEEIHKlrnefekelRERRNELQKLEKRLLQKEENLDRKLE------- 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  948 kkkkkeeaanakLLEAEKR---TKEKEL-RRQQAVLLKHQELERhrldMVWERERRRQHVMLMKAMEARKkaEEKERLKQ 1023
Cdd:PRK12704   104 ------------LLEKREEeleKKEKELeQKQQELEKKEEELEE----LIEEQLQELERISGLTAEEAKE--ILLEKVEE 165
                          170       180
                   ....*....|....*....|
gi 1039761240 1024 EKRDE-----KRLNKERKLE 1038
Cdd:PRK12704   166 EARHEaavliKEIEEEAKEE 185
PHD_Cfp1 cd15553
PHD finger found in CXXC-type zinc finger protein 1 (Cfp1); Cfp1, also termed CpG-binding ...
1962-2007 8.16e-03

PHD finger found in CXXC-type zinc finger protein 1 (Cfp1); Cfp1, also termed CpG-binding protein, or PHD finger and CXXC domain-containing protein 1 (PCCX1), is a specificity factor that binds to unmethylated CpGs and links H3K4me3 with CpG islands (CGIs). It integrates both promoter CpG content and gene activity for accurate trimethylation of histone H3 Lys 4 (H3K4me3) deposition in embryonic stem cells. Moreover, Cfp1 is an essential component of the SETD1 histone H3K4 methyltransferase complex and functions as a critical regulator of histone methylation, cytosine methylation, cellular differentiation, and vertebrate development. Cfp1 contains a plant homeodomain (PHD) finger, a CXXC domain, and a CpG binding protein zinc finger C-terminal domain. Its CXXC domain selectively binds to non-methylated CpG islands, following by a preference for a guanosine nucleotide.


Pssm-ID: 277028  Cd Length: 46  Bit Score: 36.20  E-value: 8.16e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039761240 1962 YCqICRKGDNEELLLLCDGCDKGCHTYC-----HRPKITTipdgDWFCPAC 2007
Cdd:cd15553      1 YC-ICRSSDISRFMIGCDNCEEWYHGDCiniteKEAKAIK----EWYCQQC 46
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
864-1057 9.35e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.44  E-value: 9.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  864 KLLRKLQAqEIARQAAQIKLLRKLQKQ--EQARVAKEAKKQ---QAIMAAEEKRKQKEQmkiikqqeKIKRIQQIRMEKE 938
Cdd:COG4913    295 AELEELRA-ELARLEAELERLEARLDAlrEELDELEAQIRGnggDRLEQLEREIERLER--------ELEERERRRARLE 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  939 LRAQQileakkkkkeeaanaklLEAEKRTKEKELRRQQAVLlkHQELERhrldmvWERERRRQHVMLMKAMEARKKA-EE 1017
Cdd:COG4913    366 ALLAA-----------------LGLPLPASAEEFAALRAEA--AALLEA------LEEELEALEEALAEAEAALRDLrRE 420
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039761240 1018 KERLKQEKRD--EKRLNKERKLEQRRLELemAKELKKPKEDM 1057
Cdd:COG4913    421 LRELEAEIASleRRKSNIPARLLALRDAL--AEALGLDEAEL 460
RNase_Y_N pfam12072
RNase Y N-terminal region;
885-1056 9.36e-03

RNase Y N-terminal region;


Pssm-ID: 463456 [Multi-domain]  Cd Length: 201  Bit Score: 39.48  E-value: 9.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  885 RKLQKQEQA--RVAKEAKKQqaimaAEEKRKQKeqmkIIKQQEKIKRiqqirmekeLRAQqileakkkkkeeaanaklLE 962
Cdd:pfam12072   27 AKIGSAEELakRIIEEAKKE-----AETKKKEA----LLEAKEEIHK---------LRAE------------------AE 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761240  963 AEKRTKEKELRRQQAVLLKHQELERHRLDMVWERER---RRQHVMLMKAMEARKKAEEKERLKQEKRDE-KRLN------ 1032
Cdd:pfam12072   71 RELKERRNELQRQERRLLQKEETLDRKDESLEKKEEsleKKEKELEAQQQQLEEKEEELEELIEEQRQElERISgltsee 150
                          170       180
                   ....*....|....*....|....*..
gi 1039761240 1033 -KERKLEQ--RRLELEMAKELKKPKED 1056
Cdd:pfam12072  151 aKEILLDEveEELRHEAAVMIKEIEEE 177
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH