Conserved domains on  [gi|1039756159|ref|XP_017173422|]
zinc finger protein 236 isoform X4 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
COG5048  COG5048  FOG: Zn-finger [General function prediction only]  596-1001  8.60e-09
Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.09  E-value: 8.60e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  596 NEADRPYKCFYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRG--FVSAGVLKAHVRTHTGLKSFKCLICNG-AFTTGGS 672
Cdd:COG5048     28 SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDksFSRPLELSRHLRTHHNNPSDLNSKSLPlSNSKASS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  673 LRRHMGIHNDLRPYMCPYCQKTFKTSLNCKKHMKTHRYELAQQLQQHQEaSSMDDDSTVDQQSMHVAAPMPVEIESAELQ 752
Cdd:COG5048    108 SSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNS-SSVNTPQSNSLHPPLPANSLSKDPSSNLSL 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  753 QTPETVAADPESILELGPQHvvgtedaalgqQLADQPLEADEDGFTASQAPLPGHMDQFEEQGTPQPSfesagLPQGFTV 832
Cdd:COG5048    187 LISSNVSTSIPSSSENSPLS-----------SSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQ-----SPSSLSS 250
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  833 TDTYSQQTSFPPVQQLQDSSTLESQALSTSFHQQNLLQVPNSDAINvatrllPESSQEDlDLQTQGPQFLEDSEDQSRRS 912
Cdd:COG5048    251 SDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCN------ISFSRSS-PLTRHLRSVNHSGESLKPFS 323
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  913 YRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRAFVSSGVLKS-------HEKTHTGVKAFSC--SICNASFTTNGS 983
Cdd:COG5048    324 CPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNeppqslqQYKDLKNDKKSETlsNSCIRNFKRDSN 403
                          410
                   ....*....|....*...
gi 1039756159  984 LTRHMATHMSMKPYKCPF 1001
Cdd:COG5048    404 LSLHIITHLSFRPYNCKN 421
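
Each hit in this report carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be read into a record, assuming the field order and labels stay exactly as printed here. The names `PSSM_LINE` and `parse_pssm_line` are hypothetical helpers for illustration, not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the summary line as it appears in this report.
# Assumes the labels and their order never change.
PSSM_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_pssm_line(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = PSSM_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],  # e.g. "Multi-domain"; None when the bracket is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_pssm_line(
    "Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.09  E-value: 8.60e-09"
)
print(hit)
```

Keeping the numeric fields as `int` and `float` makes it straightforward to sort or filter hits by E-value afterwards.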
COG5048  COG5048  FOG: Zn-finger [General function prediction only]  140-493  4.29e-08
Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 57.78  E-value: 4.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  140 FPYSCPHCGKTFQKPSQLTRHIRIHTGERPFKCSECGKAFNQK--GALQTHMIKHTGEKPHACAfcpaafsqKGNLQSHV 217
Cdd:COG5048     32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSDLNS--------KSLPLSNS 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  218 QRVHSEVKNGPTYNCTECSCVFKSLG------SLNTHISKMHMGGPPNSTTSaeTAHVITATIFQTLPLQQVEAQVSSVS 291
Cdd:COG5048    104 KASSSSLSSSSSNSNDNNLLSSHSLPpssrdpQLPDLLSISNLRNNPLPGNN--SSSVNTPQSNSLHPPLPANSLSKDPS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  292 SEQSSQAVSDVIQQLLELSEPGPVEAQQSPQSgrQLSVTVGINQDILQQALENSGLSslpvaaPPSDCSHAQTATVSTQS 371
Cdd:COG5048    182 SNLSLLISSNVSTSIPSSSENSPLSSSYSIPS--SSSDQNLENSSSSLPLTTNSQLS------PKSLLSQSPSSLSSSDS 253
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  372 PHASSVSAEQADPMDAEQEKgqespektdkkekkllkkkSPFLPGSIREENGVRWHVCPYCTKEFRKPSDLVRHIRIHTH 451
Cdd:COG5048    254 SSSASESPRSSLPTASSQSS-------------------SPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNH 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039756159  452 E----KPFKCP--QCFRAFAVKSTLTAHIKTHTGIKAFKCQY--CMKSFS 493
Cdd:COG5048    315 SgeslKPFSCPysLCGKLFSRNDALKRHILLHTSISPAKEKLlnSSSKFS 364
FhaB  COG3210  Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport]  1283-1559  4.61e-08
Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 58.62  E-value: 4.61e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1283 ASVSAGGDLTVSL--TDGSLATLEGIQLQLAANLVGPNVQIS--GIDASSINNITLQIDPSILQQTLQQGSLLAQPITGE 1358
Cdd:COG3210    802 GTITAAGTTAINVtgSGGTITINTATTGLTGTGDTTSGAGGSntTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGS 881
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1359 SSTASQNSSLQTSDSTVPASVVIQPLSGLSLQPTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGLVSTSTGphe 1438
Cdd:COG3210    882 GGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAAS--- 958
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1439 ITLTINNSSLSQVLAQAAGPTASSSSGSPQEITLTISELNPSSGSLPSTAPMSPSAISAQNLVMSSSGVGADASVTLTLA 1518
Cdd:COG3210    959 ASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAA 1038
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1039756159 1519 DTQGVLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGS 1559
Cdd:COG3210   1039 TAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGG 1079
zf-H2C2_2  pfam13465  Zinc-finger double domain  1127-1151  2.68e-06
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.44  E-value: 2.68e-06
                           10        20
                   ....*....|....*....|....*
gi 1039756159 1127 DLVRHVRIHTGEKPYKCDECGKSFT 1151
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
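
The two rows of an alignment block can be compared column by column to count identical residues. The sketch below does this for the pfam13465 hit at residues 1127-1151, assuming both rows have equal length and that "-" marks a gap. The function name `alignment_identity` is a hypothetical helper, and the count is a plain identity tally, not the score CD-Search reports.

```python
def alignment_identity(query, subject):
    """Count identical residues over the gap-free columns of two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Keep only columns where neither row has a gap character.
    columns = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    identical = sum(1 for q, s in columns if q.upper() == s.upper())
    return identical, len(columns)


# Rows copied from the 1127-1151 alignment against pfam13465.
query_row = "DLVRHVRIHTGEKPYKCDECGKSFT"
model_row = "NLKRHMRTHTGEKPYKCPECGKSFK"

identical, compared = alignment_identity(query_row, model_row)
print(f"{identical}/{compared} identical columns")
```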
COG5048  COG5048  FOG: Zn-finger [General function prediction only]  880-1260  3.22e-06
Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  880 ATRLLPESSQEDLDLQTQGPQFLEDSEDQSRRSYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRAFVSSGVLKSH 959
Cdd:COG5048      1 ATLTSSQSSSSNNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  960 ---EKTHTGVKAFSCSICNASFTTNGSLTRHMATHMSMKPYKCPFCEEGfrtavhcrkHMKRHQAVSSAAAAAAETEGGD 1036
Cdd:COG5048     81 rhlRTHHNNPSDLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLP---------PSSRDPQLPDLLSISNLRNNPL 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1037 TCVEEDEENSDRSASRKPRPEVITFTEEETAQLAKIqPQESATVSEKV------LVQSAAEKDRISEMKDKQAELEAEPK 1110
Cdd:COG5048    152 PGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLL-ISSNVSTSIPSssenspLSSSYSIPSSSSDQNLENSSSSLPLT 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1111 HANCCTYCPKSFKKPSDLVRHVRIHTGEKPYKCDECGKSFTVKSTLDCHVKTHTGQKLFSCH-VCSNAFSTKGSLKVHMR 1189
Cdd:COG5048    231 TNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSkQCNISFSRSSPLTRHLR 310
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039756159 1190 --LHTG--AKPFKCPH--CELRFRTSGRRKTHMQFHYKSDPKKarKPVTRSSSESLQSVNLLNSSSTDPNVFIMNNS 1260
Cdd:COG5048    311 svNHSGesLKPFSCPYslCGKLFSRNDALKRHILLHTSISPAK--EKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385
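
Several hits in this list cover overlapping stretches of the query; for example, the COG5048 intervals 596-1001 and 880-1260 share residues. A small sketch of how the overlap could be measured, assuming intervals are 1-based and inclusive as printed in the Interval column. The helpers `parse_interval` and `overlap_length` are hypothetical names introduced for illustration.

```python
def parse_interval(text):
    """Turn an Interval cell such as '596-1001' into a (start, end) tuple."""
    start, end = text.split("-")
    return int(start), int(end)


def overlap_length(a, b):
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


first = parse_interval("596-1001")
third = parse_interval("880-1260")
print(overlap_length(first, third))
```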
zf-H2C2_2  pfam13465  Zinc-finger double domain  1683-1707  1.14e-05
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.51  E-value: 1.14e-05
                           10        20
                   ....*....|....*....|....*
gi 1039756159 1683 LERHSRIHTGERPFHCTLCDKAFNQ 1707
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2  pfam13465  Zinc-finger double domain  497-522  2.14e-04
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 40.05  E-value: 2.14e-04
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  497 SLKVHIRLHTGVRPFACPHCDKKFRT 522
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2  pfam13465  Zinc-finger double domain  1710-1735  1.44e-03
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.74  E-value: 1.44e-03
                           10        20
                   ....*....|....*....|....*.
gi 1039756159 1710 ALQVHLKKHTGERPYRCDYCVMGFTQ 1735
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like super family  cl41227  N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins  1605-1653  2.35e-03

Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Arabidopsis SUF4 (AtSUF4) binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


The actual alignment was detected with superfamily member cd20908:

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 2.35e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039756159 1605 CLDCDRAFSSAAVLMHHSKEVHGKerihgCRVCRKAFKRATHLKEHMLT 1653
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
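
Most hits above are C2H2-type zinc fingers. As a rough illustration, the sketch below scans one query row from the 596-1001 alignment for a simplified C2H2 spacing pattern, C-x(2,4)-C-x(12)-H-x(3,5)-H. This pattern is an assumption chosen for illustration: it is looser than curated motif definitions and is not the model CD-Search uses. The name `find_c2h2` is hypothetical, and offsets are 0-based positions within the ungapped row, not residue numbers.

```python
import re

# Simplified C2H2 spacing (an assumption for illustration):
# Cys, 2-4 residues, Cys, 12 residues, His, 3-5 residues, His.
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(aligned_row):
    """Return (offset, matched_text) pairs for non-overlapping pattern matches."""
    seq = aligned_row.replace("-", "").upper()  # drop alignment gaps
    return [(m.start(), m.group()) for m in C2H2.finditer(seq)]


# First query row of the 596-1001 alignment, gaps included as printed.
row = "NEADRPYKCFYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRG--FVSAGVLKAHVRTHTGLKSFKCLICNG-AFTTGGS"

fingers = find_c2h2(row)
for offset, text in fingers:
    print(offset, text)
```

The third finger in this row starts near its end and continues into the next alignment block, so a scan of this single row does not report it; running the same scan over the full ungapped sequence would be needed to catch fingers that span block boundaries.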
 
Name Accession Description Interval E-value
zf-H2C2_2  pfam13465  Zinc-finger double domain  927-952  6.46e-06
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.28  E-value: 6.46e-06
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  927 HLKQHVRSHTGEKPYKCKLCGRAFVS 952
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2  pfam13465  Zinc-finger double domain  157-181  6.79e-06
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.28  E-value: 6.79e-06
                           10        20
                   ....*....|....*....|....*
gi 1039756159  157 LTRHIRIHTGERPFKCSECGKAFNQ 181
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
Herpes_BLLF1  pfam05109  Herpes virus major outer envelope glycoprotein (BLLF1)  1317-1618  1.41e-05

This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.30  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1317 PNVQISGIDASSINNITLQIDPS----ILQQTLQQGSLLAQPITGESSTASQNSSLQTSDSTVPA---SVVIQPLSGLSL 1389
Cdd:pfam05109  567 PNATIPTLGKTSPTSAVTTPTPNatspTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTgqhNITSSSTSSMSL 646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1390 QPTVTSAnlTIGPLSEQDSV----LTTSS--SGSQDLSQVMTSqglvstSTGPHEITlTINNSSLSQVLAQAAGPTASSS 1463
Cdd:pfam05109  647 RPSSISE--TLSPSTSDNSTshmpLLTSAhpTGGENITQVTPA------STSTHHVS-TSSPAPRPGTTSQASGPGNSST 717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1464 SGSPQEITLTiselnpsSGSLP--STAPMSPSAISAQNLVMSSSGVGADASvtltladtqgvlSGGLDTvtlniTSQGQQ 1541
Cdd:pfam05109  718 STKPGEVNVT-------KGTPPknATSPQAPSGQKTAVPTVTSTGGKANST------------TGGKHT-----TGHGAR 773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1542 FPallTDPSLSGQGGAGSPQVILVSHT---PQSSSAAGEEIAYQVTDV-PAQLT----PHSQPEKEGLSHQCLDcdraFS 1613
Cdd:pfam05109  774 TS---TEPTTDYGGDSTTPRTRYNATTylpPSTSSKLRPRWTFTSPPVtTAQATvpvpPTSQPRFSNLSMLVLQ----WA 846

                   ....*
gi 1039756159 1614 SAAVL 1618
Cdd:pfam05109  847 SLAVL 851
SUF4-like  cd20908  N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins  600-651  2.15e-04

Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Arabidopsis SUF4 (AtSUF4) binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.39  E-value: 2.15e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039756159  600 RPYkCFYCHRAYKKSCHLKQHIRSHTgekpFKCSQCGRGFVSAGVLKAHVRT 651
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHCLQ 47
zf-H2C2_2  pfam13465  Zinc-finger double domain  1183-1208  5.93e-04
Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.51  E-value: 5.93e-04
                           10        20
                   ....*....|....*....|....*.
gi 1039756159 1183 SLKVHMRLHTGAKPFKCPHCELRFRT 1208
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SP4_N  cd22536  N-terminal domain of transcription factor Specificity Protein (SP) 4  1305-1571  1.66e-03

Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.

Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 43.37  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1305 GIQLQLAANLV---GPNVQISGIDASSINNITLQIdpsilqQTLQQGSLLAqPITGESSTASQNSSLQ-TSDSTVPasVV 1380
Cdd:cd22536    141 SVQYQVIPQIQtveGQQIQISPANATALQDLQGQI------QLIPAGNNQA-ILTTPNRTASGNIIAQnLANQTVP--VQ 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1381 IQPLSGLSLQ---------PTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGlvSTSTGPHEITLTINNSSLSqv 1451
Cdd:cd22536    212 IRPGVSIPLQlqtipgaqaQVVTTLPINIGGVTLALPVINNVAAGGGSGQLVQPSDG--GVSNGNQLVSTPITTASVS-- 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1452 laqaagpTASSSSGSPQEITLTISELNPSSGSLPSTAPMSPSAISAQNLVMSSSGVGADASVTLTLADTQgVLSGGLDTV 1531
Cdd:cd22536    288 -------TMPESPSSSTTCTTTASTSLTSSDTLVSSAETGQYASTAASSERTEEEPQTSAAESEAQSSSQ-LQSNGLQNV 359
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1039756159 1532 TLNitSQGQQFPALLTDPSLSGQGGAGSPQVILVSHTPQS 1571
Cdd:cd22536    360 QDQ--SNSLQQVQIVGQPILQQIQIQQPQQQIIQAIQPQS 397
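
The degenerate consensus NNRCRCCYY quoted in the SP4 description uses IUPAC nucleotide codes. As a minimal sketch of how such a motif might be scanned for, the Python below expands the codes into a regular expression. The helper names (`motif_to_regex`, `find_motif`) and the test sequence are illustrative assumptions, not part of the CDD record, and only the strand given is searched.

```python
import re

# IUPAC codes that appear in the motif (N, R, Y) plus the four plain bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}

def motif_to_regex(motif: str) -> str:
    """Translate a degenerate DNA motif into a regular-expression string."""
    return "".join(IUPAC[base] for base in motif.upper())

def find_motif(motif: str, sequence: str) -> list:
    """Return 0-based start positions of all matches, including overlapping ones."""
    # Wrapping the pattern in a lookahead lets overlapping matches be reported.
    lookahead = re.compile(f"(?=({motif_to_regex(motif)}))")
    return [m.start() for m in lookahead.finditer(sequence.upper())]

SP_KLF_MOTIF = "NNRCRCCYY"
# Hypothetical test sequence, not taken from any real promoter.
example = "TTAAGCGCCCTTT"
print(motif_to_regex(SP_KLF_MOTIF))
print(find_motif(SP_KLF_MOTIF, example))
```

A real promoter scan would normally repeat the search on the reverse complement as well, since binding sites can lie on either strand.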
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
144-189 1.75e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 39.08  E-value: 1.75e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039756159  144 CPHCGKTFQKPSQLTRHIRIHTgerpFKCSECGKAFNQKGALQTHM 189
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
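
The SUF4 description names two concrete recognition elements (5'-CGGAAAT-3' for OsSUF4 and 5'-CCAAATTTTAAGTTT-3' for AtSUF4). The following Python sketch shows one way to look for such an exact element on both strands of a DNA sequence. The function names and the test sequence are hypothetical, and exact string matching is an assumption made for illustration; real binding-site prediction usually tolerates mismatches.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def find_element(element: str, sequence: str) -> list:
    """Return sorted (0-based position, strand) pairs for exact occurrences.

    Positions always refer to the forward strand of `sequence`.
    """
    sequence = sequence.upper()
    hits = []
    for strand, probe in (("+", element.upper()), ("-", reverse_complement(element))):
        start = sequence.find(probe)
        while start != -1:
            hits.append((start, strand))
            start = sequence.find(probe, start + 1)
    return sorted(hits)

OSSUF4_ELEMENT = "CGGAAAT"  # 7-bp element quoted in the description
# Made-up sequence carrying one forward and one reverse-strand copy.
promoter = "TTCGGAAATGGATTTCCGAA"
print(find_element(OSSUF4_ELEMENT, promoter))
```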
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1605-1653 2.35e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 2.35e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039756159 1605 CLDCDRAFSSAAVLMHHSKEVHGKerihgCRVCRKAFKRATHLKEHMLT 1653
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1139-1188 4.19e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.92  E-value: 4.19e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1139 KPYkCDECGKSFTVKSTLDCHVKTHTgqklFSCHVCSNAFSTKGSLKVHM 1188
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
PHA00733 PHA00733
hypothetical protein
480-530 5.77e-03

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.70  E-value: 5.77e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039756159  480 IKAFKCQYCMKSFSTSGSLKVHIRL--HTGVrpfaCPHCDKKFRTSGHRKTHV 530
Cdd:PHA00733    71 VSPYVCPLCLMPFSSSVSLKQHIRYteHSKV----CPVCGKEFRNTDSTLDHV 119
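
Each hit in this listing carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. For readers who want to tabulate these values, here is a minimal Python sketch of a parser for that line. The field layout is inferred only from the lines shown on this page, and the function name and dictionary keys are assumptions made for this example.

```python
import re

# Pattern inferred from the summary lines on this page; "[Multi-domain]" is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one hit summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }

line = "Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.70  E-value: 5.77e-03"
print(parse_summary(line))
```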
 
Name Accession Description Interval E-value
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
596-1001 8.60e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.09  E-value: 8.60e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  596 NEADRPYKCFYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRG--FVSAGVLKAHVRTHTGLKSFKCLICNG-AFTTGGS 672
Cdd:COG5048     28 SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDksFSRPLELSRHLRTHHNNPSDLNSKSLPlSNSKASS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  673 LRRHMGIHNDLRPYMCPYCQKTFKTSLNCKKHMKTHRYELAQQLQQHQEaSSMDDDSTVDQQSMHVAAPMPVEIESAELQ 752
Cdd:COG5048    108 SSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNS-SSVNTPQSNSLHPPLPANSLSKDPSSNLSL 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  753 QTPETVAADPESILELGPQHvvgtedaalgqQLADQPLEADEDGFTASQAPLPGHMDQFEEQGTPQPSfesagLPQGFTV 832
Cdd:COG5048    187 LISSNVSTSIPSSSENSPLS-----------SSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQ-----SPSSLSS 250
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  833 TDTYSQQTSFPPVQQLQDSSTLESQALSTSFHQQNLLQVPNSDAINvatrllPESSQEDlDLQTQGPQFLEDSEDQSRRS 912
Cdd:COG5048    251 SDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCN------ISFSRSS-PLTRHLRSVNHSGESLKPFS 323
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  913 YRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRAFVSSGVLKS-------HEKTHTGVKAFSC--SICNASFTTNGS 983
Cdd:COG5048    324 CPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNeppqslqQYKDLKNDKKSETlsNSCIRNFKRDSN 403
                          410
                   ....*....|....*...
gi 1039756159  984 LTRHMATHMSMKPYKCPF 1001
Cdd:COG5048    404 LSLHIITHLSFRPYNCKN 421
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
140-493 4.29e-08

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 57.78  E-value: 4.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  140 FPYSCPHCGKTFQKPSQLTRHIRIHTGERPFKCSECGKAFNQK--GALQTHMIKHTGEKPHACAfcpaafsqKGNLQSHV 217
Cdd:COG5048     32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSDLNS--------KSLPLSNS 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  218 QRVHSEVKNGPTYNCTECSCVFKSLG------SLNTHISKMHMGGPPNSTTSaeTAHVITATIFQTLPLQQVEAQVSSVS 291
Cdd:COG5048    104 KASSSSLSSSSSNSNDNNLLSSHSLPpssrdpQLPDLLSISNLRNNPLPGNN--SSSVNTPQSNSLHPPLPANSLSKDPS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  292 SEQSSQAVSDVIQQLLELSEPGPVEAQQSPQSgrQLSVTVGINQDILQQALENSGLSslpvaaPPSDCSHAQTATVSTQS 371
Cdd:COG5048    182 SNLSLLISSNVSTSIPSSSENSPLSSSYSIPS--SSSDQNLENSSSSLPLTTNSQLS------PKSLLSQSPSSLSSSDS 253
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  372 PHASSVSAEQADPMDAEQEKgqespektdkkekkllkkkSPFLPGSIREENGVRWHVCPYCTKEFRKPSDLVRHIRIHTH 451
Cdd:COG5048    254 SSSASESPRSSLPTASSQSS-------------------SPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNH 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039756159  452 E----KPFKCP--QCFRAFAVKSTLTAHIKTHTGIKAFKCQY--CMKSFS 493
Cdd:COG5048    315 SgeslKPFSCPysLCGKLFSRNDALKRHILLHTSISPAKEKLlnSSSKFS 364
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1283-1559 4.61e-08

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 58.62  E-value: 4.61e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1283 ASVSAGGDLTVSL--TDGSLATLEGIQLQLAANLVGPNVQIS--GIDASSINNITLQIDPSILQQTLQQGSLLAQPITGE 1358
Cdd:COG3210    802 GTITAAGTTAINVtgSGGTITINTATTGLTGTGDTTSGAGGSntTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGS 881
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1359 SSTASQNSSLQTSDSTVPASVVIQPLSGLSLQPTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGLVSTSTGphe 1438
Cdd:COG3210    882 GGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAAS--- 958
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1439 ITLTINNSSLSQVLAQAAGPTASSSSGSPQEITLTISELNPSSGSLPSTAPMSPSAISAQNLVMSSSGVGADASVTLTLA 1518
Cdd:COG3210    959 ASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAA 1038
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1039756159 1519 DTQGVLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGS 1559
Cdd:COG3210   1039 TAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGG 1079
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1288-1576 4.83e-07

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 55.16  E-value: 4.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1288 GGDLTVslTDGSLATLEGIQLQLAANLVGPNVQISGIDASSINNITLQIDPSILQQTLqQGSLLAQPITGESSTASQNSS 1367
Cdd:COG3210    818 GGTITI--NTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAAT-AASITVGSGGVATSTGTANAG 894
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1368 LQTSDSTVPASVVIQPLSGLSLQPTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGLVSTSTGPHEITLTINNSS 1447
Cdd:COG3210    895 TLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGS 974
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1448 LSQVLAQAAGPTASSSSGSPQEITLTISELNPSSGSLPSTAPMSPSAISAQNLVMSSSGVGADASVTLTLADTQGVLSGG 1527
Cdd:COG3210    975 SAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGIS 1054
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1039756159 1528 LDTVTLNITSQGQQFPALLTDPSLSGQGGAGSPQVILVSHTPQSSSAAG 1576
Cdd:COG3210   1055 GGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGAT 1103
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1127-1151 2.68e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.44  E-value: 2.68e-06
                           10        20
                   ....*....|....*....|....*
gi 1039756159 1127 DLVRHVRIHTGEKPYKCDECGKSFT 1151
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
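
The paired `gi` and `Cdd` rows above are a pairwise alignment, so a quick percent-identity figure can be computed from them. The Python sketch below does this for the two rows of this pfam13465 hit. It assumes that `-` marks a gap and that letter case can be ignored when comparing residues; the function name is hypothetical, and this simple count is not the score CD-Search itself reports.

```python
def identity(query_row: str, subject_row: str) -> tuple:
    """Count identical residues over columns where neither row has a gap.

    Returns (identical, aligned_columns). Rows must have equal length.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query_row.upper(), subject_row.upper()):
        if q == "-" or s == "-":
            continue  # skip gap columns
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned

# Rows copied from the zf-H2C2_2 hit at residues 1127-1151.
query = "DLVRHVRIHTGEKPYKCDECGKSFT"
subject = "NLKRHMRTHTGEKPYKCPECGKSFK"
same, cols = identity(query, subject)
print(f"{same}/{cols} identical = {100 * same / cols:.1f}%")  # 19/25 identical = 76.0%
```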
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
880-1260 3.22e-06

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  880 ATRLLPESSQEDLDLQTQGPQFLEDSEDQSRRSYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRAFVSSGVLKSH 959
Cdd:COG5048      1 ATLTSSQSSSSNNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159  960 ---EKTHTGVKAFSCSICNASFTTNGSLTRHMATHMSMKPYKCPFCEEGfrtavhcrkHMKRHQAVSSAAAAAAETEGGD 1036
Cdd:COG5048     81 rhlRTHHNNPSDLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLP---------PSSRDPQLPDLLSISNLRNNPL 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1037 TCVEEDEENSDRSASRKPRPEVITFTEEETAQLAKIqPQESATVSEKV------LVQSAAEKDRISEMKDKQAELEAEPK 1110
Cdd:COG5048    152 PGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLL-ISSNVSTSIPSssenspLSSSYSIPSSSSDQNLENSSSSLPLT 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1111 HANCCTYCPKSFKKPSDLVRHVRIHTGEKPYKCDECGKSFTVKSTLDCHVKTHTGQKLFSCH-VCSNAFSTKGSLKVHMR 1189
Cdd:COG5048    231 TNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSkQCNISFSRSSPLTRHLR 310
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039756159 1190 --LHTG--AKPFKCPH--CELRFRTSGRRKTHMQFHYKSDPKKarKPVTRSSSESLQSVNLLNSSSTDPNVFIMNNS 1260
Cdd:COG5048    311 svNHSGesLKPFSCPYslCGKLFSRNDALKRHILLHTSISPAK--EKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385
zf-H2C2_2 pfam13465
Zinc-finger double domain;
927-952 6.46e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.28  E-value: 6.46e-06
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  927 HLKQHVRSHTGEKPYKCKLCGRAFVS 952
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
157-181 6.79e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.28  E-value: 6.79e-06
                           10        20
                   ....*....|....*....|....*
gi 1039756159  157 LTRHIRIHTGERPFKCSECGKAFNQ 181
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1683-1707 1.14e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.51  E-value: 1.14e-05
                           10        20
                   ....*....|....*....|....*
gi 1039756159 1683 LERHSRIHTGERPFHCTLCDKAFNQ 1707
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1317-1618 1.41e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.30  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1317 PNVQISGIDASSINNITLQIDPS----ILQQTLQQGSLLAQPITGESSTASQNSSLQTSDSTVPA---SVVIQPLSGLSL 1389
Cdd:pfam05109  567 PNATIPTLGKTSPTSAVTTPTPNatspTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTgqhNITSSSTSSMSL 646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1390 QPTVTSAnlTIGPLSEQDSV----LTTSS--SGSQDLSQVMTSqglvstSTGPHEITlTINNSSLSQVLAQAAGPTASSS 1463
Cdd:pfam05109  647 RPSSISE--TLSPSTSDNSTshmpLLTSAhpTGGENITQVTPA------STSTHHVS-TSSPAPRPGTTSQASGPGNSST 717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1464 SGSPQEITLTiselnpsSGSLP--STAPMSPSAISAQNLVMSSSGVGADASvtltladtqgvlSGGLDTvtlniTSQGQQ 1541
Cdd:pfam05109  718 STKPGEVNVT-------KGTPPknATSPQAPSGQKTAVPTVTSTGGKANST------------TGGKHT-----TGHGAR 773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1542 FPallTDPSLSGQGGAGSPQVILVSHT---PQSSSAAGEEIAYQVTDV-PAQLT----PHSQPEKEGLSHQCLDcdraFS 1613
Cdd:pfam05109  774 TS---TEPTTDYGGDSTTPRTRYNATTylpPSTSSKLRPRWTFTSPPVtTAQATvpvpPTSQPRFSNLSMLVLQ----WA 846

                   ....*
gi 1039756159 1614 SAAVL 1618
Cdd:pfam05109  847 SLAVL 851
zf-H2C2_2 pfam13465
Zinc-finger double domain;
616-641 1.77e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.13  E-value: 1.77e-05
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  616 HLKQHIRSHTGEKPFKCSQCGRGFVS 641
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1231-1576 2.15e-05

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 49.76  E-value: 2.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1231 VTRSSSESLQSVNLLNSSSTDPNVFIMNNSVLTGQFDQNVLQPGLVGQAILPASVSAGGDLTVSLTDGSLATLEGIQLQL 1310
Cdd:COG3210    509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1311 AANLVGPNVQISGIDASSINNITLQIDPSILQQTLQQGSLLAQPITGESSTASQNSSLQTSDSTVPASVVIQPLSGLSLQ 1390
Cdd:COG3210    589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1391 PTVTSANLTIGPLSEQDSVLTTSSSGSQDL-----------SQVMTSQGLVSTSTGPHEITLTINNSSLSQVLAQAAGPT 1459
Cdd:COG3210    669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTlnnagntltisTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVT 748
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1460 ASS-SSGSP------------QEITLTISELNPSSGSL--PSTAPMSPSAISAQNLVMSSSGV----GADASVTLTLADT 1520
Cdd:COG3210    749 ITSgNAGTLsigltanttasgTTLTLANANGNTSAGATldNAGAEISIDITADGTITAAGTTAinvtGSGGTITINTATT 828
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039756159 1521 qGVLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGSPQVILVSHTPQSSSAAG 1576
Cdd:COG3210    829 -GLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGG 883
zf-H2C2_2 pfam13465
Zinc-finger double domain;
497-522 2.14e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 40.05  E-value: 2.14e-04
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  497 SLKVHIRLHTGVRPFACPHCDKKFRT 522
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
600-651 2.15e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.39  E-value: 2.15e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039756159  600 RPYkCFYCHRAYKKSCHLKQHIRSHTgekpFKCSQCGRGFVSAGVLKAHVRT 651
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHCLQ 47
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
142-164 2.33e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.98  E-value: 2.33e-04
                           10        20
                   ....*....|....*....|...
gi 1039756159  142 YSCPHCGKTFQKPSQLTRHIRIH 164
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
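
The pattern quoted in the zf-C2H2 description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], translates fairly directly into a regular expression. The Python sketch below is one such translation, treating both X and the fold-stabilising # positions as "any residue", which is a simplifying assumption. The function name and the `first_residue` parameter are illustrative; this is not how CD-Search scores domains, which relies on position-specific scoring matrices.

```python
import re

AA = "[A-Z]"  # any residue; '#' positions are also treated as unconstrained here
C2H2_RE = re.compile(
    f"{AA}{AA}"         # #-X
    f"C{AA}{{1,5}}C"    # C-X(1-5)-C
    f"{AA}{{3}}{AA}"    # X3-#
    f"{AA}{{5}}{AA}"    # X5-#
    f"{AA}{{2}}H"       # X2-H
    f"{AA}{{3,6}}[HC]"  # X(3-6)-[H/C]
)

def find_c2h2(sequence: str, first_residue: int = 1) -> list:
    """Return 1-based inclusive (start, end) spans of non-overlapping matches.

    `first_residue` is the residue number of sequence[0] in the full protein.
    """
    return [
        (first_residue + m.start(), first_residue + m.end() - 1)
        for m in C2H2_RE.finditer(sequence.upper())
    ]

# Residues 140-169 of the query, taken from the alignment rows on this page.
fragment = "FPYSCPHCGKTFQKPSQLTRHIRIHTGERP"
print(find_c2h2(fragment, first_residue=140))  # [(142, 164)]
```

On this fragment the match spans residues 142-164, the same interval reported for the zf-C2H2 hit above. On longer sequences a plain regular expression like this can both miss and over-call fingers.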
zf-H2C2_2 pfam13465
Zinc-finger double domain;
983-1008 3.85e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 3.85e-04
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  983 SLTRHMATHMSMKPYKCPFCEEGFRT 1008
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
628-677 4.44e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.62  E-value: 4.44e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039756159  628 KPFkCSQCGRGFVSAGVLKAHVRTHTglksFKCLICNGAFTTGGSLRRHM 677
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1183-1208 5.93e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.51  E-value: 5.93e-04
                           10        20
                   ....*....|....*....|....*.
gi 1039756159 1183 SLKVHMRLHTGAKPFKCPHCELRFRT 1208
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
915-959 6.08e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation of H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.23  E-value: 6.08e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1039756159  915 CDYCNKGFKKSSHLKQHVRSHTgekpYKCKLCGRAFVSSGVLKSH 959
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
913-935 1.31e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 37.66  E-value: 1.31e-03
                           10        20
                   ....*....|....*....|...
gi 1039756159  913 YRCDYCNKGFKKSSHLKQHVRSH 935
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
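The pattern quoted in the pfam00096 description can be checked mechanically. The following is a minimal Python sketch that translates `#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]` into a regular expression and applies it to the query segment and the model consensus from the alignment above. Treating the `#` positions as "any residue" is an assumption, since the description says only that they matter for the fold; the names `C2H2_PATTERN` and `is_c2h2` are illustrative and not part of any NCBI tool.

```python
import re

# Literal translation of the pattern in the pfam00096 description.
# Assumption: '#' (fold-stabilising position) and 'X' both accept any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def is_c2h2(segment: str) -> bool:
    """Return True if the whole segment fits the C2H2 pattern."""
    return C2H2_PATTERN.fullmatch(segment.upper()) is not None


query = "YRCDYCNKGFKKSSHLKQHVRSH"      # query residues 913-935
consensus = "YKCPDCGKSFSRKSNLKRHLRTH"  # pfam00096 model, positions 1-23

print(is_c2h2(query), is_c2h2(consensus))
```

Both segments fit the pattern. Replacing either cysteine or the first histidine with another residue makes `is_c2h2` return `False`, which is a quick way to see which positions the pattern actually constrains.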
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1710-1735 1.44e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.74  E-value: 1.44e-03
                           10        20
                   ....*....|....*....|....*.
gi 1039756159 1710 ALQVHLKKHTGERPYRCDYCVMGFTQ 1735
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
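The alignment rows in these hit records follow a regular layout (label, start coordinate, aligned residues, end coordinate), so they can be parsed and cross-checked against the reported interval. The sketch below is a minimal Python example using the two rows of the hit above. The helper names (`parse_row`, `residue_count`, `identities`) are hypothetical, and the handling of `-` as a gap and lower-case letters as unaligned residues is an assumption about the display convention.

```python
import re

# One alignment row: label, start coordinate, aligned sequence, end coordinate.
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+([A-Za-z-]+)\s+(\d+)$")


def parse_row(line):
    """Split an alignment row into (label, start, sequence, end)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, seq, end = match.groups()
    return label, int(start), seq, int(end)


def residue_count(seq):
    """Count residues, skipping '-' gap characters (assumed convention)."""
    return sum(1 for ch in seq if ch != "-")


def identities(query_seq, model_seq):
    """Count columns where both rows show the same upper-case residue."""
    return sum(
        1 for q, m in zip(query_seq, model_seq)
        if q == m and q.isupper()
    )


query_row = "gi 1039756159 1710 ALQVHLKKHTGERPYRCDYCVMGFTQ 1735"
model_row = "Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26"

_, q_start, q_seq, q_end = parse_row(query_row)
_, m_start, m_seq, m_end = parse_row(model_row)

# The coordinates should span exactly the residues shown in each row.
assert q_end - q_start + 1 == residue_count(q_seq)
assert m_end - m_start + 1 == residue_count(m_seq)

print(f"{q_start}-{q_end}", identities(q_seq, m_seq))
```

The printed interval can be compared with the "Interval" value in the hit header; a mismatch would suggest a row was truncated or wrapped during extraction.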
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
1305-1571 1.66e-03

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 43.37  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1305 GIQLQLAANLV---GPNVQISGIDASSINNITLQIdpsilqQTLQQGSLLAqPITGESSTASQNSSLQ-TSDSTVPasVV 1380
Cdd:cd22536    141 SVQYQVIPQIQtveGQQIQISPANATALQDLQGQI------QLIPAGNNQA-ILTTPNRTASGNIIAQnLANQTVP--VQ 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1381 IQPLSGLSLQ---------PTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGlvSTSTGPHEITLTINNSSLSqv 1451
Cdd:cd22536    212 IRPGVSIPLQlqtipgaqaQVVTTLPINIGGVTLALPVINNVAAGGGSGQLVQPSDG--GVSNGNQLVSTPITTASVS-- 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1452 laqaagpTASSSSGSPQEITLTISELNPSSGSLPSTAPMSPSAISAQNLVMSSSGVGADASVTLTLADTQgVLSGGLDTV 1531
Cdd:cd22536    288 -------TMPESPSSSTTCTTTASTSLTSSDTLVSSAETGQYASTAASSERTEEEPQTSAAESEAQSSSQ-LQSNGLQNV 359
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1039756159 1532 TLNitSQGQQFPALLTDPSLSGQGGAGSPQVILVSHTPQS 1571
Cdd:cd22536    360 QDQ--SNSLQQVQIVGQPILQQIQIQQPQQQIIQAIQPQS 397
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
144-189 1.75e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 39.08  E-value: 1.75e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1039756159  144 CPHCGKTFQKPSQLTRHIRIHTgerpFKCSECGKAFNQKGALQTHM 189
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1605-1653 2.35e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 2.35e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039756159 1605 CLDCDRAFSSAAVLMHHSKEVHGKerihgCRVCRKAFKRATHLKEHMLT 1653
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
170-192 2.43e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.89  E-value: 2.43e-03
                           10        20
                   ....*....|....*....|...
gi 1039756159  170 FKCSECGKAFNQKGALQTHMIKH 192
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
427-449 3.04e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 3.04e-03
                           10        20
                   ....*....|....*....|...
gi 1039756159  427 HVCPYCTKEFRKPSDLVRHIRIH 449
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
453-501 3.15e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.31  E-value: 3.15e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039756159  453 KPFkCPQCFRAFAVKSTLTAHIKTHTgikaFKCQYCMKSFSTSGSLKVH 501
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1139-1188 4.19e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.92  E-value: 4.19e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1139 KPYkCDECGKSFTVKSTLDCHVKTHTgqklFSCHVCSNAFSTKGSLKVHM 1188
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-H2C2_2 pfam13465
Zinc-finger double domain;
672-697 4.44e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.20  E-value: 4.44e-03
                           10        20
                   ....*....|....*....|....*.
gi 1039756159  672 SLRRHMGIHNDLRPYMCPYCQKTFKT 697
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PHA00733 PHA00733
hypothetical protein
480-530 5.77e-03

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.70  E-value: 5.77e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039756159  480 IKAFKCQYCMKSFSTSGSLKVHIRL--HTGVrpfaCPHCDKKFRTSGHRKTHV 530
Cdd:PHA00733    71 VSPYVCPLCLMPFSSSVSLKQHIRYteHSKV----CPVCGKEFRNTDSTLDHV 119
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
1660-1729 6.92e-03

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 41.22  E-value: 6.92e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039756159 1660 LSSQKPRVFKCDSCEKAFAKPSQLERHSRIHTGERPFHCTL--CDKAFNQKSALQVHLKKHTGERPYRCDYC 1729
Cdd:COG5048     26 SLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYsgCDKSFSRPLELSRHLRTHHNNPSDLNSKS 97
SP3_N cd22537
N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins ...
1281-1596 7.73e-03

N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP1 and SP3 can interact with and recruit a large number of proteins including the transcription initiation complex, histone modifying enzymes, and chromatin remodeling complexes, which strongly suggest that SP1 and SP3 are important transcription factors in remodeling chromatin and the regulation of gene expression. SP3 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP3.


Pssm-ID: 411774 [Multi-domain]  Cd Length: 574  Bit Score: 41.09  E-value: 7.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1281 LPASVSAGGDLTVSLTDGSLATLEGIQLQLAAN---LVGPNVQISGIDASSInnitLQIDPSILQQTLQQGSLLAQPITG 1357
Cdd:cd22537     24 SPSPGDDAAAAGNAASAGQTGDLASAQLTGAPNrweVLTPTPTTIKDEAGNL----VQIPGGGTVTSSGQYVLPLQSLQN 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1358 ES--STASQNSSLQTSDSTVPASVV--IQPLSGLSLQ--PTVTSANLTIGPLSEQDSVLTTSSSGSQDLSQVMTSQGLVS 1431
Cdd:cd22537    100 QQifSVAPGSDASNGTVPNVQYQVIpqIQTTDGQQVQlgFATSSDNTGLQQEGGQIQIIPGSNQTIIASGTPSAVQQLLS 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1432 TSTGPHEI-TLTINNSSL---SQVLAQAAgptasssSGSPQEITLT-ISELNPSS-GSLPSTAPMSPSAISAQNLVM--- 1502
Cdd:cd22537    180 QSGHVVQIqGVSIGGSSFpgqTQVVANVP-------LGLPGNITFVpINSVDLDSlGLSGTSQTMTTGITADGQLINtgq 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039756159 1503 ---SSSGVGADASVTLTLADTQG-------VLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGSPQVILVSHTPQSS 1572
Cdd:cd22537    253 avqSSDNSGESGKVSPDINETNTnadlfvpTSSSSQLPVTIDSTGILQQNASSLTTVSGQVHTSDLQGNYIQAPVSDETQ 332
                          330       340
                   ....*....|....*....|....
gi 1039756159 1573 SAAGEEIAYQVTDVPAQLTPHSQP 1596
Cdd:cd22537    333 AQNIQVSTAQPSVQQIQLHESQQP 356
zf-H2C2_2 pfam13465
Zinc-finger double domain;
470-494 7.85e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.42  E-value: 7.85e-03
                           10        20
                   ....*....|....*....|....*
gi 1039756159  470 LTAHIKTHTGIKAFKCQYCMKSFST 494
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
969-991 9.04e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 9.04e-03
                           10        20
                   ....*....|....*....|...
gi 1039756159  969 FSCSICNASFTTNGSLTRHMATH 991
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
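
Every hit listed above should fall under the E-value threshold given in the search parameters. The following is a minimal Python sketch that parses the per-hit "Pssm-ID" summary line and applies that cutoff. The regular expression and the names `SUMMARY`, `parse_summary`, and `E_VALUE_THRESHOLD` are illustrative assumptions based on the line layout seen on this page, not an official CD-Search output parser.

```python
import re

# Layout of the per-hit summary line as it appears on this page.
# The "[Multi-domain]" tag is optional (the PHA00733 hit lacks it).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)

E_VALUE_THRESHOLD = 0.01  # from the "Preset Options" line


def parse_summary(line):
    """Return the fields of a Pssm-ID summary line as a dict."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["tag"] == "Multi-domain",
        "cd_length": int(match["length"]),
        "bit_score": float(match["bits"]),
        "e_value": float(match["evalue"]),
    }


lines = [
    "Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 9.04e-03",
    "Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.70  E-value: 5.77e-03",
]

hits = [parse_summary(line) for line in lines]
passing = [hit for hit in hits if hit["e_value"] < E_VALUE_THRESHOLD]
print(len(passing), "of", len(hits), "hits pass the threshold")
```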
