NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034561584|ref|XP_016857727|]
View 

zinc finger and SCAN domain-containing protein 20 isoform X2 [Homo sapiens]

Protein Classification

SCAN and COG5048 domain-containing protein( domain architecture ID 12029794)

protein containing domains SCAN, Myb_DNA-bind_4, and COG5048

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SCAN pfam02023
SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found ...
47-132 2.48e-50

SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerization.


:

Pssm-ID: 460417 [Multi-domain]  Cd Length: 89  Bit Score: 171.90  E-value: 2.48e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584   47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:pfam02023    1 PEASRQRFRQFCYQEAEGPREALSQLRELCHQWLRPEKHTKEQILELLVLEQFLTILPEEIQSWVREHHPESGEEAVALA 80

                   ....*.
gi 1034561584  127 EDWHRE 132
Cdd:pfam02023   81 EDLLLE 86
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
698-1028 1.59e-21

FOG: Zn-finger [General function prediction only];


:

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 99.00  E-value: 1.59e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  698 DHQGLYLAEKPYKCDTCMKSFSRSSHFIAHQRIHTGEKPYKCL--ECGKNFSDRSNLNTHQRIHTGEKPYKC------LE 769
Cdd:COG5048     23 TLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSysGCDKSFSRPLELSRHLRTHHNNPSDLNskslplSN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  770 CGKSFSDHSNLITHQRIHTGEKPYKCGEcwKSFNQSSNLLkHQRIHLGGNPDQ-CSEPGGNFAQSPSFSA-HWRNSTeet 847
Cdd:COG5048    103 SKASSSSLSSSSSNSNDNNLLSSHSLPP--SSRDPQLPDL-LSISNLRNNPLPgNNSSSVNTPQSNSLHPpLPANSL--- 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  848 apeqPQSISKDLNSPGPHSTNSGEKLYECSECGRSFSKSSALISHQRIHTGEKPYECAECGKSFSKSSTLANHQRTHtge 927
Cdd:COG5048    177 ----SKDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLS--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  928 KPYKCVDCGKCFS-----ERSKLITHQRVHTGE-----KPYKCLECGKFFRDRSNLITHQR--IHTGE--KPYKCRE--C 991
Cdd:COG5048    250 SSDSSSSASESPRsslptASSQSSSPNESDSSSekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslC 329
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1034561584  992 GKCFNQSSSLIIHQRIHTGEKPYKC--TECGKDFNNSSH 1028
Cdd:COG5048    330 GKLFSRNDALKRHILLHTSISPAKEklLNSSSKFSPLLN 368
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
323-404 2.58e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


:

Pssm-ID: 463994  Cd Length: 84  Bit Score: 77.69  E-value: 2.58e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  323 GVHWGYEETKTFLAILSEsPFSEKLRT-CHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSH--PPGTC 399
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  400 PFYEE 404
Cdd:pfam13837   80 PFFEE 84
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
482-563 4.11e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


:

Pssm-ID: 463994  Cd Length: 84  Bit Score: 76.92  E-value: 4.11e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  482 GVHWGYEETKAFLAILSEsPFSEKLRT-CHQNSQVYRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSH--PPGTC 558
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  559 PFYEE 563
Cdd:pfam13837   80 PFFEE 84
 
Name Accession Description Interval E-value
SCAN pfam02023
SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found ...
47-132 2.48e-50

SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerization.


Pssm-ID: 460417 [Multi-domain]  Cd Length: 89  Bit Score: 171.90  E-value: 2.48e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584   47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:pfam02023    1 PEASRQRFRQFCYQEAEGPREALSQLRELCHQWLRPEKHTKEQILELLVLEQFLTILPEEIQSWVREHHPESGEEAVALA 80

                   ....*.
gi 1034561584  127 EDWHRE 132
Cdd:pfam02023   81 EDLLLE 86
SCAN smart00431
leucine rich region;
47-138 6.49e-50

leucine rich region;


Pssm-ID: 128708 [Multi-domain]  Cd Length: 113  Bit Score: 171.72  E-value: 6.49e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584    47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:smart00431    1 PEIFRQRFRQFRYQETSGPREALSRLRELCRQWLRPELHTKEQILELLVLEQFLTILPGELQAWVREHHPESGEEAVTLL 80
                            90
                    ....*....|..
gi 1034561584   127 EDWHRETRTAGQ 138
Cdd:smart00431   81 EDLERELDEPGQ 92
SCAN cd07936
SCAN oligomerization domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 ...
47-128 6.79e-40

SCAN oligomerization domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several vertebrate proteins that contain C2H2 zinc finger motifs, many of which may be transcription factors playing roles in cell survival and differentiation. This protein-interaction domain is able to mediate homo- and hetero-oligomerization of SCAN-containing proteins. Some SCAN-containing proteins, including those of lower vertebrates, do not contain zinc finger motifs. It has been noted that the SCAN domain resembles a domain-swapped version of the C-terminal domain of the HIV capsid protein. This domain model features elements common to the three general groups of SCAN domains (SCAN-A1, SCAN-A2, and SCAN-B). The SCAND1 protein is truncated at the C-terminus with respect to this model, the SCAND2 protein appears to have a truncated central helix.


Pssm-ID: 153421  Cd Length: 85  Bit Score: 142.01  E-value: 6.79e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584   47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:cd07936      1 PETYRQRFRAFQYQEASGPREALQRLRELCRQWLRPEIHTKEQILELLVLEQFLIILPPEVQAWVRERKPESGEEAATLA 80

                   ..
gi 1034561584  127 ED 128
Cdd:cd07936     81 ED 82
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
698-1028 1.59e-21

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 99.00  E-value: 1.59e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  698 DHQGLYLAEKPYKCDTCMKSFSRSSHFIAHQRIHTGEKPYKCL--ECGKNFSDRSNLNTHQRIHTGEKPYKC------LE 769
Cdd:COG5048     23 TLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSysGCDKSFSRPLELSRHLRTHHNNPSDLNskslplSN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  770 CGKSFSDHSNLITHQRIHTGEKPYKCGEcwKSFNQSSNLLkHQRIHLGGNPDQ-CSEPGGNFAQSPSFSA-HWRNSTeet 847
Cdd:COG5048    103 SKASSSSLSSSSSNSNDNNLLSSHSLPP--SSRDPQLPDL-LSISNLRNNPLPgNNSSSVNTPQSNSLHPpLPANSL--- 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  848 apeqPQSISKDLNSPGPHSTNSGEKLYECSECGRSFSKSSALISHQRIHTGEKPYECAECGKSFSKSSTLANHQRTHtge 927
Cdd:COG5048    177 ----SKDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLS--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  928 KPYKCVDCGKCFS-----ERSKLITHQRVHTGE-----KPYKCLECGKFFRDRSNLITHQR--IHTGE--KPYKCRE--C 991
Cdd:COG5048    250 SSDSSSSASESPRsslptASSQSSSPNESDSSSekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslC 329
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1034561584  992 GKCFNQSSSLIIHQRIHTGEKPYKC--TECGKDFNNSSH 1028
Cdd:COG5048    330 GKLFSRNDALKRHILLHTSISPAKEklLNSSSKFSPLLN 368
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
323-404 2.58e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


Pssm-ID: 463994  Cd Length: 84  Bit Score: 77.69  E-value: 2.58e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  323 GVHWGYEETKTFLAILSEsPFSEKLRT-CHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSH--PPGTC 399
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  400 PFYEE 404
Cdd:pfam13837   80 PFFEE 84
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
482-563 4.11e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


Pssm-ID: 463994  Cd Length: 84  Bit Score: 76.92  E-value: 4.11e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  482 GVHWGYEETKAFLAILSEsPFSEKLRT-CHQNSQVYRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSH--PPGTC 558
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  559 PFYEE 563
Cdd:pfam13837   80 PFFEE 84
GT1 cd12203
GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription ...
485-550 4.09e-07

GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription factors. GT-1 binds the GT cis-element of rbcS-3A, a light-induced gene, as a dimer. Arabidopsis GT-1 is a trans-activator and acts in the stabilization of components of the transcription pre-initiation complex comprised of TFIIA-TBP-TATA. The isolated GT-1 DNA-binding domain is sufficient to bind DNA. This region closely resembles the myb domain, but with longer helices. It has been proposed that GT-1 may respond to light signals via calcium-dependent phosphorylation to create a light-modulated molecular switch. These proteins are members of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of the DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 213402  Cd Length: 66  Bit Score: 48.05  E-value: 4.09e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034561584  485 WGYEETKAFLAILSE--SPFSEKLRtchqNSQVYRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAK 550
Cdd:cd12203      3 WPREETLSLIRLRREmeSRFQETKS----KKALWEEIAAKMRELGYNRSAKQCKEKWENLNKYYKKVK 66
zf-H2C2_2 pfam13465
Zinc-finger double domain;
751-775 2.51e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.67  E-value: 2.51e-06
                           10        20
                   ....*....|....*....|....*
gi 1034561584  751 NLNTHQRIHTGEKPYKCLECGKSFS 775
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
GT1 cd12203
GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription ...
326-391 2.66e-06

GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription factors. GT-1 binds the GT cis-element of rbcS-3A, a light-induced gene, as a dimer. Arabidopsis GT-1 is a trans-activator and acts in the stabilization of components of the transcription pre-initiation complex comprised of TFIIA-TBP-TATA. The isolated GT-1 DNA-binding domain is sufficient to bind DNA. This region closely resembles the myb domain, but with longer helices. It has been proposed that GT-1 may respond to light signals via calcium-dependent phosphorylation to create a light-modulated molecular switch. These proteins are members of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of the DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 213402  Cd Length: 66  Bit Score: 45.73  E-value: 2.66e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034561584  326 WGYEETKTFLAILSE--SPFSEKLRtchqNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAK 391
Cdd:cd12203      3 WPREETLSLIRLRREmeSRFQETKS----KKALWEEIAAKMRELGYNRSAKQCKEKWENLNKYYKKVK 66
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
956-1008 9.59e-05

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.77  E-value: 9.59e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034561584  956 KPYkCLECGKFFRDRSNLITHQRIHTgekpYKCRECGKCFNQSSSLIIH-QRIH 1008
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
ZnF_C2H2 smart00355
zinc finger;
765-787 9.75e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 34.75  E-value: 9.75e-03
                            10        20
                    ....*....|....*....|...
gi 1034561584   765 YKCLECGKSFSDHSNLITHQRIH 787
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRTH 23
 
Name Accession Description Interval E-value
SCAN pfam02023
SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found ...
47-132 2.48e-50

SCAN domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerization.


Pssm-ID: 460417 [Multi-domain]  Cd Length: 89  Bit Score: 171.90  E-value: 2.48e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584   47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:pfam02023    1 PEASRQRFRQFCYQEAEGPREALSQLRELCHQWLRPEKHTKEQILELLVLEQFLTILPEEIQSWVREHHPESGEEAVALA 80

                   ....*.
gi 1034561584  127 EDWHRE 132
Cdd:pfam02023   81 EDLLLE 86
SCAN smart00431
leucine rich region;
47-138 6.49e-50

leucine rich region;


Pssm-ID: 128708 [Multi-domain]  Cd Length: 113  Bit Score: 171.72  E-value: 6.49e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584    47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:smart00431    1 PEIFRQRFRQFRYQETSGPREALSRLRELCRQWLRPELHTKEQILELLVLEQFLTILPGELQAWVREHHPESGEEAVTLL 80
                            90
                    ....*....|..
gi 1034561584   127 EDWHRETRTAGQ 138
Cdd:smart00431   81 EDLERELDEPGQ 92
SCAN cd07936
SCAN oligomerization domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 ...
47-128 6.79e-40

SCAN oligomerization domain; The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several vertebrate proteins that contain C2H2 zinc finger motifs, many of which may be transcription factors playing roles in cell survival and differentiation. This protein-interaction domain is able to mediate homo- and hetero-oligomerization of SCAN-containing proteins. Some SCAN-containing proteins, including those of lower vertebrates, do not contain zinc finger motifs. It has been noted that the SCAN domain resembles a domain-swapped version of the C-terminal domain of the HIV capsid protein. This domain model features elements common to the three general groups of SCAN domains (SCAN-A1, SCAN-A2, and SCAN-B). The SCAND1 protein is truncated at the C-terminus with respect to this model, the SCAND2 protein appears to have a truncated central helix.


Pssm-ID: 153421  Cd Length: 85  Bit Score: 142.01  E-value: 6.79e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584   47 PEASRQRFRQFQYRDAAGPHEAFSQLWALCCRWLRPEIRLKEQILELLVLEQFLTILPREVQTWVQARHPESGEEAVALV 126
Cdd:cd07936      1 PETYRQRFRAFQYQEASGPREALQRLRELCRQWLRPEIHTKEQILELLVLEQFLIILPPEVQAWVRERKPESGEEAATLA 80

                   ..
gi 1034561584  127 ED 128
Cdd:cd07936     81 ED 82
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
698-1028 1.59e-21

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 99.00  E-value: 1.59e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  698 DHQGLYLAEKPYKCDTCMKSFSRSSHFIAHQRIHTGEKPYKCL--ECGKNFSDRSNLNTHQRIHTGEKPYKC------LE 769
Cdd:COG5048     23 TLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSysGCDKSFSRPLELSRHLRTHHNNPSDLNskslplSN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  770 CGKSFSDHSNLITHQRIHTGEKPYKCGEcwKSFNQSSNLLkHQRIHLGGNPDQ-CSEPGGNFAQSPSFSA-HWRNSTeet 847
Cdd:COG5048    103 SKASSSSLSSSSSNSNDNNLLSSHSLPP--SSRDPQLPDL-LSISNLRNNPLPgNNSSSVNTPQSNSLHPpLPANSL--- 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  848 apeqPQSISKDLNSPGPHSTNSGEKLYECSECGRSFSKSSALISHQRIHTGEKPYECAECGKSFSKSSTLANHQRTHtge 927
Cdd:COG5048    177 ----SKDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLS--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  928 KPYKCVDCGKCFS-----ERSKLITHQRVHTGE-----KPYKCLECGKFFRDRSNLITHQR--IHTGE--KPYKCRE--C 991
Cdd:COG5048    250 SSDSSSSASESPRsslptASSQSSSPNESDSSSekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslC 329
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1034561584  992 GKCFNQSSSLIIHQRIHTGEKPYKC--TECGKDFNNSSH 1028
Cdd:COG5048    330 GKLFSRNDALKRHILLHTSISPAKEklLNSSSKFSPLLN 368
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
323-404 2.58e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


Pssm-ID: 463994  Cd Length: 84  Bit Score: 77.69  E-value: 2.58e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  323 GVHWGYEETKTFLAILSEsPFSEKLRT-CHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSH--PPGTC 399
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  400 PFYEE 404
Cdd:pfam13837   80 PFFEE 84
Myb_DNA-bind_4 pfam13837
Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb ...
482-563 4.11e-17

Myb/SANT-like DNA-binding domain; This presumed domain appears to be related to other Myb/SANT-like DNA binding domains. In particular pfam10545 seems most related. This family is greatly expanded in plants and appears in several proteins annotated as transposon proteins.


Pssm-ID: 463994  Cd Length: 84  Bit Score: 76.92  E-value: 4.11e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  482 GVHWGYEETKAFLAILSEsPFSEKLRT-CHQNSQVYRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSH--PPGTC 558
Cdd:pfam13837    1 RNKWTEEETLALIEIWGE-RLELRFQEsKKRNKKLWEEIAEKMAELGYNRSPEQCKEKWENLKKKYRKEKEGNngSGSSW 79

                   ....*
gi 1034561584  559 PFYEE 563
Cdd:pfam13837   80 PFFEE 84
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
735-1035 1.99e-14

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 77.04  E-value: 1.99e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  735 KPYKCLECGKNFSDRSNLNTHQRIHTGEKPYKCL--ECGKSFSDHSNLITHQRIHTgekpykcgecwksfNQSSNLLKHQ 812
Cdd:COG5048     32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSysGCDKSFSRPLELSRHLRTHH--------------NNPSDLNSKS 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  813 RIHLGGNPDQCSEPGGNFaqspsfsahwrNSTEETAPEQ----PQSISKDLNSPGPHSTNSGEKLYECSECGRSFSKSSA 888
Cdd:COG5048     98 LPLSNSKASSSSLSSSSS-----------NSNDNNLLSShslpPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  889 LIShqrihtgekPYECAECGKSFSKSSTLANHQRTHTGEKPYKCVDCGKCFSERSKLITHQRVHTGEKPYKCLECGKFFR 968
Cdd:COG5048    167 LHP---------PLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSP 237
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034561584  969 DRSNLITHQRIHTGEKPYKCRECGKCFNQSSSLIIHQRIHTGE-------KPYKCTECGKDFNNSSHFSAHRRT 1035
Cdd:COG5048    238 KSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRS 311
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
687-1032 1.25e-13

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 74.73  E-value: 1.25e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  687 SSSEEDLEKLIDHQGLYLAEKPYKC--DTCMKSFSRSSHFIAHQRIHTGEKPYKCL----------------ECGKNFS- 747
Cdd:COG5048     40 TDSFSRLEHLTRHIRSHTGEKPSQCsySGCDKSFSRPLELSRHLRTHHNNPSDLNSkslplsnskasssslsSSSSNSNd 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  748 --------------DRSNLNTHQRIHTGEKPYKclECGKSF--SDHSNLITHQ--RIHTGEKPYKCGEC----WKSFNQS 805
Cdd:COG5048    120 nnllsshslppssrDPQLPDLLSISNLRNNPLP--GNNSSSvnTPQSNSLHPPlpANSLSKDPSSNLSLlissNVSTSIP 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  806 SNLLKHQRIHLGGNPDQCSEPGGNFAQSPSFSAHWRNSTEETAPEQPQS--ISKDLNS----------PGPHSTNSGEKL 873
Cdd:COG5048    198 SSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSslSSSDSSSsasesprsslPTASSQSSSPNE 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  874 YE------------CSECGRSFSKSSALISHQR--IHTGE--KPYECAE--CGKSFSKSSTLANHQRTHTGEKPYKCVDC 935
Cdd:COG5048    278 SDsssekgfslpikSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISPAKEKLL 357
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  936 GKCFSERSKLIT--HQRVH-----TGEKPYKCL--ECGKFFRDRSNLITHQRIHTGEKP--YKCRECGKCFNQSSSLIIH 1004
Cdd:COG5048    358 NSSSKFSPLLNNepPQSLQqykdlKNDKKSETLsnSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRHYNLIPH 437
                          410       420
                   ....*....|....*....|....*...
gi 1034561584 1005 QRIHTgEKPYKCTECGKDFNNSSHFSAH 1032
Cdd:COG5048    438 KKIHT-NHAPLLCSILKSFRRDLDLSNH 464
GT1 cd12203
GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription ...
485-550 4.09e-07

GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription factors. GT-1 binds the GT cis-element of rbcS-3A, a light-induced gene, as a dimer. Arabidopsis GT-1 is a trans-activator and acts in the stabilization of components of the transcription pre-initiation complex comprised of TFIIA-TBP-TATA. The isolated GT-1 DNA-binding domain is sufficient to bind DNA. This region closely resembles the myb domain, but with longer helices. It has been proposed that GT-1 may respond to light signals via calcium-dependent phosphorylation to create a light-modulated molecular switch. These proteins are members of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of the DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 213402  Cd Length: 66  Bit Score: 48.05  E-value: 4.09e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034561584  485 WGYEETKAFLAILSE--SPFSEKLRtchqNSQVYRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAK 550
Cdd:cd12203      3 WPREETLSLIRLRREmeSRFQETKS----KKALWEEIAAKMRELGYNRSAKQCKEKWENLNKYYKKVK 66
zf-H2C2_2 pfam13465
Zinc-finger double domain;
751-775 2.51e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.67  E-value: 2.51e-06
                           10        20
                   ....*....|....*....|....*
gi 1034561584  751 NLNTHQRIHTGEKPYKCLECGKSFS 775
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
zf-H2C2_2 pfam13465
Zinc-finger double domain;
972-997 2.63e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 44.67  E-value: 2.63e-06
                           10        20
                   ....*....|....*....|....*.
gi 1034561584  972 NLITHQRIHTGEKPYKCRECGKCFNQ 997
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
GT1 cd12203
GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription ...
326-391 2.66e-06

GT1, myb-like, SANT family; GT-1, a myb-like protein, is one of the GT trihelix transcription factors. GT-1 binds the GT cis-element of rbcS-3A, a light-induced gene, as a dimer. Arabidopsis GT-1 is a trans-activator and acts in the stabilization of components of the transcription pre-initiation complex comprised of TFIIA-TBP-TATA. The isolated GT-1 DNA-binding domain is sufficient to bind DNA. This region closely resembles the myb domain, but with longer helices. It has been proposed that GT-1 may respond to light signals via calcium-dependent phosphorylation to create a light-modulated molecular switch. These proteins are members of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of the DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 213402  Cd Length: 66  Bit Score: 45.73  E-value: 2.66e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034561584  326 WGYEETKTFLAILSE--SPFSEKLRtchqNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAK 391
Cdd:cd12203      3 WPREETLSLIRLRREmeSRFQETKS----KKALWEEIAAKMRELGYNRSAKQCKEKWENLNKYYKKVK 66
zf-H2C2_2 pfam13465
Zinc-finger double domain;
888-913 8.61e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.13  E-value: 8.61e-06
                           10        20
                   ....*....|....*....|....*.
gi 1034561584  888 ALISHQRIHTGEKPYECAECGKSFSK 913
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
678-811 1.11e-05

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 49.31  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  678 NPSQEQWQESSSEedlekliDHQGLYLaekPYKCDTCMKSFSRSSHFIAHQR--IHTGE--KPYKCLE--CGKNFSDRSN 751
Cdd:COG5048    269 SSQSSSPNESDSS-------SEKGFSL---PIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDA 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034561584  752 LNTHQRIHTGEKP------------------------------------------------------------------- 764
Cdd:COG5048    339 LKRHILLHTSISPakekllnssskfspllnneppqslqqykdlkndkksetlsnscirnfkrdsnlslhiithlsfrpyn 418
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1034561584  765 YKCLECGKSFSDHSNLITHQRIHTGEKPYKCgECWKSFNQSSNLLKH 811
Cdd:COG5048    419 CKNPPCSKSFNRHYNLIPHKKIHTNHAPLLC-SILKSFRRDLDLSNH 464
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1000-1025 1.68e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.36  E-value: 1.68e-05
                           10        20
                   ....*....|....*....|....*.
gi 1034561584 1000 SLIIHQRIHTGEKPYKCTECGKDFNN 1025
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
765-787 3.63e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 41.52  E-value: 3.63e-05
                           10        20
                   ....*....|....*....|...
gi 1034561584  765 YKCLECGKSFSDHSNLITHQRIH 787
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
737-759 5.87e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.75  E-value: 5.87e-05
                           10        20
                   ....*....|....*....|...
gi 1034561584  737 YKCLECGKNFSDRSNLNTHQRIH 759
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
958-980 7.35e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.36  E-value: 7.35e-05
                           10        20
                   ....*....|....*....|...
gi 1034561584  958 YKCLECGKFFRDRSNLITHQRIH 980
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
986-1008 8.60e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.36  E-value: 8.60e-05
                           10        20
                   ....*....|....*....|...
gi 1034561584  986 YKCRECGKCFNQSSSLIIHQRIH 1008
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
956-1008 9.59e-05

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.77  E-value: 9.59e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034561584  956 KPYkCLECGKFFRDRSNLITHQRIHTgekpYKCRECGKCFNQSSSLIIH-QRIH 1008
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
793-815 1.26e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.98  E-value: 1.26e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584  793 YKCGECWKSFNQSSNLLKHQRIH 815
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
984-1032 1.36e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.39  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1034561584  984 KPYkCRECGKCFNQSSSLIIHQRIHTgekpYKCTECGKDFNNSSHFSAH 1032
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
902-924 1.49e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.59  E-value: 1.49e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584  902 YECAECGKSFSKSSTLANHQRTH 924
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
763-815 1.74e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.00  E-value: 1.74e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034561584  763 KPYkCLECGKSFSDHSNLITHQRIHTgekpYKCGECWKSFNQSSNLLKH-QRIH 815
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
1014-1036 1.96e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.21  E-value: 1.96e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584 1014 YKCTECGKDFNNSSHFSAHRRTH 1036
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
779-804 2.02e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 2.02e-04
                           10        20
                   ....*....|....*....|....*.
gi 1034561584  779 NLITHQRIHTGEKPYKCGECWKSFNQ 804
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
874-896 2.18e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.21  E-value: 2.18e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584  874 YECSECGRSFSKSSALISHQRIH 896
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
945-967 2.72e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.72e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584  945 LITHQRVHTGEKPYKCLECGKFF 967
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSF 24
zf-H2C2_2 pfam13465
Zinc-finger double domain;
727-747 3.15e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 3.15e-04
                           10        20
                   ....*....|....*....|.
gi 1034561584  727 HQRIHTGEKPYKCLECGKNFS 747
Cdd:pfam13465    5 HMRTHTGEKPYKCPECGKSFK 25
zf-H2C2_2 pfam13465
Zinc-finger double domain;
917-940 3.44e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 3.44e-04
                           10        20
                   ....*....|....*....|....
gi 1034561584  917 LANHQRTHTGEKPYKCVDCGKCFS 940
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFK 25
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
930-952 3.56e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.44  E-value: 3.56e-04
                           10        20
                   ....*....|....*....|...
gi 1034561584  930 YKCVDCGKCFSERSKLITHQRVH 952
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
709-731 3.08e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.12  E-value: 3.08e-03
                           10        20
                   ....*....|....*....|...
gi 1034561584  709 YKCDTCMKSFSRSSHFIAHQRIH 731
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
876-920 3.39e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.54  E-value: 3.39e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1034561584  876 CSECGRSFSKSSALISHQRIHTgekpYECAECGKSFSKSSTLANH 920
Cdd:cd20908      4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
735-787 5.32e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 36.77  E-value: 5.32e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034561584  735 KPYkCLECGKNFSDRSNLNTHQRIHTgekpYKCLECGKSFSDHSNLITH-QRIH 787
Cdd:cd20908      1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
ZnF_C2H2 smart00355
zinc finger;
765-787 9.75e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 34.75  E-value: 9.75e-03
                            10        20
                    ....*....|....*....|...
gi 1034561584   765 YKCLECGKSFSDHSNLITHQRIH 787
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRTH 23
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH