NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720394324|ref|XP_030106540|]
View 

pecanex-like protein 3 isoform X6 [Mus musculus]

Protein Classification

oligosaccharide repeat unit polymerase; pecanex family protein( domain architecture ID 10523572)

oligosaccharide repeat unit polymerase may act to polymerize the oligosaccharide repeat units of surface polysaccharides, including O-antigen in Gram-negative bacteria and capsular polysaccharide in Gram-positive bacteria; pecanex family protein similar to Drosophila melanogaster protein pecanex that is involved in neurogenesis

Gene Ontology:  GO:0016020

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1572-1798 1.59e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


:

Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.59e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1572 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1651
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1652 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1731
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394324 1732 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1798
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 super family cl33720
large tegument protein UL36; Provisional
194-654 6.38e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 6.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247  2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247  2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247  2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247  2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  581 NVR----RAQAIRRR--HNAGSNPTPPAsvmgSPPSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNF 653
Cdd:PHA03247  2964 LGAlvpgRVAVPRFRvpQPAPSREAPAS----STPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDA 3039

                   .
gi 1720394324  654 D 654
Cdd:PHA03247  3040 D 3040
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1848-2024 3.78e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1848 LTSLSNH-PPLAHPTPENAAGSSeQPLPPGPSWGP----RPSLSGSGDgrpppllqwpppRLPGPPPASPAPTEGPRPSR 1922
Cdd:pfam03154  399 LSSLSTHhPPSAHPPPLQLMPQS-QQLPPPPAQPPvltqSQSLPPPAA------------SHPPTSGLHQVPSQSPFPQH 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1923 PSGPallnsegpsgkwslGGRKGLGGPDGEPASGSPKGgtPKSQAPLDLSLSpdvSSEASPARTTQDLPcldssipegct 2002
Cdd:pfam03154  466 PFVP--------------GGPPPITPPSGPPTSTSSAM--PGIQPPSSASVS---SSGPVPAAVSCPLP----------- 515
                          170       180
                   ....*....|....*....|..
gi 1720394324 2003 PSGAPGDWPVPAEERESPAAQP 2024
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPPPPP 537
 
Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1572-1798 1.59e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.59e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1572 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1651
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1652 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1731
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394324 1732 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1798
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 PHA03247
large tegument protein UL36; Provisional
194-654 6.38e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 6.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247  2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247  2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247  2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247  2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  581 NVR----RAQAIRRR--HNAGSNPTPPAsvmgSPPSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNF 653
Cdd:PHA03247  2964 LGAlvpgRVAVPRFRvpQPAPSREAPAS----STPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDA 3039

                   .
gi 1720394324  654 D 654
Cdd:PHA03247  3040 D 3040
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
455-592 1.20e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 41.60  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQrTPSTASAKTHARVLSM--DGAGGDVLRAPLAGSKAELEAQPGME 532
Cdd:cd21975     25 PEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD-SPGLVTAAPHLLAANVlaPLRGPSVEGSSLESGDADMGSDSDVA 103
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720394324  533 LAAGEPAVLPPEARRGPAAN-QPGWrgeLQEEGAVGGAPEETGQRECTSNVRR-AQAIRRRH 592
Cdd:cd21975    104 PASGAAASTSPESSSDAASSpSPLS---LLHPGEAGLEPERPRPRVRRGVRRRgVTPAAKRH 162
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1848-2024 3.78e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1848 LTSLSNH-PPLAHPTPENAAGSSeQPLPPGPSWGP----RPSLSGSGDgrpppllqwpppRLPGPPPASPAPTEGPRPSR 1922
Cdd:pfam03154  399 LSSLSTHhPPSAHPPPLQLMPQS-QQLPPPPAQPPvltqSQSLPPPAA------------SHPPTSGLHQVPSQSPFPQH 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1923 PSGPallnsegpsgkwslGGRKGLGGPDGEPASGSPKGgtPKSQAPLDLSLSpdvSSEASPARTTQDLPcldssipegct 2002
Cdd:pfam03154  466 PFVP--------------GGPPPITPPSGPPTSTSSAM--PGIQPPSSASVS---SSGPVPAAVSCPLP----------- 515
                          170       180
                   ....*....|....*....|..
gi 1720394324 2003 PSGAPGDWPVPAEERESPAAQP 2024
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPPPPP 537
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1855-2024 4.05e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1855 PPLAHPTPENAAGSSEQPLPPGPSWGPRPSLSGSGDGRPPPLLQWPPPRLPGPPPASP----APTEGPRPsRPSGPALLN 1930
Cdd:PHA03307   189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcgwgPENECPLP-RPAPITLPT 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1931 SEGPSGKWSLGGRK--------GLGGPDGEPASGSPKGGTPKSQAPLDLSLSPDVSSEASPARTTQDLP-CLDSSIPEGC 2001
Cdd:PHA03307   268 RIWEASGWNGPSSRpgpassssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSP 347
                          170       180
                   ....*....|....*....|...
gi 1720394324 2002 TPSGAPGDWPVPAEERESPAAQP 2024
Cdd:PHA03307   348 SRSPSPSRPPPPADPSSPRKRPR 370
 
Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1572-1798 1.59e-138

Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein homologs. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.


Pssm-ID: 461533  Cd Length: 227  Bit Score: 430.20  E-value: 1.59e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1572 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1651
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1652 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1731
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394324 1732 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1798
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
PHA03247 PHA03247
large tegument protein UL36; Provisional
194-654 6.38e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 6.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  194 GDLPQTPPGVVPDPSLPSTDSSERSPmagdgvPWGGSGVADTPMSPLLKGSLSQELSKSFLTLTRpdralVRTSSR-REQ 272
Cdd:PHA03247  2602 VDDRGDPRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR-----VSRPRRaRRL 2670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  273 CRGTGGYQPLDRrgsgdPMPQKAgssdscfsgtdRETLSSFKSEKtnsthlDSPPGGHAPEgsdtdPPSEAELPASPDAG 352
Cdd:PHA03247  2671 GRAAQASSPPQR-----PRRRAA-----------RPTVGSLTSLA------DPPPPPPTPE-----PAPHALVSATPLPP 2723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  353 VPSDDTLRSFDTVIGAGTPPGQTEPLLVVRPKDLALLR-----PSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDTSEG 427
Cdd:PHA03247  2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  428 SELSPASSLRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARV 503
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  504 LSMDGAGGDVLRAPLAGSKAELEAQPGMELAA--GEPAVLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03247  2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  581 NVR----RAQAIRRR--HNAGSNPTPPAsvmgSPPSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNF 653
Cdd:PHA03247  2964 LGAlvpgRVAVPRFRvpQPAPSREAPAS----STPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDA 3039

                   .
gi 1720394324  654 D 654
Cdd:PHA03247  3040 D 3040
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
286-621 2.28e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.25  E-value: 2.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  286 GSGDPMPQKAGSSDSCFSGTDRETLSSFKSEKTNSTHLDSPPGGHAPEGSDTDPPSEA----ELPASPDAGVPSDDTLRS 361
Cdd:PHA03307    74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPApdlsEMLRPVGSPGPPPAASPP 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  362 FDTVIGAGTPPGQTEPLLVVRPkdLALLRPSKRRPPMRGHSPPGRTPrRPLLEGSGFFEDEDTSEGSELSPASSLRSQRR 441
Cdd:PHA03307   154 AAGASPAAVASDAASSRQAALP--LSSPEETARAPSSPPAEPPPSTP-PAAASPRPPRRSSPISASASSPAPAPGRSAAD 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  442 YSTDSSSSTSCYSP----ESSQGAAGGP---------RKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHARVLSMDG 508
Cdd:PHA03307   231 DAGASSSDSSSSESsgcgWGPENECPLPrpapitlptRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  509 ---AGGDVLRAPLAGSKAELEAQPGMELAAGEPAvlPPEAR-----RGPAANQPGWRGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03307   311 sprASSSSSSSRESSSSSTSSSSESSRGAAVSPG--PSPSRspspsRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRR 388
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1720394324  581 NVRRAQAIRRRHNAGSNPTPPASVMGSPPSLQEAQRGRAAS 621
Cdd:PHA03307   389 RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYAR 429
PHA03247 PHA03247
large tegument protein UL36; Provisional
334-609 2.66e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 2.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  334 GSDTDPPSEAELPASPDAGVP-SDDTLRSFDTVIGA-----GTPPGQTEPLLVVRPKDLALLRPSKRRPPMRGHSP---- 403
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPpPRPAPRPSEPAVTSrarrpDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPdppp 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  404 PGRTPRRPLLEGSGFFEDEDTSEGSELSPASSLRSQRRYSTDSSSSTSCYSPEssqgaagGPRKRRAPHGAEEGTAV--- 480
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-------RPRRRAARPTVGSLTSLadp 2701
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  481 -PPKRPYGTQRTPSTASAKTHARVLSMDGAGGDVLRAPLAGSKAELEAQPGME-------LAAGEPAVLPPEARRG--PA 550
Cdd:PHA03247  2702 pPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParparppTTAGPPAPAPPAAPAAgpPR 2781
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394324  551 ANQPGWRGELQEEGAVGGAPEETGQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPP 609
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
197-438 6.03e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 6.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  197 PQTPPGVVPDPSLPSTDSSERSPMAGDGVPWGGSGVADTPMSPLLKGSLSQelsksfltltrpdralvrTSSRREQCrgt 276
Cdd:PHA03307   189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDS------------------SSSESSGC--- 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  277 gGYQPLDRRGSGDPMPQKA-GSSDSCFSGTDRETLSSFKSEKTNSTHLDSPPGGHAPEGSDTDPPSEAELPASPDAGVPS 355
Cdd:PHA03307   248 -GWGPENECPLPRPAPITLpTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  356 DDTLRSFDTVIGAGTPPGQTE---PLLVVRPKDLALLRPSKRRPPMRGHSPPGRTPRRPLLEGSGFFEDEDT--SEGSEL 430
Cdd:PHA03307   327 SSTSSSSESSRGAAVSPGPSPsrsPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRArrRDATGR 406

                   ....*...
gi 1720394324  431 SPASSLRS 438
Cdd:PHA03307   407 FPAGRPRP 414
PRK13863 PRK13863
T-DNA border endonuclease VirD2;
419-626 7.56e-04

T-DNA border endonuclease VirD2;


Pssm-ID: 237533 [Multi-domain]  Cd Length: 446  Bit Score: 44.17  E-value: 7.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  419 FEDEDTSEGSelsPASSLRSQRRYSTDSSSSTSCYSPESSQGAAG----GPRKRRAPHGAEEGTAVPPKRPYGTQRTPST 494
Cdd:PRK13863   211 FEDADFEEFS---PGEDHREPSQSFDTSPGEAPQGEPESAERPEKlqneSEVRLQEPAGSSIKADARIRVSLESERRAQP 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  495 ASAKTharvlSMDGAGGDVLRAPLAGSKAELEAQPGMELAAGEPAVLPPEAR-------RGPAANQPGWRGELQEEGAVG 567
Cdd:PRK13863   288 SASKI-----PVADDFGIETSYVAEGDVRKLEGNSGTPRLATEVATHTTSERqqrrkrpRDDEGEPSGAKRTRLNGIAVG 362
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  568 gaPEET-GQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPPSLQEAQRGRAASHSRAL 626
Cdd:PRK13863   363 --PEANaGEQDGRDDPITSPAQPPRSNPLADPVRASIATDSLPATADRQQQREPSSKRPR 420
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
455-592 1.20e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 41.60  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQrTPSTASAKTHARVLSM--DGAGGDVLRAPLAGSKAELEAQPGME 532
Cdd:cd21975     25 PEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD-SPGLVTAAPHLLAANVlaPLRGPSVEGSSLESGDADMGSDSDVA 103
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720394324  533 LAAGEPAVLPPEARRGPAAN-QPGWrgeLQEEGAVGGAPEETGQRECTSNVRR-AQAIRRRH 592
Cdd:cd21975    104 PASGAAASTSPESSSDAASSpSPLS---LLHPGEAGLEPERPRPRVRRGVRRRgVTPAAKRH 162
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1848-2024 3.78e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1848 LTSLSNH-PPLAHPTPENAAGSSeQPLPPGPSWGP----RPSLSGSGDgrpppllqwpppRLPGPPPASPAPTEGPRPSR 1922
Cdd:pfam03154  399 LSSLSTHhPPSAHPPPLQLMPQS-QQLPPPPAQPPvltqSQSLPPPAA------------SHPPTSGLHQVPSQSPFPQH 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1923 PSGPallnsegpsgkwslGGRKGLGGPDGEPASGSPKGgtPKSQAPLDLSLSpdvSSEASPARTTQDLPcldssipegct 2002
Cdd:pfam03154  466 PFVP--------------GGPPPITPPSGPPTSTSSAM--PGIQPPSSASVS---SSGPVPAAVSCPLP----------- 515
                          170       180
                   ....*....|....*....|..
gi 1720394324 2003 PSGAPGDWPVPAEERESPAAQP 2024
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPPPPP 537
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1855-2024 4.05e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1855 PPLAHPTPENAAGSSEQPLPPGPSWGPRPSLSGSGDGRPPPLLQWPPPRLPGPPPASP----APTEGPRPsRPSGPALLN 1930
Cdd:PHA03307   189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcgwgPENECPLP-RPAPITLPT 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324 1931 SEGPSGKWSLGGRK--------GLGGPDGEPASGSPKGGTPKSQAPLDLSLSPDVSSEASPARTTQDLP-CLDSSIPEGC 2001
Cdd:PHA03307   268 RIWEASGWNGPSSRpgpassssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSP 347
                          170       180
                   ....*....|....*....|...
gi 1720394324 2002 TPSGAPGDWPVPAEERESPAAQP 2024
Cdd:PHA03307   348 SRSPSPSRPPPPADPSSPRKRPR 370
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
343-629 4.20e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 4.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  343 AELPASPD--AGVpSDDTLR--SFDTVIGAGTPPGQTEPLLVVRpkdlALLRPSKRRPPMRGHSPPGRTPRRPLLEGSGF 418
Cdd:PRK07003   336 GELGLAPDeyAGF-TMTLLRmlAFEPAVTGGGAPGGGVPARVAG----AVPAPGARAAAAVGASAVPAVTAVTGAAGAAL 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  419 FEDEDTSEGSELSPASSLRSQRRYSTDSSSSTSCYSP--ESSQGAAGGPRKRRAPHGAEEGTAVPPKR-PYGTQRTPSTA 495
Cdd:PRK07003   411 APKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDApvPAKANARASADSRCDERDAQPPADSGSASaPASDAPPDAAF 490
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  496 SAKTHARVLSMDGAGGDVLRAPLAGSKAELE----AQPGMELAAGEPAVLPPEARRGPAA------NQPGWRGELQEEGA 565
Cdd:PRK07003   491 EPAPRAAAPSAATPAAVPDARAPAAASREDApaaaAPPAPEARPPTPAAAAPAARAGGAAaaldvlRNAGMRVSSDRGAR 570
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720394324  566 VGGAPEETGQRECTSnvrrAQAIRRRhnAGSNPTPPASVMGSPPSLQEAQRGRAASHSRALTLP 629
Cdd:PRK07003   571 AAAAAKPAAAPAAAP----KPAAPRV--AVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
197-502 4.75e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 4.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  197 PQTPPGVVPDPSLPSTDSSERSPMAGDGVPWGGSGVADTPMSPLLKgslsqelsksfltlTRPDRALVRTSSRREQCRGT 276
Cdd:PHA03307    84 SRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPS--------------PAPDLSEMLRPVGSPGPPPA 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  277 GGYQPLDRRGSGDPMPQKAGSSDSCFSGTDRETLSSfKSEKTNSTHLDSPPGGHAPEGSDTDPPSEAELPASPDAGVPSD 356
Cdd:PHA03307   150 ASPPAAGASPAAVASDAASSRQAALPLSSPEETARA-PSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394324  357 DTLRSFDTVIGAGTP-PGQTEPLLVVRPKDLALLRPSKRRPpmRGHSPPGRTPRRPLLEGSGFFEDEDTSEGSELSPASS 435
Cdd:PHA03307   229 ADDAGASSSDSSSSEsSGCGWGPENECPLPRPAPITLPTRI--WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSG 306
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394324  436 LRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHAR 502
Cdd:PHA03307   307 PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSR 373
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH