Conserved domains on  [gi|755519506|ref|XP_011248846|]

WD repeat-containing protein 62 isoform X9 [Mus musculus]


List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
365-745 1.11e-36

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 144.28  E-value: 1.11e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 443
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  444 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 523
Cdd:COG2319   150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  524 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 602
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  603 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 682
Cdd:COG2319   267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755519506  683 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 745
Cdd:COG2319   339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
94-445 7.17e-23

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.07  E-value: 7.17e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   94 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 173
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  174 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 251
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  252 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 329
Cdd:COG2319   290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  330 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 409
Cdd:COG2319   324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 755519506  410 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 445
Cdd:COG2319   377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
984-1458 2.58e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   984 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1059
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1060 PELPGLGNGSLPQTPEQEKFLRHHFETLTDAPTEGPmgiflelfhgslgdikisetedyffnPRLSISTqflSRLQKTSR 1139
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHP--------------------------QRLPSPH---PPLQPMTQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1140 CPPrlPLHLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSIS 1219
Cdd:pfam03154  255 PPP--PSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIH 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1220 APSSCSYLES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDH 1284
Cdd:pfam03154  327 TPPSQSQLQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTH 405
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1285 EPAplswgnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLH 1357
Cdd:pfam03154  406 HPP--------SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITP 477
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1358 SSMFLPKTSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRL 1431
Cdd:pfam03154  478 PSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHAS 554
                          490       500       510
                   ....*....|....*....|....*....|....*...
gi 755519506  1432 QTA-FQEAL----------DLYRMLVSSSQLGPEQQQA 1458
Cdd:pfam03154  555 QSArFYKHLdrgynscartDLYFMPLAGSKLAKKREEA 592
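
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be turned into a record for downstream filtering. The function name `parse_summary` and the field names are hypothetical, and the regular expression assumes the exact layout seen in this report, not a documented NCBI format.

```python
import re

# Pattern inferred from the summary lines in this report (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict, or return None."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
            "Bit Score: 144.28  E-value: 1.11e-36")
    print(parse_summary(line))
```

Records parsed this way can be sorted by `e_value` or filtered against a threshold chosen by the reader; the report itself does not state a cutoff.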
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
365-743 2.68e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
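
The description above characterizes a WD40 repeat by a GH dipeptide followed, after a conserved core, by a WD dipeptide. As a rough illustration only, the sketch below scans a sequence for GH...WD pairs. The function name `find_gh_wd_candidates` and the gap limits (`min_gap`, `max_gap`) are assumptions made for this example; they are not taken from the CDD model, which scores hits with a position-specific scoring matrix rather than a literal pattern.

```python
import re


def find_gh_wd_candidates(seq, min_gap=20, max_gap=35):
    """Return (gh_start, wd_end) spans, 0-based and half-open.

    A span is reported when a 'GH' dipeptide is followed by a 'WD'
    dipeptide with between min_gap and max_gap residues in between.
    The gap limits are illustrative assumptions, not CDD parameters.
    """
    seq = seq.upper()
    hits = []
    for gh in re.finditer("GH", seq):
        # Earliest and latest positions at which 'WD' may start and end.
        window_start = gh.end() + min_gap
        window_end = gh.end() + max_gap + 2
        wd = seq.find("WD", window_start, window_end)
        if wd != -1:
            hits.append((gh.start(), wd + 2))
    return hits


if __name__ == "__main__":
    # smart00320 consensus row and the matching query row (705-744),
    # both copied from the alignments in this report.
    consensus = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
    query = "SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH"
    print(find_gh_wd_candidates(consensus))
    print(find_gh_wd_candidates(query))
```

The query segment contains GH but ends in WH rather than WD, so a literal scan finds nothing there even though the profile-based search reports a hit. That is the reason a pattern match like this is at most a quick sanity check.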


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.58  E-value: 2.68e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  365 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqRACLPSGTFLTCSSDNTIRFWNLD 444
Cdd:cd00200    14 CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG---HTGPVRDV---------AASADGTYLASGSSDKTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  445 SASDTRwqknIFsdsllkvvyvendIQHLQDlshfpdrgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELHF 524
Cdd:cd00200    82 TGECVR----TL-------------TGHTSY--------------------VSSVAFSPDGRILSSSSRDKTIKVWDVET 124
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  525 MDELIKVEAHDAEVLCLEYSKPETgvtLLASASRDRLIHVLNVEKNYnLEQTLDDHSSSITAIKFAGTRDvQMISCGADK 604
Cdd:cd00200   125 GKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDLRTGK-CVATLTGHTGEVNSVAFSPDGE-KLLSSSSDG 199
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  605 SIyfrsaqqasdglhfvrthhvaekttlydmdiditqkyvavacqdrnvRVYNTVSGKQKKCYkgsQGDEGSLLKVHVDP 684
Cdd:cd00200   200 TI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSVAFSP 229
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755519506  685 SGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:cd00200   230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
112-442 1.55e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 99.72  E-value: 1.55e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  112 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 190
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  191 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 267
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  268 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 337
Cdd:cd00200   151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  338 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 417
Cdd:cd00200   212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
                         330       340
                  ....*....|....*....|....*.
gi 755519506  418 edqrACLPSGTFL-TCSSDNTIRFWN 442
Cdd:cd00200   268 ----AWSPDGKRLaSGSADGTIRIWD 289
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
984-1458 2.58e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   984 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1059
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1060 PELPGLGNGSLPQTPEQEKFLRHHFETLTDAPTEGPmgiflelfhgslgdikisetedyffnPRLSISTqflSRLQKTSR 1139
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHP--------------------------QRLPSPH---PPLQPMTQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1140 CPPrlPLHLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSIS 1219
Cdd:pfam03154  255 PPP--PSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIH 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1220 APSSCSYLES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDH 1284
Cdd:pfam03154  327 TPPSQSQLQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTH 405
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1285 EPAplswgnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLH 1357
Cdd:pfam03154  406 HPP--------SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITP 477
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1358 SSMFLPKTSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRL 1431
Cdd:pfam03154  478 PSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHAS 554
                          490       500       510
                   ....*....|....*....|....*....|....*...
gi 755519506  1432 QTA-FQEAL----------DLYRMLVSSSQLGPEQQQA 1458
Cdd:pfam03154  555 QSArFYKHLdrgynscartDLYFMPLAGSKLAKKREEA 592
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
705-744 2.80e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 2.80e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 755519506    705 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 744
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
706-743 1.84e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.41  E-value: 1.84e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 755519506   706 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
146-186 4.11e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 4.11e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 755519506    146 KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 186
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
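
The paired rows in the alignments above can be compared column by column. The sketch below estimates the fraction of identical residues over aligned columns for one pair of rows. It assumes that `-` marks a gap and that lowercase letters mark residues not aligned to the domain model, which is how the rows in this report appear to be laid out. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, model_row):
    """Fraction of identical residues over aligned columns.

    Columns containing a gap ('-') or a lowercase residue are skipped,
    on the assumption that lowercase marks unaligned positions.
    Returns 0.0 when no aligned column is left.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have equal length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical / aligned if aligned else 0.0


if __name__ == "__main__":
    # Rows copied from the smart00320 hit at residues 146-186.
    query = "KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD"
    model = "GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD"
    print(f"identity over aligned columns: {aligned_identity(query, model):.2f}")
```

Identity computed this way is only a descriptive statistic; the E-values and bit scores in the report come from profile scoring and cannot be derived from it.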
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
493-745 4.18e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 4.18e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  493 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSKpetGVTLLASASRDRLIHVLNVEKNyN 572
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  573 LEQTLDDHSSSITAIKFAGTRDVqMISCGADKSIyfRSAQQASDGLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 652
Cdd:cd00200    85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTI--KVWDVETGKCLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  653 VRVYNTVSGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLI 732
Cdd:cd00200   159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|...
gi 755519506  733 TVSGDSCVFIWHL 745
Cdd:cd00200   236 SGSEDGTIRVWDL 248
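
The per-hit summary line used throughout this listing (Pssm-ID, hit type, Cd Length, Bit Score, E-value) has a regular layout, so it can be pulled into structured fields with a short script. The following is a minimal sketch in Python that assumes only the line format visible on this page; the name `parse_hit_summary` and the regular expression are illustrative and are not part of any NCBI tool or API.

```python
import re

# Pattern assumed from the summary lines on this page, for example:
# "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 4.18e-27"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match.

    Hypothetical helper: the field names are chosen for this example only.
    """
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),       # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }
```

Applied to every summary line in the listing, a helper like this would allow the hits to be sorted or filtered by E-value without re-reading the alignment blocks.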
WD40 COG2319
WD40 repeat [General function prediction only];
43-568 9.09e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 111.93  E-value: 9.09e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   43 LRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSP 122
Cdd:COG2319     9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  123 DGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 201
Cdd:COG2319    89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  202 VIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvpLVGrsgilgelHNNIFCGVAcgrgrmagntfcVSYSGllc 280
Cdd:COG2319   165 VTSVAFSPDGKLLASGSdDGTVRLWDLATGKLLRT-----LTG--------HTGAVRSVA------------FSPDG--- 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  281 qfnekrvldkwinlkvslssclcvsdELIFCGCTDGIVRIFQAHSLLYLTNLPkphylgvdvAHGldssflfhrkaEAVY 360
Cdd:COG2319   217 --------------------------KLLASGSADGTVRLWDLATGKLLRTLT---------GHS-----------GSVR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  361 pdtvALTFDPVHQWLSCVYKDHSIYIWDVKDiDEVSKIWSElfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIR 439
Cdd:COG2319   251 ----SVAFSPDGRLLASGSADGTVRLWDLAT-GELLRTLTG--HSGGVNSV----------AFSPDGKLLaSGSDDGTVR 313
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  440 FWNLDSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRI 519
Cdd:COG2319   314 LWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVRL 356
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*....
gi 755519506  520 HELHFMDELIKVEAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVE 568
Cdd:COG2319   357 WDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
534-743 2.66e-25

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 107.81  E-value: 2.66e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  534 HDAEVLCLEYSkpeTGVTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGTRDvQMISCGADKSIYFrsaqQ 613
Cdd:cd00200     8 HTGGVTCVAFS---PDGKLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGT-YLASGSSDKTIRL----W 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  614 ASDGLHFVRTHHVAEKTtLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDegsLLKVHVDPSGTFLATSC 693
Cdd:cd00200    79 DLETGECVRTLTGHTSY-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW---VNSVAFSPDGTFVASSS 154
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 755519506  694 SDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:cd00200   155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
WD40 COG2319
WD40 repeat [General function prediction only];
481-751 1.96e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.96e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  481 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETgvTLLASASRDR 560
Cdd:COG2319    66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  561 LIHVLNVEKNyNLEQTLDDHSSSITAIKFA--GTRdvqMISCGADKSIYFRSAQQASDgLHFVRTHhvaeKTTLYDMDID 638
Cdd:COG2319   143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSpdGKL---LASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFS 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  639 ITQKYVAVACQDRNVRVYNTVSGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEI 718
Cdd:COG2319   214 PDGKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 755519506  719 VTGMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 751
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRLWDLatGKLLRT 325
WD40 COG2319
WD40 repeat [General function prediction only];
94-445 7.17e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.07  E-value: 7.17e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   94 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 173
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  174 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 251
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  252 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 329
Cdd:COG2319   290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  330 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 409
Cdd:COG2319   324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 755519506  410 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 445
Cdd:COG2319   377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
3-519 1.10e-22

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 102.30  E-value: 1.10e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506    3 AALAAGGYTRSDTIEKLSSVMAGVPARRNQSSPPPAPPLCLRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPG 82
Cdd:COG2319    11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   83 TGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKTQVAEMLGHKYGVACV 162
Cdd:COG2319    91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  163 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvp 240
Cdd:COG2319   169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLWDLATGKLLRT----- 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  241 LVGRSGILgelhnnifcgvacgrgrmagntFCVSYSGllcqfnekrvldkwinlkvslssclcvSDELIFCGCTDGIVRI 320
Cdd:COG2319   242 LTGHSGSV----------------------RSVAFSP---------------------------DGRLLASGSADGTVRL 272
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  321 FqahsllyltnlpkphylgvDVAHGLDSSFLFHRKAeAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWS 400
Cdd:COG2319   273 W-------------------DLATGELLRTLTGHSG-GVN----SVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTG 328
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  401 elfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNLdsasDTRWQKNIFSdsllkvvyvendiQHlqdlshf 479
Cdd:COG2319   329 ---HTGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDL----ATGELLRTLT-------------GH------- 371
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 755519506  480 pdrgsengtpmdmKAGVRVMQVSPDGQHLASGDRSGNLRI 519
Cdd:COG2319   372 -------------TGAVTSVAFSPDGRTLASGSADGTVRL 398
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
112-442 1.55e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 99.72  E-value: 1.55e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  112 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 190
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  191 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 267
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  268 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 337
Cdd:cd00200   151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  338 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 417
Cdd:cd00200   212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
                         330       340
                  ....*....|....*....|....*.
gi 755519506  418 edqrACLPSGTFL-TCSSDNTIRFWN 442
Cdd:cd00200   268 ----AWSPDGKRLaSGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
300-606 9.97e-19

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 88.55  E-value: 9.97e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  300 SCLCVSD--ELIFCGCTDGIVRIFQAHSLLYLTNLpKPHYLGV-DVAHGLDSSFLF------------------------ 352
Cdd:cd00200    13 TCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPVrDVAASADGTYLAsgssdktirlwdletgecvrtltg 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  353 HRKAeaVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIwseLFHSSFVWNVEVypefedqracLPSGTFLTC 432
Cdd:cd00200    92 HTSY--VS----SVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL---RGHTDWVNSVAF----------SPDGTFVAS 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  433 SS-DNTIRFWNLDSasdtrwqknifsdsllkvvyvendiqhlqdlshfpdrGSENGTPMDMKAGVRVMQVSPDGQHLASG 511
Cdd:cd00200   153 SSqDGTIKLWDLRT-------------------------------------GKCVATLTGHTGEVNSVAFSPDGEKLLSS 195
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  512 DRSGNLRIHELHfMDELIKV-EAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVEKNYNLeQTLDDHSSSITAIKFA 590
Cdd:cd00200   196 SSDGTIKLWDLS-TGKCLGTlRGHENGVNSVAFSPDG---YLLASGSEDGTIRVWDLRTGECV-QTLSGHTNSVTSLAWS 270
                         330
                  ....*....|....*.
gi 755519506  591 GTRDVqMISCGADKSI 606
Cdd:cd00200   271 PDGKR-LASGSADGTI 285
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
154-564 8.65e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 82.77  E-value: 8.65e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  154 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASt 231
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLWDLETG- 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  232 eaKVTSTvpLVGrsgilgelHNnifcgvacgrgrmaGNTFCVSYSgllcqfnekrvldkwinlkvslssclcVSDELIFC 311
Cdd:cd00200    84 --ECVRT--LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  312 GCTDGIVRIFQAHSLLYLTnlpkphylgvdvahgldsSFLFHRKaeavypDTVALTFDPVHQWLSCVYKDHSIYIWDVKD 391
Cdd:cd00200   111 SSRDKTIKVWDVETGKCLT------------------TLRGHTD------WVNSVAFSPDGTFVASSSQDGTIKLWDLRT 166
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  392 IdevSKIWSELFHSSFVWNVEVYPEfedqraclpSGTFLTCSSDNTIRFWNLDSAsdtrwqknifsdsllkvvyvendiQ 471
Cdd:cd00200   167 G---KCVATLTGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------K 210
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  472 HLQDLshfpdRGSENgtpmdmkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETGVt 551
Cdd:cd00200   211 CLGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKR- 275
                         410
                  ....*....|...
gi 755519506  552 lLASASRDRLIHV 564
Cdd:cd00200   276 -LASGSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
481-751 9.63e-15

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 78.03  E-value: 9.63e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  481 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkpeTGVTLLASASRDR 560
Cdd:COG2319    24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  561 LIHVLNVEKNYNLeQTLDDHSSSITAIKFAgtrdvqmiscgadksiyfrsaqqaSDGlhfvrthhvaekttlydmdidit 640
Cdd:COG2319   101 TVRLWDLATGLLL-RTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  641 qKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVT 720
Cdd:COG2319   133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
                         250       260       270
                  ....*....|....*....|....*....|...
gi 755519506  721 GMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 751
Cdd:COG2319   209 SVAFSPDGKLLASGSADGTVRLWDLatGKLLRT 241
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
633-756 1.40e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.06  E-value: 1.40e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  633 YDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSqgdEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKM 712
Cdd:cd00200    13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGH---TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTL 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 755519506  713 FGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQH 756
Cdd:cd00200    90 TGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVeTGKCLTTLRGH 134
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
88-225 1.43e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.06  E-value: 1.43e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   88 YLAGC----VVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKTQVAEMLGHKYGVACVA 163
Cdd:cd00200   107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755519506  164 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 225
Cdd:cd00200   185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
984-1458 2.58e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   984 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1059
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1060 PELPGLGNGSLPQTPEQEKFLRHHFETLTDAPTEGPmgiflelfhgslgdikisetedyffnPRLSISTqflSRLQKTSR 1139
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHP--------------------------QRLPSPH---PPLQPMTQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1140 CPPrlPLHLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSIS 1219
Cdd:pfam03154  255 PPP--PSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIH 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1220 APSSCSYLES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDH 1284
Cdd:pfam03154  327 TPPSQSQLQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTH 405
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1285 EPAplswgnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLH 1357
Cdd:pfam03154  406 HPP--------SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITP 477
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506  1358 SSMFLPKTSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRL 1431
Cdd:pfam03154  478 PSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHAS 554
                          490       500       510
                   ....*....|....*....|....*....|....*...
gi 755519506  1432 QTA-FQEAL----------DLYRMLVSSSQLGPEQQQA 1458
Cdd:pfam03154  555 QSArFYKHLdrgynscartDLYFMPLAGSKLAKKREEA 592
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
705-744 2.80e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 2.80e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 755519506    705 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 744
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
706-743 1.84e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.41  E-value: 1.84e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 755519506   706 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 743
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
635-727 3.25e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 3.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755519506   635 MDIditqkyVAVACQDRNVRVYNTvSGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISLIDFYSGECVAKMF 713
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 755519506   714 GHSEIVTGMKFTYD 727
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
146-186 4.11e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 4.11e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 755519506    146 KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 186
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
118-171 4.18e-03

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 41.95  E-value: 4.18e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755519506  118 LAFSPDGKYIV---TGENGHRpAVRIWDVEEK--TQVAEmlgHKYGVACVAFSPNMKHI 171
Cdd:COG4946   437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETGktVQLTD---GRYDDGSPAFSPDGKYL 491
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
714-762 6.06e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 40.40  E-value: 6.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 755519506  714 GHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQHLLEINH 762
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLeTGELLRTLKGHTGPVRD 56
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
103-142 8.94e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.37  E-value: 8.94e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 755519506    103 KQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGHrpaVRIWD 142
Cdd:smart00320    3 ELLKTLKGHTGPVTSVAFSPDGKYLASGsDDGT---IKLWD 40
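Each alignment block above pairs the query row (gi 755519506) with the domain model row (Cdd:...). As a rough illustration of how such a pair can be compared, the sketch below counts identical residues for the smart00320 hit at 705-744. The function name `aligned_identity` is hypothetical, and the code assumes that lowercase letters and `-` mark unaligned or gapped columns, so only columns with an uppercase residue in both rows are compared.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, compared) for two equal-length alignment rows.

    Assumption: only columns where both rows hold an uppercase residue
    are treated as aligned; lowercase letters and '-' are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    compared = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            compared += 1
            if q == s:
                identical += 1
    return identical, compared


# Rows copied from the smart00320 hit covering query residues 705-744.
query_row = "SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH"
model_row = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"

identical, compared = aligned_identity(query_row, model_row)
print(f"{identical}/{compared} identical ({100 * identical / compared:.1f}%)")
```

This is a simple identity count only; it does not reproduce the bit score or E-value reported on the page, which come from the position-specific scoring matrix of each domain model.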
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
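The per-hit summary lines on this page (Pssm-ID, Cd Length, Bit Score, E-value) can be read back from the plain text and checked against the E-value threshold of 0.01 listed above. The following is a minimal sketch, not an NCBI tool: the regular expression, the `parse_summary` helper, and the inclusive comparison (`<=`) are assumptions made for illustration.

```python
import re

# Pattern for the summary line printed under each hit on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one summary line into a dict, or return None if it does not match."""
    match = SUMMARY.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Summary lines copied from three of the hits listed on this page.
lines = [
    "Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 2.80e-05",
    "Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.41  E-value: 1.84e-04",
    "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 40.40  E-value: 6.06e-03",
]

E_VALUE_THRESHOLD = 0.01  # value taken from the preset options line
hits = [h for h in map(parse_summary, lines) if h is not None]
kept = [h for h in hits if h["e_value"] <= E_VALUE_THRESHOLD]  # inclusive bound is an assumption

for hit in sorted(kept, key=lambda h: h["e_value"]):
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Sorting by E-value mirrors the order in which the hits appear on the page, from most to least significant.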
