|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
1.30e-41 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 158.92 E-value: 1.30e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKDvHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPyqpnrmvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 217 NGDIYVWKGLT--LVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 183 DGTVRLWDLATgkLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 293 AD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA-V 369
Cdd:COG2319 256 PDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGaV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081596 370 RSVSFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1452-1832 |
2.05e-31 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.05e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1452 LTVNQHPKYRNVVATSQIGTT-------PSIHIWDAMTKHTLSMLRcFHTKGVNYINFSATGKLLVSVGVDpeHTITVWR 1524
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1525 WQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSMEAAkmqtmLSVAFGANNLTF- 1602
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1603 TGAINGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVEc 1680
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR- 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1681 vrsvcrgkgkilvgtkdgeiievgeksaasniLIDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLNKVNlG 1760
Cdd:COG2319 283 --------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081596 1761 HAA--RCAAYSPDGEMVAIGMKNGEFVILLVNTLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEQTVDFYDLT 1832
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1136 |
1.54e-30 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.54e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 708 AVAVVYNRQQHAQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGHhQRGVCALDFSgirvek 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGH-TGAVRSVAFS------ 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 788 ktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVkcnpqhadklvtvgikhikfw 867
Cdd:COG2319 130 ------------------PDGKTLASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV--------------------- 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 868 qqagggftskrgSFGSAGKLetmmcvsygrmedlVFSGAATGDIFIWkDVL---LLKTVKAHDGPVFAM-YALD-KGFVT 942
Cdd:COG2319 169 ------------AFSPDGKL--------------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLAS 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 943 GGKDGIVELWDdmfercLKTYAIKRTalstsskglLLEDNPSIRAITLGH-GHILV-GTKNGEILEIDKSGPMTLLVQGH 1020
Cdd:COG2319 222 GSADGTVRLWD------LATGKLLRT---------LTGHSGSVRSVAFSPdGRLLAsGSADGTVRLWDLATGELLRTLTG 286
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1021 MEGEVWGLAAHPLLPICATVSDDKTLRIWELSSQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDML 1100
Cdd:COG2319 287 HSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLR 366
|
410 420 430
....*....|....*....|....*....|....*.
gi 1907081596 1101 SFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1136
Cdd:COG2319 367 TLTGHTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
315-592 |
2.83e-27 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.55 E-value: 2.83e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVSFSPDGSQLALGMKDGSFIVL 393
Cdd:COG2319 111 LLLRTLTGH-TGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGhSGAVTSVAFSPDGKLLASGSDDGTVRLW 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 394 RVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECNKSL----SFITHIDWSLDSKYLQTN 469
Cdd:COG2319 190 DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLAT-----GKLLRTLtghsGSVRSVAFSPDGRLLASG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 470 DGAGERLFYKMPSGKPLTSKEEIKGipwaswtcvrgpevsgiwpkyteviDINSVDANYNSSVLVSGDDFGLVKLFKfpc 549
Cdd:COG2319 265 SADGTVRLWDLATGELLRTLTGHSG-------------------------GVNSVAFSPDGKLLASGSDDGTVRLWD--- 316
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1907081596 550 LKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWR 592
Cdd:COG2319 317 LATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GSDDGTVRLWD 358
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.56e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.56e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1907081596 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.78e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.78e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1907081596 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1714-1988 |
1.91e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 90.86 E-value: 1.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLnKVNLGHAA--RCAAYSPDGEMVAIGMKNgefvillvNT 1791
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1792 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEQTVDFYDLTQGTSLNRIGYCKDipsFVIQMDFSADSKYiq 1863
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1864 VSTGAYKRQVH--EVPLGKqvteamVVEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLLKLF 1941
Cdd:cd00200 150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1907081596 1942 DFpctEKFAKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVWR 1988
Cdd:cd00200 205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1064-1290 |
3.69e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 3.69e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1064 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1143
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1144 ICKGASSYITHIDWDSRGKLLqvnSGAkeqlffeaprGRKHTIRpseaekiEWDTWTcvlgPTCEGIWPAHSDvtDVNAA 1223
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRIL---SSS----------SRDKTIK-------VWDVET----GKCLTTLRGHTD--WVNSV 141
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081596 1224 NLTKDGSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1290
Cdd:cd00200 142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
|
|
| HELP super family |
cl04081 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1382-1433 |
8.01e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. The actual alignment was detected with superfamily member pfam03451:
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 8.01e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907081596 1382 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1433
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
1.30e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 158.92 E-value: 1.30e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKDvHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPyqpnrmvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 217 NGDIYVWKGLT--LVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 183 DGTVRLWDLATgkLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 293 AD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA-V 369
Cdd:COG2319 256 PDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGaV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081596 370 RSVSFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
8.40e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.93 E-value: 8.40e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKdVHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPYqpNRMVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 213 SGALNGDIYVW--KGLTLVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081596 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1452-1832 |
2.05e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.05e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1452 LTVNQHPKYRNVVATSQIGTT-------PSIHIWDAMTKHTLSMLRcFHTKGVNYINFSATGKLLVSVGVDpeHTITVWR 1524
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1525 WQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSMEAAkmqtmLSVAFGANNLTF- 1602
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1603 TGAINGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVEc 1680
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR- 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1681 vrsvcrgkgkilvgtkdgeiievgeksaasniLIDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLNKVNlG 1760
Cdd:COG2319 283 --------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081596 1761 HAA--RCAAYSPDGEMVAIGMKNGEFVILLVNTLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEQTVDFYDLT 1832
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1136 |
1.54e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.54e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 708 AVAVVYNRQQHAQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGHhQRGVCALDFSgirvek 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGH-TGAVRSVAFS------ 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 788 ktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVkcnpqhadklvtvgikhikfw 867
Cdd:COG2319 130 ------------------PDGKTLASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV--------------------- 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 868 qqagggftskrgSFGSAGKLetmmcvsygrmedlVFSGAATGDIFIWkDVL---LLKTVKAHDGPVFAM-YALD-KGFVT 942
Cdd:COG2319 169 ------------AFSPDGKL--------------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLAS 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 943 GGKDGIVELWDdmfercLKTYAIKRTalstsskglLLEDNPSIRAITLGH-GHILV-GTKNGEILEIDKSGPMTLLVQGH 1020
Cdd:COG2319 222 GSADGTVRLWD------LATGKLLRT---------LTGHSGSVRSVAFSPdGRLLAsGSADGTVRLWDLATGELLRTLTG 286
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1021 MEGEVWGLAAHPLLPICATVSDDKTLRIWELSSQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDML 1100
Cdd:COG2319 287 HSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLR 366
|
410 420 430
....*....|....*....|....*....|....*.
gi 1907081596 1101 SFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1136
Cdd:COG2319 367 TLTGHTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1050 |
5.73e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 5.73e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 725 GHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGhHQRGVCALDFSgirvekktsrltsnvglqeiwkn 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAAS----------------------- 60
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 805 rSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVKCNPQHadKLVTVGIKH--IKFWQQAGGGFTSkrgSFG 882
Cdd:cd00200 61 -ADGTYLASGSSD--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVETGKCLT---TLR 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 883 saGKLETMMCVSYGRMEDLVFSGAATGDIFIWkDV---LLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFE 957
Cdd:cd00200 133 --GHTDWVNSVAFSPDGTFVASSSQDGTIKLW-DLrtgKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTG 209
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 958 RCLKTYAIKR---TALSTSSKGLLLednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPL 1033
Cdd:cd00200 210 KCLGTLRGHEngvNSVAFSPDGYLL----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPD 272
|
330
....*....|....*..
gi 1907081596 1034 LPICATVSDDKTLRIWE 1050
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
315-592 |
2.83e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.55 E-value: 2.83e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVSFSPDGSQLALGMKDGSFIVL 393
Cdd:COG2319 111 LLLRTLTGH-TGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGhSGAVTSVAFSPDGKLLASGSDDGTVRLW 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 394 RVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECNKSL----SFITHIDWSLDSKYLQTN 469
Cdd:COG2319 190 DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLAT-----GKLLRTLtghsGSVRSVAFSPDGRLLASG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 470 DGAGERLFYKMPSGKPLTSKEEIKGipwaswtcvrgpevsgiwpkyteviDINSVDANYNSSVLVSGDDFGLVKLFKfpc 549
Cdd:COG2319 265 SADGTVRLWDLATGELLRTLTGHSG-------------------------GVNSVAFSPDGKLLASGSDDGTVRLWD--- 316
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1907081596 550 LKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWR 592
Cdd:COG2319 317 LATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GSDDGTVRLWD 358
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1493-1795 |
2.45e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.64 E-value: 2.45e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1493 HTKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1571
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1572 LLYkkgvigSMEAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGker 1649
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASS--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1650 ptKEGGAVKLWD-QEMKRCRAFQLETGQlvecVRSVC--RGKGKILVGTKDGEIIeVGEKSAASNI-LIDGHmEGEIWGL 1725
Cdd:cd00200 154 --SQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIK-LWDLSTGKCLgTLRGH-ENGVNSV 225
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081596 1726 ATHPSKDMFISASNDGTARIWDLADKKLLNKVNlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNTLKVW 1795
Cdd:cd00200 226 AFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.56e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.56e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1907081596 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.78e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.78e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1907081596 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1714-1988 |
1.91e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.86 E-value: 1.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLnKVNLGHAA--RCAAYSPDGEMVAIGMKNgefvillvNT 1791
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1792 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEQTVDFYDLTQGTSLNRIGYCKDipsFVIQMDFSADSKYiq 1863
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1864 VSTGAYKRQVH--EVPLGKqvteamVVEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLLKLF 1941
Cdd:cd00200 150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1907081596 1942 DFpctEKFAKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVWR 1988
Cdd:cd00200 205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1064-1290 |
3.69e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 3.69e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1064 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1143
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1144 ICKGASSYITHIDWDSRGKLLqvnSGAkeqlffeaprGRKHTIRpseaekiEWDTWTcvlgPTCEGIWPAHSDvtDVNAA 1223
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRIL---SSS----------SRDKTIK-------VWDVET----GKCLTTLRGHTD--WVNSV 141
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081596 1224 NLTKDGSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1290
Cdd:cd00200 142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1382-1433 |
8.01e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 8.01e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907081596 1382 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1433
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
2.33e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.37 E-value: 2.33e-04
10 20 30
....*....|....*....|....*....|..
gi 1907081596 105 HTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1714-1747 |
2.76e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.76e-04
10 20 30
....*....|....*....|....*....|....
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWD 1747
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1714-1747 |
5.40e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.25 E-value: 5.40e-04
10 20 30
....*....|....*....|....*....|....
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWD 1747
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.40e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.40e-04
10 20 30
....*....|....*....|....*....|....*....
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1948-1987 |
1.44e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.44e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1907081596 1948 KFAKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVW 1987
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1953-1987 |
1.58e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.58e-03
10 20 30
....*....|....*....|....*....|....*
gi 1907081596 1953 KRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVW 1987
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
1.30e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 158.92 E-value: 1.30e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKDvHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPyqpnrmvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 217 NGDIYVWKGLT--LVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 183 DGTVRLWDLATgkLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 293 AD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA-V 369
Cdd:COG2319 256 PDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGaV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081596 370 RSVSFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
5.54e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.22 E-value: 5.54e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKDvHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPyqpnrmvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 217 NGDIYVW--KGLTLVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 1907081596 368 AVRSVSFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
8.40e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.93 E-value: 8.40e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 57 FLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKdVHTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 137 WRKGKLLASATGHSDRIFDISWDPYqpNRMVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 213 SGALNGDIYVW--KGLTLVRTIQGaHSAGIFSLYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081596 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1452-1832 |
2.05e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.05e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1452 LTVNQHPKYRNVVATSQIGTT-------PSIHIWDAMTKHTLSMLRcFHTKGVNYINFSATGKLLVSVGVDpeHTITVWR 1524
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1525 WQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSMEAAkmqtmLSVAFGANNLTF- 1602
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1603 TGAINGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVEc 1680
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR- 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1681 vrsvcrgkgkilvgtkdgeiievgeksaasniLIDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLNKVNlG 1760
Cdd:COG2319 283 --------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081596 1761 HAA--RCAAYSPDGEMVAIGMKNGEFVILLVNTLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEQTVDFYDLT 1832
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1503-1988 |
2.19e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.19e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1503 SATGKLLVSVGVDPEHTITVWRWQEGTKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSm 1582
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1583 eaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1659
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1660 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeksaasniLIDGHmEGEIWGLATHPSKDMFISASN 1739
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1740 DGTARIWDLADKKLLNKVNlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNTLKVWGKKRDRKSAIQDIRISPDNRFL 1817
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1818 AVGSSEQTVDFYDLTQGTSLNRIGyckDIPSFVIQMDFSADSKYIqVSTGAYKR-QVHEVPLGKQvteamvvekitwasw 1896
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1897 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLLKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSSDDKYVVS 1976
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 1907081596 1977 tGGDDCSVFVWR 1988
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1136 |
1.54e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.54e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 708 AVAVVYNRQQHAQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGHhQRGVCALDFSgirvek 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGH-TGAVRSVAFS------ 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 788 ktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVkcnpqhadklvtvgikhikfw 867
Cdd:COG2319 130 ------------------PDGKTLASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV--------------------- 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 868 qqagggftskrgSFGSAGKLetmmcvsygrmedlVFSGAATGDIFIWkDVL---LLKTVKAHDGPVFAM-YALD-KGFVT 942
Cdd:COG2319 169 ------------AFSPDGKL--------------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLAS 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 943 GGKDGIVELWDdmfercLKTYAIKRTalstsskglLLEDNPSIRAITLGH-GHILV-GTKNGEILEIDKSGPMTLLVQGH 1020
Cdd:COG2319 222 GSADGTVRLWD------LATGKLLRT---------LTGHSGSVRSVAFSPdGRLLAsGSADGTVRLWDLATGELLRTLTG 286
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1021 MEGEVWGLAAHPLLPICATVSDDKTLRIWELSSQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDML 1100
Cdd:COG2319 287 HSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLR 366
|
410 420 430
....*....|....*....|....*....|....*.
gi 1907081596 1101 SFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1136
Cdd:COG2319 367 TLTGHTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
904-1290 |
7.37e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 124.25 E-value: 7.37e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 904 SGAATGDIFIWKDVLLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrtALSTSSKGLLLED 981
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 982 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSSQH 1055
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1056 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1135
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1136 VLTSKRVGICKGASSYITHIDWDSRGKLLqvnsgakeqlffeAPRGRKHTIRPseaekieWDTWTcvlgPTCEGIWPAHS 1215
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-------------ASGSDDGTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081596 1216 DvtDVNAANLTKDGSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1290
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1050 |
5.73e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 5.73e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 725 GHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGhHQRGVCALDFSgirvekktsrltsnvglqeiwkn 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAAS----------------------- 60
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 805 rSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVKCNPQHadKLVTVGIKH--IKFWQQAGGGFTSkrgSFG 882
Cdd:cd00200 61 -ADGTYLASGSSD--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVETGKCLT---TLR 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 883 saGKLETMMCVSYGRMEDLVFSGAATGDIFIWkDV---LLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFE 957
Cdd:cd00200 133 --GHTDWVNSVAFSPDGTFVASSSQDGTIKLW-DLrtgKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTG 209
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 958 RCLKTYAIKR---TALSTSSKGLLLednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPL 1033
Cdd:cd00200 210 KCLGTLRGHEngvNSVAFSPDGYLL----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPD 272
|
330
....*....|....*..
gi 1907081596 1034 LPICATVSDDKTLRIWE 1050
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
315-592 |
2.83e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.55 E-value: 2.83e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVSFSPDGSQLALGMKDGSFIVL 393
Cdd:COG2319 111 LLLRTLTGH-TGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGhSGAVTSVAFSPDGKLLASGSDDGTVRLW 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 394 RVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECNKSL----SFITHIDWSLDSKYLQTN 469
Cdd:COG2319 190 DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLAT-----GKLLRTLtghsGSVRSVAFSPDGRLLASG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 470 DGAGERLFYKMPSGKPLTSKEEIKGipwaswtcvrgpevsgiwpkyteviDINSVDANYNSSVLVSGDDFGLVKLFKfpc 549
Cdd:COG2319 265 SADGTVRLWDLATGELLRTLTGHSG-------------------------GVNSVAFSPDGKLLASGSDDGTVRLWD--- 316
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1907081596 550 LKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWR 592
Cdd:COG2319 317 LATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GSDDGTVRLWD 358
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
967-1290 |
5.27e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.78 E-value: 5.27e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 967 RTALSTSSKGLLLEDNPSIRAITLGHGHILVGTKNGEILEIDKSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTL 1046
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1047 RIWELSSQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVAS 1126
Cdd:COG2319 103 RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPD-GKLLASGS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1127 HDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGakeqlffeaprGRKHTIRPseaekieWDtwtcVLGPT 1206
Cdd:COG2319 182 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLL-A-SG-----------SADGTVRL-------WD----LATGK 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1207 CEGIWPAHSDVtdVNAANLTKDGSLLATGDDFGFVKLFSyPVKGQHARFKKyvGHSAHVTNVRWLHNDSVLLTvGGADTA 1286
Cdd:COG2319 238 LLRTLTGHSGS--VRSVAFSPDGRLLASGSADGTVRLWD-LATGELLRTLT--GHSGGVNSVAFSPDGKLLAS-GSDDGT 311
|
....
gi 1907081596 1287 LMIW 1290
Cdd:COG2319 312 VRLW 315
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
811-1164 |
2.87e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 113.85 E-value: 2.87e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 811 LVSVGLDDFHSVVFWDWKKGEKIATTRGHKDKIFVVKCNPQHADKLVTVGIKHIKFWQQAGGGFTSKRGSFGSAGKletm 890
Cdd:COG2319 49 ARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVR---- 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 891 mCVSY---GRMedlVFSGAATGDIFIWkDVL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKT 962
Cdd:COG2319 125 -SVAFspdGKT---LASGSADGTVRLW-DLAtgkLLRTLTGHSGAVTSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRT 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 963 yaikrtalstsskglLLEDNPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpi 1036
Cdd:COG2319 200 ---------------LTGHTGAVRSVAFSPdGKLLAsGSADGTVRLWDlATGKLLRTLTGH-SGSVRSVAFSPdgrLL-- 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1037 cATVSDDKTLRIWELSSQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSK 1116
Cdd:COG2319 262 -ASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSP 340
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1907081596 1117 DtGKYLAVASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1164
Cdd:COG2319 341 D-GKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
274-868 |
3.10e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.69 E-value: 3.10e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319 28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 354 LADHALIARCNM-EEAVRSVSFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319 107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 433 DVYAVAQrykkiGECNKSL----SFITHIDWSLDSKYLqtndgagerlfykmpsgkpltskeeikgipwaswtcvrgpev 508
Cdd:COG2319 187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 509 sgiwpkytevidinsvdanynssvlVSGDDFGLVKLFKfpcLKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319 220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 589 FQWRfipeavsngvlettpqeggadsyseesdsdfsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319 271 RLWD---------------------------------------------------------------------------- 274
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhaqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319 275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 749 RDAAVHVWDTQTLKCLSLLKGHHQRgVCALDFSgirvekktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWK 828
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTGHTGA-VRSVAFS------------------------PDGKTLASGSDD--GTVRLWDLA 360
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 1907081596 829 KGEKIATTRGHKDKIFVVKCNPQHaDKLVTVGI-KHIKFWQ 868
Cdd:COG2319 361 TGELLRTLTGHTGAVTSVAFSPDG-RTLASGSAdGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1433-1750 |
6.24e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 6.24e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1433 NLSTGSQSFYLE-HTDDILCLTVnqHPKYRNVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHTKGVNYINFSATGKLLVS 1511
Cdd:COG2319 106 DLATGLLLRTLTgHTGAVRSVAF--SPDGKTLASGSADGT---VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLAS 179
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1512 VGVDpeHTITVWRWQEGTKVASRGGHLERIFVVEFRPDSDTqFVSVGV-KHMKFWTLAGSALLYKKGVIGSmeaakmqTM 1590
Cdd:COG2319 180 GSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL-LASGSAdGTVRLWDLATGKLLRTLTGHSG-------SV 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1591 LSVAFGANNLTF-TGAINGDVYVWK-EHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDqemkrcr 1668
Cdd:COG2319 250 RSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGVNSVAFSPDGKLLASGS------DDGTVRLWD------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1669 afqLETGQLV-------ECVRSVC-RGKGKILV-GTKDGEI----IEVGEKSAAsnilIDGHmEGEIWGLATHPSKDMFI 1735
Cdd:COG2319 317 ---LATGKLLrtltghtGAVRSVAfSPDGKTLAsGSDDGTVrlwdLATGELLRT----LTGH-TGAVTSVAFSPDGRTLA 388
|
330
....*....|....*
gi 1907081596 1736 SASNDGTARIWDLAD 1750
Cdd:COG2319 389 SGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1493-1795 |
2.45e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.64 E-value: 2.45e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1493 HTKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1571
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1572 LLYkkgvigSMEAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-EHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGker 1649
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASS--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1650 ptKEGGAVKLWD-QEMKRCRAFQLETGQlvecVRSVC--RGKGKILVGTKDGEIIeVGEKSAASNI-LIDGHmEGEIWGL 1725
Cdd:cd00200 154 --SQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIK-LWDLSTGKCLgTLRGH-ENGVNSV 225
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081596 1726 ATHPSKDMFISASNDGTARIWDLADKKLLNKVNlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNTLKVW 1795
Cdd:cd00200 226 AFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-466 |
1.26e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.32 E-value: 1.26e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 228 LVRTIQGaHSAGIFSL--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVSFSPDGSQL 381
Cdd:cd00200 74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECNKSLSFITHIDWSL 461
Cdd:cd00200 151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229
|
....*
gi 1907081596 462 DSKYL 466
Cdd:cd00200 230 DGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.56e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.56e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1907081596 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1624-1988 |
5.72e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 92.40 E-value: 5.72e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1624 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC-RAFQLETGQlVECVRSVCRGKgKILVGTKDGEIIE 1702
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGS------GDGTIKVWDLETGELlRTLKGHTGP-VRDVAASADGT-YLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1703 VGEKSAASNILIDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLnKVNLGH--AARCAAYSPDGEMVAIGMK 1780
Cdd:cd00200 78 WDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCL-TTLRGHtdWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1781 NGefvillvnTLKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEQTVDFYDLTQGTSLNRIGYCKDipsFVIQ 1852
Cdd:cd00200 156 DG--------TIKLWdlrtGKCVATltghTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNS 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1853 MDFSADSKYIqvstgaykrqvhevplgkqvteamvvekitwaswTSVLGDEVIGIWprnadkadvncacvthaglNIVTG 1932
Cdd:cd00200 225 VAFSPDGYLL----------------------------------ASGSEDGTIRVW-------------------DLRTG 251
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081596 1933 DdfgllklfdfpctekfaKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVWR 1988
Cdd:cd00200 252 E-----------------CVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.78e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.78e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1907081596 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
919-1245 |
1.91e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.86 E-value: 1.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 919 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRTALSTSSKGLLL----EDNpSIRait 989
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 990 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSSQHRMLAVRKLKKGGRC 1069
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1070 CAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1149
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1150 SYITHIDWDSRGKLLQvnSGAKEQlffeaprgrkhTIRpseaekiEWDTWTCVLGPTCEGiwpaHSdvTDVNAANLTKDG 1229
Cdd:cd00200 220 NGVNSVAFSPDGYLLA--SGSEDG-----------TIR-------VWDLRTGECVQTLSG----HT--NSVTSLAWSPDG 273
|
330
....*....|....*.
gi 1907081596 1230 SLLATGDDFGFVKLFS 1245
Cdd:cd00200 274 KRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1714-1988 |
1.91e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.86 E-value: 1.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWDLADKKLLnKVNLGHAA--RCAAYSPDGEMVAIGMKNgefvillvNT 1791
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1792 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEQTVDFYDLTQGTSLNRIGYCKDipsFVIQMDFSADSKYiq 1863
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1864 VSTGAYKRQVH--EVPLGKqvteamVVEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLLKLF 1941
Cdd:cd00200 150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1907081596 1942 DFpctEKFAKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVWR 1988
Cdd:cd00200 205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
723-953 |
5.40e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.70 E-value: 5.40e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGhHQRGVCALDFSGirvekktsrltsnvglqeiw 802
Cdd:cd00200 89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSP-------------------- 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 803 knrsDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVKCNPQHADKLVTVGIKHIKFWQQAGGgftSKRGSFg 882
Cdd:cd00200 146 ----DGTFVASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL- 215
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081596 883 sAGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDVLLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 953
Cdd:cd00200 216 -RGHENGVNSVAFSPDGYLLASGSEDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
521-1095 |
1.40e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 84.19 E-value: 1.40e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipeavsn 600
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLWD-------- 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 601 gvlettpqeggadsyseesdsdfsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhavpflkrekapedslklq 680
Cdd:COG2319 --------------------------------------------------------------------------------
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 681 fihgyrgydcrnnlfyTQAGEVVYHIAavavvynrqqhaqrlylGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQT 760
Cdd:COG2319 149 ----------------LATGKLLRTLT-----------------GHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLAT 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 761 LKCLSLLKGhHQRGVCALDFSgirvekktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHK 840
Cdd:COG2319 194 GKLLRTLTG-HTGAVRSVAFS------------------------PDGKLLASGSAD--GTVRLWDLATGKLLRTLTGHS 246
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 841 DKIFVVkcnpqhadklvtvgikhikfwqqagggftskrgSFGSAGKletmmcvsygrmedLVFSGAATGDIFIWkDV--- 917
Cdd:COG2319 247 GSVRSV---------------------------------AFSPDGR--------------LLASGSADGTVRLW-DLatg 278
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 918 LLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaikrtalstsskglllednpsiraitlghghi 995
Cdd:COG2319 279 ELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWD------------------------------------------ 316
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 996 lvgTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSSQHRMLAVRKLKKGGRCCAFSPD 1075
Cdd:COG2319 317 ---LATGKLLRT---------LTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPD 383
|
570 580
....*....|....*....|
gi 1907081596 1076 GKALAVGLNDGSFLVVNADT 1095
Cdd:COG2319 384 GRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-177 |
7.94e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 80.07 E-value: 7.94e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLIATGQVGKEpyICIWNSYNVHTVSILKdVHTHGVACLAFDSDGQHLASV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1907081596 125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRMVSCGV-KHIKFW 177
Cdd:cd00200 238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
55-179 |
5.17e-15 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 79.57 E-value: 5.17e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 55 KFFLGHNDDIISLALHPDKTLIATGqvGKEPYICIWNSYNVHTVSILKDvHTHGVACLAFDSDGQHLASVGLDakNTVCI 134
Cdd:COG2319 282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1907081596 135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRMVSCGV-KHIKFWTL 179
Cdd:COG2319 357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1430-1661 |
9.87e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 76.99 E-value: 9.87e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1430 IVQNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHTKGVNYINFSATGKL 1508
Cdd:cd00200 76 RLWDLETGECVRTLTgHTSYVSSVAFSPDGRI--LSSSSRDKT---IKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTF 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1509 LVSVGVDpeHTITVWRWQEGTKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLykkGVIGSMEAAkmq 1588
Cdd:cd00200 150 VASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL---GTLRGHENG--- 221
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081596 1589 tMLSVAFGANNLTFTGA-INGDVYVWK-EHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWD 1661
Cdd:cd00200 222 -VNSVAFSPDGYLLASGsEDGTIRVWDlRTGECVQTLSGHTNSVTSLAWSPDGKRLASGS------ADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
291-953 |
1.21e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 75.33 E-value: 1.21e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 291 WKADRLLAGTQDSEIFEVIVRERDKPMLILQGHCEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEAVR 370
Cdd:COG2319 3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 371 SVSFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVyavaqrykkigecnks 450
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL---------------- 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 451 lsfithidWSLDSkylqtndgagerlfykmpsGKPLtskeeikgipwaswtcvrgpevsgiwpkytevidinsvdanyns 530
Cdd:COG2319 147 --------WDLAT-------------------GKLL-------------------------------------------- 155
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 531 svlvsgddfglvklfkfpclkkgakfRKYVGHSAHVTNVRWSHDFQWvLSTGGADHSVFQWRfipeavsngvlettpqeg 610
Cdd:COG2319 156 --------------------------RTLTGHSGAVTSVAFSPDGKL-LASGSDDGTVRLWD------------------ 190
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 611 gadsyseesdsdfsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhavpflkrekapedslklqfihgyrgydc 690
Cdd:COG2319 --------------------------------------------------------------------------------
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 691 rnnlfyTQAGEVVyhiaavavvynrqqhaqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAVHVWDTQTLKCLSLLKGh 770
Cdd:COG2319 191 ------LATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLWDLATGKLLRTLTG- 244
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 771 HQRGVCALDFSgirvekktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKIATTRGHKDKIFVVKCNP 850
Cdd:COG2319 245 HSGSVRSVAFS------------------------PDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP 298
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 851 QhADKLVTVGI-KHIKFWQQAGG----GFTSKRGSFGSagkletmmcVSYGRMEDLVFSGAATGDIFIW--KDVLLLKTV 923
Cdd:COG2319 299 D-GKLLASGSDdGTVRLWDLATGkllrTLTGHTGAVRS---------VAFSPDGKTLASGSDDGTVRLWdlATGELLRTL 368
|
650 660 670
....*....|....*....|....*....|..
gi 1907081596 924 KAHDGPVFAMYALDKG--FVTGGKDGIVELWD 953
Cdd:COG2319 369 TGHTGAVTSVAFSPDGrtLASGSADGTVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1064-1290 |
3.69e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 3.69e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1064 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMLSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1143
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1144 ICKGASSYITHIDWDSRGKLLqvnSGAkeqlffeaprGRKHTIRpseaekiEWDTWTcvlgPTCEGIWPAHSDvtDVNAA 1223
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRIL---SSS----------SRDKTIK-------VWDVET----GKCLTTLRGHTD--WVNSV 141
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081596 1224 NLTKDGSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1290
Cdd:cd00200 142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1382-1433 |
8.01e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 8.01e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907081596 1382 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1433
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-850 |
5.79e-10 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 62.74 E-value: 5.79e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKKGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipeaVSN 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWD-----LET 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 601 GVLETTpQEGgadsyseesdsDFSDVpeLDSDIEQETQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapEDS 676
Cdd:cd00200 83 GECVRT-LTG-----------HTSYV--SSVAFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TDW 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 677 LklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHAQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAV 753
Cdd:cd00200 138 V--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGTI 201
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 754 HVWDTQTLKCLSLLKGhHQRGVCALDFSgirvekktsrltsnvglqeiwknrSDGKCLVSVGLDdfHSVVFWDWKKGEKI 833
Cdd:cd00200 202 KLWDLSTGKCLGTLRG-HENGVNSVAFS------------------------PDGYLLASGSED--GTIRVWDLRTGECV 254
|
330
....*....|....*..
gi 1907081596 834 ATTRGHKDKIFVVKCNP 850
Cdd:cd00200 255 QTLSGHTNSVTSLAWSP 271
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1207-1565 |
2.59e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 54.65 E-value: 2.59e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1207 CEGIWPAHSDVtdVNAANLTKDGSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTA 1286
Cdd:cd00200 1 LRRTLKGHTGG--VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK---GHTGPVRDVAASADGTYLAS-GSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1287 LMIW---TREFVGTQEsklvdseesdtdaeedgGYDSDVareKAIDYTTKIYAVSiremeGTKPHQQLK--EVSMEERQG 1361
Cdd:cd00200 75 IRLWdleTGECVRTLT-----------------GHTSYV---SSVAFSPDGRILS-----SSSRDKTIKvwDVETGKCLT 129
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1362 IVKGSRPPVSRAAPQPEKlqknnitkkkklveelalDHVFGYRgfdcrnnlhylNDGADIIFHTAAAGIVQNLSTgsqsf 1441
Cdd:cd00200 130 TLRGHTDWVNSVAFSPDG------------------TFVASSS-----------QDGTIKLWDLRTGKCVATLTG----- 175
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1442 yleHTDDILCLTVnqHPKYRNVVATSQIGTtpsIHIWDAMTKHTLSMLRcFHTKGVNYINFSATGKLLVSVGVDpeHTIT 1521
Cdd:cd00200 176 ---HTGEVNSVAF--SPDGEKLLSSSSDGT---IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASGSED--GTIR 244
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 1907081596 1522 VWRWQEGTKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFW 1565
Cdd:cd00200 245 VWDLRTGECVQTLSGHTNSVTSLAWSPDG-KRLASGSAdGTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1104-1290 |
7.23e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.03 E-value: 7.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1104 HRKEmISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLQVNSGAKeqlffeaprgrk 1183
Cdd:cd00200 8 HTGG-VTCVAFSPD-GKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK------------ 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1184 hTIRpseaekiEWDTWTcvlgPTCEGIWPAHSDvtDVNAANLTKDGSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSA 1263
Cdd:cd00200 74 -TIR-------LWDLET----GECVRTLTGHTS--YVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR---GHTD 136
|
170 180
....*....|....*....|....*..
gi 1907081596 1264 HVTNVRWLHNDSvLLTVGGADTALMIW 1290
Cdd:cd00200 137 WVNSVAFSPDGT-FVASSSQDGTIKLW 162
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-466 |
1.30e-04 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 47.34 E-value: 1.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 368 AVRSVSFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100
....*....|....*....|....
gi 1907081596 443 KIGECNKSLSFIThIDWSLDSKYL 466
Cdd:COG4946 424 KVDTDGYGDGISD-LAWSPDSKWL 446
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
2.33e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.37 E-value: 2.33e-04
10 20 30
....*....|....*....|....*....|..
gi 1907081596 105 HTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1714-1747 |
2.76e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.76e-04
10 20 30
....*....|....*....|....*....|....
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWD 1747
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1723-1819 |
3.33e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 44.30 E-value: 3.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1723 WGLATHPSKD-MFISASNDGTARIWDLADKKLLNKVNLGHAARCAAYSPDGEMVAIGMKNGEFVILLV-----NTLKVWg 1796
Cdd:COG3391 113 RGLAVDPDGGrLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVsvidtATGKVV- 191
|
90 100
....*....|....*....|...
gi 1907081596 1797 KKRDRKSAIQDIRISPDNRFLAV 1819
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYV 214
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1714-1747 |
5.40e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.25 E-value: 5.40e-04
10 20 30
....*....|....*....|....*....|....
gi 1907081596 1714 IDGHmEGEIWGLATHPSKDMFISASNDGTARIWD 1747
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.40e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.40e-04
10 20 30
....*....|....*....|....*....|....*....
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1734-1862 |
8.86e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 43.14 E-value: 8.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1734 FISASNDGTARIWDLADKKLLNKVNLGHAARCAAYSPDGEMVAI-GMKNGEFVILLVNTLKVWGKKRDRKSAiQDIRISP 1812
Cdd:COG3391 83 YVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVaDSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVDP 161
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1907081596 1813 DNRFLAVGSSEQT-----VDFYDLTQGTSLNRIgyckDIPSFVIQMDFSADSKYI 1862
Cdd:COG3391 162 DGKRLYVANSGSNtvsviVSVIDTATGKVVATI----PVGGGPVGVAVSPDGRRL 212
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
105-136 |
9.65e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.48 E-value: 9.65e-04
10 20 30
....*....|....*....|....*....|..
gi 1907081596 105 HTHGVACLAFDSDGQHLASVGLDakNTVCIWD 136
Cdd:pfam00400 10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1948-1987 |
1.44e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.44e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1907081596 1948 KFAKHKRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVW 1987
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1953-1987 |
1.58e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.58e-03
10 20 30
....*....|....*....|....*....|....*
gi 1907081596 1953 KRYFGHSAHVTNIRFSSDDKYVVStGGDDCSVFVW 1987
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
1.79e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 1.79e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1907081596 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1766-1848 |
5.65e-03 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 38.03 E-value: 5.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081596 1766 AAYSPDGEMVAIGMKNGEFVILLVNTLKVWGKKRD-RKSAIQDIRISPDNRFLAVGSSEQTVDFYDLTQGTSLNRIGYCK 1844
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 1907081596 1845 DIPS 1848
Cdd:pfam12894 81 DLIT 84
|
|
|