NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1212973534|ref|NP_001340096|]
View 

dynein axonemal intermediate chain 2 isoform 3 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 9.54e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 9.54e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200    13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200    77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200   141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1212973534 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200   209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
DUF4795 super family cl23731
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 4.15e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


The actual alignment was detected with superfamily member pfam16043:

Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 4.15e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1212973534 493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 9.54e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 9.54e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200    13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200    77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200   141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1212973534 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200   209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
WD40 COG2319
WD40 repeat [General function prediction only];
150-436 2.68e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 2.68e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 150 KTINVF--RDPQEIKRAATH------LSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALKPSSPL 219
Cdd:COG2319   142 GTVRLWdlATGKLLRTLTGHsgavtsVAFSPDGKL-LASG-------------SDDGtvRLWDLAT-GKLLRTLTGHTGA 206
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 220 VT-LEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELSTiessHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:COG2319   207 VRsVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTG----HSGSVRSVAF--SPDGRLLASGSADGTVRLWDLA--- 276
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 299 eptevvildiTKKEQ--LENALGAI-SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQrnpF 375
Cdd:COG2319   277 ----------TGELLrtLTGHSGGVnSVAF-SPDGKLLASGSDDGTVRLWD----LATGKLLRTLTGHTGAVRSVA---F 338
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1212973534 376 YPK-NFLTVG--DWTARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:COG2319   339 SPDgKTLASGsdDGTVRLWDLATGELLRTLTG-HTGAVTSVAFSPDGRTL-ASGSADGTVRLWD 400
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 4.15e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 4.15e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1212973534 493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
PTZ00421 PTZ00421
coronin; Provisional
168-307 2.72e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 40.65  E-value: 2.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 168 LSWHPDGNRKLAVAyscldfqrapvGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDShVLLGGCYNGQIACWDTR 247
Cdd:PTZ00421  131 VSFHPSAMNVLASA-----------GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS-LLCTTSKDKKLNIIDPR 198
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1212973534 248 KGSLVAELSTIES--SHRdpvygTIWLQSKTG--TECFSASTDGQVMWWDIRKMSEPTEVVILD 307
Cdd:PTZ00421  199 DGTIVSSVEAHASakSQR-----CLWAKRKDLiiTLGCSKSQQRQIMLWDTRKMASPYSTVDLD 257
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 9.54e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 9.54e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200    13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200    77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200   141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1212973534 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200   209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
150-436 1.19e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 68.90  E-value: 1.19e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 150 KTINVF--RDPQEIKRAATH------LSWHPDGNrklaVAYSCldfqrapvgmSSDSYI--WDLENpNKPELALK-PSSP 218
Cdd:cd00200    73 KTIRLWdlETGECVRTLTGHtsyvssVAFSPDGR----ILSSS----------SRDKTIkvWDVET-GKCLTTLRgHTDW 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 219 LVTLEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:cd00200   138 VNSVAFSP-DGTFVASSSQDGTIKLWDLRTGKCVATL----TGHTGEVNSVAF--SPDGEKLLSSSSDGTIKLWDLS--- 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 299 eptevvilditkkeqlenalgaislefestlptkfmvgteqgiviscnrkaktsAEKIVCTFPGHHGPIYALQRNPfyPK 378
Cdd:cd00200   208 ------------------------------------------------------TGKCLGTLRGHENGVNSVAFSP--DG 231
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 379 NFLTVGDW--TARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:cd00200   232 YLLASGSEdgTIRVWDLRTGECVQTLSG-HTNSVTSLAWSPDGKRL-ASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
150-436 2.68e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 2.68e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 150 KTINVF--RDPQEIKRAATH------LSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALKPSSPL 219
Cdd:COG2319   142 GTVRLWdlATGKLLRTLTGHsgavtsVAFSPDGKL-LASG-------------SDDGtvRLWDLAT-GKLLRTLTGHTGA 206
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 220 VT-LEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELSTiessHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:COG2319   207 VRsVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTG----HSGSVRSVAF--SPDGRLLASGSADGTVRLWDLA--- 276
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 299 eptevvildiTKKEQ--LENALGAI-SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQrnpF 375
Cdd:COG2319   277 ----------TGELLrtLTGHSGGVnSVAF-SPDGKLLASGSDDGTVRLWD----LATGKLLRTLTGHTGAVRSVA---F 338
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1212973534 376 YPK-NFLTVG--DWTARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:COG2319   339 SPDgKTLASGsdDGTVRLWDLATGELLRTLTG-HTGAVTSVAFSPDGRTL-ASGSADGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
168-497 1.75e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 63.39  E-value: 1.75e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 168 LSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPkDSHVLLGGCYNGQIACWDTR 247
Cdd:COG2319    72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 248 KGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkmsepTEVVILDITKKEQLENALgAISlefes 327
Cdd:COG2319   151 TGKLLRTL----TGHSGAVTSVAF--SPDGKLLASGSDDGTVRLWDLA-----TGKLLRTLTGHTGAVRSV-AFS----- 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 328 tlPTkfmvGTeqgIVISC--NRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN-FLTVGDW--TARIWseDSRESS 399
Cdd:COG2319   214 --PD----GK---LLASGsaDGTVRlwdLATGKLLRTLTGHSGSVRSVA---FSPDGrLLASGSAdgTVRLW--DLATGE 279
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 400 IMWT-KYHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTL 478
Cdd:COG2319   280 LLRTlTGHSGGVNSVAFSPDGKLL-ASGSDDGTVRLWD--LATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRL 356
                         330       340
                  ....*....|....*....|.
gi 1212973534 479 LEVSPG--LSTLQRNEKNVAS 497
Cdd:COG2319   357 WDLATGelLRTLTGHTGAVTS 377
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
147-296 2.95e-08

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 55.42  E-value: 2.95e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 147 PSAKTINVFRDpqeIKRAATHLSWHPDGNRklaVAYSCLDfqrapvgmsSDSYIWDLENPNKPELALKPSSPLVTLEFNP 226
Cdd:cd00200   123 ETGKCLTTLRG---HTDWVNSVAFSPDGTF---VASSSQD---------GTIKLWDLRTGKCVATLTGHTGEVNSVAFSP 187
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 227 KDSHVLLGGCyNGQIACWDTRKGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRK 296
Cdd:cd00200   188 DGEKLLSSSS-DGTIKLWDLSTGKCLGTL----RGHENGVNSVAF--SPDGYLLASGSEDGTIRVWDLRT 250
WD40 COG2319
WD40 repeat [General function prediction only];
164-295 5.84e-06

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.14  E-value: 5.84e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 164 AATHLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENPnKPELALKPSSPLVT-LEFNPkDSHVLLGGCYNGQ 240
Cdd:COG2319   290 GVNSVAFSPDGKL-LASG-------------SDDGtvRLWDLATG-KLLRTLTGHTGAVRsVAFSP-DGKTLASGSDDGT 353
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1212973534 241 IACWDTRKGSLVAELStiesSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIR 295
Cdd:COG2319   354 VRLWDLATGELLRTLT----GHTGAVTSVAF--SPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
163-497 1.17e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.98  E-value: 1.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 163 RAATHLSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:COG2319    25 LGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP-DGRLLASASADGTVR 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 243 CWDTRKGSLVAELStiesSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIrkmseptevvilditkkeqlenalgais 322
Cdd:COG2319   104 LWDLATGLLLRTLT----GHTGAVRSVAF--SPDGKTLASGSADGTVRLWDL---------------------------- 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 323 lefestlptkfmvgtEQGiviscnrkaktsaeKIVCTFPGHHGPIYALQRNPfyPKNFLTVGDW--TARIWSEDSRESSI 400
Cdd:COG2319   150 ---------------ATG--------------KLLRTLTGHSGAVTSVAFSP--DGKLLASGSDdgTVRLWDLATGKLLR 198
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 401 MWTKyHMAYLTDAAWSPvRPTVFFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLE 480
Cdd:COG2319   199 TLTG-HTGAVRSVAFSP-DGKLLASGSADGTVRLWD--LATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
                         330
                  ....*....|....*....
gi 1212973534 481 VSPG--LSTLQRNEKNVAS 497
Cdd:COG2319   275 LATGelLRTLTGHSGGVNS 293
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
322-478 1.43e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 47.33  E-value: 1.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 322 SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQRNPFYPKnFLTVG-DWTARIWSEDSRESSI 400
Cdd:cd00200    14 CVAF-SPDGKLLATGSGDGTIKVWD----LETGELLRTLKGHTGPVRDVAASADGTY-LASGSsDKTIRLWDLETGECVR 87
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1212973534 401 MWTKyHMAYLTDAAWSPVRPtVFFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTL 478
Cdd:cd00200    88 TLTG-HTSYVSSVAFSPDGR-ILSSSSRDKTIKVWD--VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 4.15e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 4.15e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1212973534 493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
357-484 1.23e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.17  E-value: 1.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 357 VCTFPGHHGPIYALQRNPfyPKNFLTVG--DWTARIWseDSRESSIMWT-KYHMAYLTDAAWSPVRPTVfFTTRMDGTLD 433
Cdd:cd00200     2 RRTLKGHTGGVTCVAFSP--DGKLLATGsgDGTIKVW--DLETGELLRTlKGHTGPVRDVAASADGTYL-ASGSSDKTIR 76
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1212973534 434 IWDFMFEQCdpTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLEVSPG 484
Cdd:cd00200    77 LWDLETGEC--VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
PTZ00421 PTZ00421
coronin; Provisional
168-307 2.72e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 40.65  E-value: 2.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 168 LSWHPDGNRKLAVAyscldfqrapvGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDShVLLGGCYNGQIACWDTR 247
Cdd:PTZ00421  131 VSFHPSAMNVLASA-----------GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS-LLCTTSKDKKLNIIDPR 198
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1212973534 248 KGSLVAELSTIES--SHRdpvygTIWLQSKTG--TECFSASTDGQVMWWDIRKMSEPTEVVILD 307
Cdd:PTZ00421  199 DGTIVSSVEAHASakSQR-----CLWAKRKDLiiTLGCSKSQQRQIMLWDTRKMASPYSTVDLD 257
PTZ00420 PTZ00420
coronin; Provisional
77-307 5.27e-03

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 39.93  E-value: 5.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534  77 VEGGwpKDVNPLELEQTIRfRKKVEKDENYVNAIMQL------GSIMEHCiKQNNAIDIYEEYFNDEEAMEVmeedpsak 150
Cdd:PTZ00420   47 VEGG--GLIGAIRLENQMR-KPPVIKLKGHTSSILDLqfnpcfSEILASG-SEDLTIRVWEIPHNDESVKEI-------- 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 151 tinvfRDPQEI----KRAATHLSWHPdgnrklaVAYscldFQRAPVGMSSDSYIWDLENPNKPELALKPSSpLVTLEFNP 226
Cdd:PTZ00420  115 -----KDPQCIlkghKKKISIIDWNP-------MNY----YIMCSSGFDSFVNIWDIENEKRAFQINMPKK-LSSLKWNI 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1212973534 227 KDShVLLGGCYNGQIACWDTRKGSLVAELStIESSHRDPvyGTIWLQSKTG------TECFSASTDGQVMWWDIRKMSEP 300
Cdd:PTZ00420  178 KGN-LLSGTCVGKHMHIIDPRKQEIASSFH-IHDGGKNT--KNIWIDGLGGddnyilSTGFSKNNMREMKLWDLKNTTSA 253

                  ....*..
gi 1212973534 301 TEVVILD 307
Cdd:PTZ00420  254 LVTMSID 260
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH