|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
121-402 |
1.51e-66 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 226.83 E-value: 1.51e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNMNN-VKMFQ 199
Cdd:cd00200 11 GVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEcVRTLT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:cd00200 91 GHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT--IKLWDLRTGK 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFW 359
Cdd:cd00200 169 CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLST-GKCLGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVW 246
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 56243590 360 HVGVEKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:cd00200 247 DLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-402 |
1.50e-57 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 204.76 E-value: 1.50e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNMNN-VKMFQ 199
Cdd:COG2319 122 AVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKlLRTLT 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:COG2319 202 GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGT--VRLWDLATGE 279
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFW 359
Cdd:COG2319 280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLAT-GKLLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLW 357
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 56243590 360 HVGVEKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:COG2319 358 DLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
191-230 |
7.16e-09 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 52.70 E-value: 7.16e-09
10 20 30 40
....*....|....*....|....*....|....*....|
gi 56243590 191 NMNNVKMFQAHKEAIREASFSPTDNKFATCSDDGTVRIWD 230
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
205-319 |
2.82e-08 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 57.98 E-value: 2.82e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 205 IREASFSPTDN-KFATCSDDGTVRIWDFlrcHEERI----------LRGHGADVKCVDWHPT-KGLVVSGSKDSQqpIKF 272
Cdd:PTZ00421 78 IIDVAFNPFDPqKLFTASEDGTIMGWGI---PEEGLtqnisdpivhLQGHTKKVGIVSFHPSaMNVLASAGADMV--VNV 152
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 56243590 273 WDPKTGQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRN 319
Cdd:PTZ00421 153 WDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD 199
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
618-864 |
3.92e-08 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 57.22 E-value: 3.92e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 618 GPPGPQGQFRPPGPQGQMGPQGPPlhqggggPQGFMGPQGPQGPPQGLPRPQDMHGPQGMQrhpgphgplgpqgppgpqg 697
Cdd:NF038329 123 GPAGPAGPAGEQGPRGDRGETGPA-------GPAGPPGPQGERGEKGPAGPQGEAGPQGPA------------------- 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 698 ssgpqghmgpqgppgpqghigpqgppGPQGHLGPQGPPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLM 777
Cdd:NF038329 177 --------------------------GKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPA 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 778 GLNPRGMQGPPGPRENQGPApqgmimghppqemrGPHPPGgllghGPQEMRGPQEIRGMQGPPPQGSMLGPPqelrGPPG 857
Cdd:NF038329 231 GDGQQGPDGDPGPTGEDGPQ--------------GPDGPA-----GKDGPRGDRGEAGPDGPDGKDGERGPV----GPAG 287
|
....*..
gi 56243590 858 SQSQQGP 864
Cdd:NF038329 288 KDGQNGK 294
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
195-230 |
7.54e-08 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 49.65 E-value: 7.54e-08
10 20 30
....*....|....*....|....*....|....*.
gi 56243590 195 VKMFQAHKEAIREASFSPTDNKFATCSDDGTVRIWD 230
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
585-863 |
3.11e-07 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 54.53 E-value: 3.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 585 PQPFPGQGPmsQIPQGFQQPHPSQQMPMNMAQMGPPGPQGQFRPPGPQGQMGPQGPplhqggggpqgfmgpqgpqgppqg 664
Cdd:NF038329 122 PGPAGPAGP--AGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP------------------------ 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 665 lPRPQDMHGPQGMQrhpgphgplgpqgppgpqgssgpqghmgpqgppgpqghigpqgppgpqghlGPQGPPGTQGMQGPP 744
Cdd:NF038329 176 -AGKDGEAGAKGPA---------------------------------------------------GEKGPQGPRGETGPA 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 745 GPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPlmgLNPRGMQGPPGPRENQGPAPQGmimghppqemrGPHPPGGLLGH-G 823
Cdd:NF038329 204 GEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---QGPDGDPGPTGEDGPQGPDGPA-----------GKDGPRGDRGEaG 269
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 56243590 824 PQEMRGPQEIRGMQGPPPQGSMLGPpqelRGPPGSQSQQG 863
Cdd:NF038329 270 PDGPDGKDGERGPVGPAGKDGQNGK----DGLPGKDGKDG 305
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
564-932 |
3.92e-07 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 54.63 E-value: 3.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 564 LAQKQVEQIQPPPSSGT--PLLGPQPFPGQGPMsqiPQGFQQPHPSQQMPMNmAQMGPPGPQG--QFRPPGPQGQMGPQG 639
Cdd:pfam09606 63 PQGGQGNGGMGGGQQGMpdPINALQNLAGQGTR---PQMMGPMGPGPGGPMG-QQMGGPGTASnlLASLGRPQMPMGGAG 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 640 PPlhqggGGPQGFMGPQGPQGPPQGLPRPQDMHGPQGMQRHPGPHGPLGPQGPPGPQGSSGPQGHMGPQGPPGPQGHIGP 719
Cdd:pfam09606 139 FP-----SQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPA 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 720 QGPPGPQGHLGPQGPPGTQGMQGPPGPRGMQGPPHPHGI----------------QGGPGSQGIQGPVSQGPLMGLNPRG 783
Cdd:pfam09606 214 DAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGqqsqlgmginqmqqmpQGVGGGAGQGGPGQPMGPPGQQPGA 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 784 M------QGPPGPRENQGPAPQ----GMIMGHPPQEMRGPHPPGGLLGHGPQEM---------RGPQEIRGMQGPPPQgs 844
Cdd:pfam09606 294 MpnvmsiGDQNNYQQQQTRQQQqqqgGNHPAAHQQQMNQSVGQGGQVVALGGLNhletwnpgnFGGLGANPMQRGQPG-- 371
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 845 MLGPPQELRG-------PPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAP 917
Cdd:pfam09606 372 MMSSPSPVPGqqvrqvtPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQRTIGQDSPGGS 451
|
410
....*....|....*
gi 56243590 918 FNQEGQSTGPPPLIP 932
Cdd:pfam09606 452 LNTPGQSAVNSPLNP 466
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
556-676 |
2.16e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.65 E-value: 2.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 556 LEQLKIERLAQKQVE--QIQPP---PSSGTPLLG---PQPFPGQGPMSQIPQ-----GFQQPHPSQQMPMnmaqmGPPGP 622
Cdd:TIGR01628 360 LAQRKEQRRAHLQDQfmQLQPRmrqLPMGSPMGGamgQPPYYGQGPQQQFNGqplgwPRMSMMPTPMGPG-----GPLRP 434
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 56243590 623 QGqFRPPGPQGQMGPQGPPL-HQGGGGPQGFMGPQGPQGPPQGLPRPQDMHGPQG 676
Cdd:TIGR01628 435 NG-LAPMNAVRAPSRNAQNAaQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
796-946 |
5.52e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.72 E-value: 5.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 796 PAPQGMIMGHPPQEmrGPHPPGGLLGHGPQEMRGPQEIRGMQGPPPQGSMLG-PPQELRGPPGSQSQQGPPQGSlgpppq 874
Cdd:pfam09770 211 AQQPAPAPAQPPAA--PPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGhPVTILQRPQSPQPDPAQPSIQ------ 282
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 56243590 875 ggmqgppgpqGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAPFNQEGQ-STGPPPLIPGLGQQGAQGRIPPL 946
Cdd:pfam09770 283 ----------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQpGVQPAPAHQAHRQQGSFGRQAPI 345
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
735-984 |
6.57e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.37 E-value: 6.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 735 PGTQGMQGPPGPRGMQGPPHPHGIQGGPGsqgiqgPVSQGPLMGLNPRGMQGPPGPRENQGPAPQGMIMGHPPQEMRGPH 814
Cdd:PHA03378 598 PVPHPSQTPEPPTTQSHIPETSAPRQWPM------PLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGH 671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 815 PPGGLLGHGPQEMRGPQEIRGMQGPPPQGSMLGPPQElrGPPGSQSqqgPPQGSLGPPPQGGMQGPPGPQGQQNPARGPH 894
Cdd:PHA03378 672 IPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPA--APPGRAQ---RPAAATGRARPPAAAPGRARPPAAAPGRARP 746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 895 PSQGPIPFQQqktPLLGDGP-RAPFNQEGQST-GPPPLIPGLGQQGAQGRIPPLNPGQ-GPGPNKGDSRGPPNHHMGPMS 971
Cdd:PHA03378 747 PAAAPGRARP---PAAAPGRaRPPAAAPGAPTpQPPPQAPPAPQQRPRGAPTPQPPPQaGPTSMQLMPRAAPGQQGPTKQ 823
|
250
....*....|...
gi 56243590 972 ERRHEQSGGPEHG 984
Cdd:PHA03378 824 ILRQLLTGGVKRG 836
|
|
| SPT5 |
COG5164 |
Transcription elongation factor SPT5 [Transcription]; |
666-949 |
8.07e-05 |
|
Transcription elongation factor SPT5 [Transcription];
Pssm-ID: 444063 [Multi-domain] Cd Length: 495 Bit Score: 46.95 E-value: 8.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 666 PRPQDMHGPQGMQRHPGPHGPLGPQGPPGPQGSSGPQGHMgpqgppgpqghigpqgppgpqghlGPQGPPGTQGMQGPPG 745
Cdd:COG5164 12 SDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQ------------------------GSTTPAGNTGGTRPAG 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 746 PRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLMG-LNPRGMQGPPGPRENQGPAPQGMIMGHPPQEMRGPHPPGGLLGHGP 824
Cdd:COG5164 68 NQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTpAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGP 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 825 qemrGPQEIRGMQGPPPQGSMLGPPQE--LRGPPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPF 902
Cdd:COG5164 148 ----GSTGPGGSTTPPGDGGSTTPPGPggSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 56243590 903 QQQKTPllGDGPRAPFNQEGQS--TGPPPLIPGLGQQGAQGRIPPLNPG 949
Cdd:COG5164 224 RGGKTG--PKDQRPKTNPIERRgpERPEAAALPAELTALEAENRAANPE 270
|
|
| PRK13729 |
PRK13729 |
conjugal transfer pilus assembly protein TraB; Provisional |
509-629 |
3.08e-04 |
|
conjugal transfer pilus assembly protein TraB; Provisional
Pssm-ID: 184281 [Multi-domain] Cd Length: 475 Bit Score: 44.81 E-value: 3.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 509 MQNKVPIPAPNEV-----------LNDRKEDIKLEEKKKTQAEIEQEMATLQ-----YTNPQLLEQLKIERLAQ------ 566
Cdd:PRK13729 38 MSGNGEAVAEQEPvpdmtgvvdttFDDKVRQHATTEMQVTAAQMQKQYEEIRreldvLNKQRGDDQRRIEKLGQdnaala 117
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 56243590 567 KQVEQI---------QPPPSSGTPLLGPQ--PFPGQGPMSQIPQGFQQPHPSQQMPMNMAQMGPPGPQGQFRPP 629
Cdd:PRK13729 118 EQVKALganpvtatgEPVPQMPASPPGPEgePQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVPV 191
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
533-643 |
2.90e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.95 E-value: 2.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 533 EKKKTQAEIEQEMATLQYTNPQL------LEQLKIERLAQKQVEQIQPPPSSGTPLLGPQPFPGQGPMSQIPQGFQQPHP 606
Cdd:pfam09770 167 PKKAAAPAPAPQPAAQPASLPAPsrkmmsLEEVEAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQP 246
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 56243590 607 sQQMPMNMAQMGPPGP-------QGQFRPPGPQGQMGPQGPPLH 643
Cdd:pfam09770 247 -QQQPQQPQQHPGQGHpvtilqrPQSPQPDPAQPSIQPQAQQFH 289
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
733-1027 |
5.67e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 5.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 733 GPPGTQGMQGPPGPRGMQGPPH--PHGIQGGPGsqgiQGPVSQGPLMGLNPRGMQG----------PPGPRENQGPAPQG 800
Cdd:PHA03247 2639 DPHPPPTVPPPERPRDDPAPGRvsRPRRARRLG----RAAQASSPPQRPRRRAARPtvgsltsladPPPPPPTPEPAPHA 2714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 801 MIMGHP----PQEMRGPHPP------------GGLLGHGPQEMRGPQEIRGMQGP-PPQGSMLGPPQELRGPPGSQSQQG 863
Cdd:PHA03247 2715 LVSATPlppgPAAARQASPAlpaapappavpaGPATPGGPARPARPPTTAGPPAPaPPAAPAAGPPRRLTRPAVASLSES 2794
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 864 PPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAPFNQEGQSTGP-------PPLIPGLGQ 936
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrPPSRSPAAK 2874
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 937 QGAQGRI-------PPLNPGQGPGPNKGDSRGPPNHHMGPMSERRHEQSGGPEHgPERGPFRGGQdcrgPPDRRGPHPDf 1009
Cdd:PHA03247 2875 PAAPARPpvrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPR----PQPPLAPTTD- 2948
|
330
....*....|....*...
gi 56243590 1010 PDDFSRPDDFHPDKRFGH 1027
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGA 2966
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
121-402 |
1.51e-66 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 226.83 E-value: 1.51e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNMNN-VKMFQ 199
Cdd:cd00200 11 GVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEcVRTLT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:cd00200 91 GHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT--IKLWDLRTGK 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFW 359
Cdd:cd00200 169 CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLST-GKCLGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVW 246
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 56243590 360 HVGVEKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:cd00200 247 DLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-402 |
1.50e-57 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 204.76 E-value: 1.50e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNMNN-VKMFQ 199
Cdd:COG2319 122 AVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKlLRTLT 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:COG2319 202 GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGT--VRLWDLATGE 279
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFW 359
Cdd:COG2319 280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLAT-GKLLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLW 357
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 56243590 360 HVGVEKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:COG2319 358 DLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-402 |
2.38e-52 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 189.74 E-value: 2.38e-52
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQ-SNMNNVKMFQ 199
Cdd:COG2319 38 AVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDlATGLLLRTLT 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:COG2319 118 GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGT--VRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvhEG-LFASGGSDGSLLF 358
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLAT-GKLLRTLTGHSGSVRSVAFSP--DGrLLASGSADGTVRL 272
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 56243590 359 WHVGvEKEVGGMEMAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:COG2319 273 WDLA-TGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW 315
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-361 |
1.94e-47 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 175.48 E-value: 1.94e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWqsNMNN---VKM 197
Cdd:COG2319 164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW--DLATgklLRT 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 198 FQAHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKT 277
Cdd:COG2319 242 LTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGT--VRLWDLAT 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 278 GQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLL 357
Cdd:COG2319 320 GKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-GELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 56243590 358 FWHV 361
Cdd:COG2319 398 LWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
194-402 |
4.11e-43 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 159.42 E-value: 4.11e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 194 NVKMFQAHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFW 273
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKT--IRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 274 DPKTGQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvhEGLFASGGS- 352
Cdd:cd00200 79 DLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVET-GKCLTTLRGHTDWVNSVAFSP--DGTFVASSSq 155
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 56243590 353 DGSLLFWHVGVEKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:cd00200 156 DGTIKLWDLRTGKCVATLT-GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
126-402 |
3.17e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 160.08 E-value: 3.17e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 126 RWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHND-MWMLTADHGGYVKYWQSNMNNVKMFQAHKEA 204
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGaRLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 205 IREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQSLATL 284
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTL 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 285 HAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNLKeELQVFRGHKKEATAVAWHPvhEG-LFASGGSDGSLLFWHVGV 363
Cdd:COG2319 159 TGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGK-LLRTLTGHTGAVRSVAFSP--DGkLLASGSADGTVRLWDLAT 235
|
250 260 270
....*....|....*....|....*....|....*....
gi 56243590 364 EKEVGGMEmAHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:COG2319 236 GKLLRTLT-GHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
119-359 |
4.22e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 147.48 E-value: 4.22e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 119 KCPVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAmtwshndmwmltadhggyvkywqsnmnnvkmf 198
Cdd:cd00200 93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS-------------------------------- 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 199 qahkeaireASFSPtDNKF-ATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKT 277
Cdd:cd00200 141 ---------VAFSP-DGTFvASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLST 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 278 GQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRNlKEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLL 357
Cdd:cd00200 209 GKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT-GECVQTLSGHTNSVTSLAWSP-DGKRLASGSADGTIR 286
|
..
gi 56243590 358 FW 359
Cdd:cd00200 287 IW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-319 |
7.38e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 150.45 E-value: 7.38e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 121 PVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQ-SNMNNVKMFQ 199
Cdd:COG2319 206 AVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDlATGELLRTLT 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 200 AHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWDPKTGQ 279
Cdd:COG2319 286 GHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGT--VRLWDLATGE 363
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 56243590 280 SLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRN 319
Cdd:COG2319 364 LLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
119-274 |
1.52e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 117.05 E-value: 1.52e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 119 KCPVFVVRWTPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNM-NNVKM 197
Cdd:cd00200 135 TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTgKCLGT 214
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 56243590 198 FQAHKEAIREASFSPTDNKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWD 274
Cdd:cd00200 215 LRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGT--IRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
191-230 |
7.16e-09 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 52.70 E-value: 7.16e-09
10 20 30 40
....*....|....*....|....*....|....*....|
gi 56243590 191 NMNNVKMFQAHKEAIREASFSPTDNKFATCSDDGTVRIWD 230
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
205-319 |
2.82e-08 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 57.98 E-value: 2.82e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 205 IREASFSPTDN-KFATCSDDGTVRIWDFlrcHEERI----------LRGHGADVKCVDWHPT-KGLVVSGSKDSQqpIKF 272
Cdd:PTZ00421 78 IIDVAFNPFDPqKLFTASEDGTIMGWGI---PEEGLtqnisdpivhLQGHTKKVGIVSFHPSaMNVLASAGADMV--VNV 152
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 56243590 273 WDPKTGQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFDIRN 319
Cdd:PTZ00421 153 WDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD 199
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
618-864 |
3.92e-08 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 57.22 E-value: 3.92e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 618 GPPGPQGQFRPPGPQGQMGPQGPPlhqggggPQGFMGPQGPQGPPQGLPRPQDMHGPQGMQrhpgphgplgpqgppgpqg 697
Cdd:NF038329 123 GPAGPAGPAGEQGPRGDRGETGPA-------GPAGPPGPQGERGEKGPAGPQGEAGPQGPA------------------- 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 698 ssgpqghmgpqgppgpqghigpqgppGPQGHLGPQGPPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLM 777
Cdd:NF038329 177 --------------------------GKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPA 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 778 GLNPRGMQGPPGPRENQGPApqgmimghppqemrGPHPPGgllghGPQEMRGPQEIRGMQGPPPQGSMLGPPqelrGPPG 857
Cdd:NF038329 231 GDGQQGPDGDPGPTGEDGPQ--------------GPDGPA-----GKDGPRGDRGEAGPDGPDGKDGERGPV----GPAG 287
|
....*..
gi 56243590 858 SQSQQGP 864
Cdd:NF038329 288 KDGQNGK 294
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
195-230 |
7.54e-08 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 49.65 E-value: 7.54e-08
10 20 30
....*....|....*....|....*....|....*.
gi 56243590 195 VKMFQAHKEAIREASFSPTDNKFATCSDDGTVRIWD 230
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
215-359 |
1.86e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 55.86 E-value: 1.86e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 215 NKFATCSDDGTVRIWDFLRCHEERILRGHGADVKCVDWH---PTkgLVVSGSKDSQqpIKFWDPKTGQSLATLHAHKNTV 291
Cdd:PLN00181 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSsadPT--LLASGSDDGS--VKLWSINQGVSIGTIKTKANIC 621
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 56243590 292 MEVKLNLNGNWLLTASRDHLCKLFDIRNLKEELQVFRGHKKEATAVAWhpVHEGLFASGGSDGSLLFW 359
Cdd:PLN00181 622 CVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRF--VDSSTLVSSSTDNTLKLW 687
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
585-863 |
3.11e-07 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 54.53 E-value: 3.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 585 PQPFPGQGPmsQIPQGFQQPHPSQQMPMNMAQMGPPGPQGQFRPPGPQGQMGPQGPplhqggggpqgfmgpqgpqgppqg 664
Cdd:NF038329 122 PGPAGPAGP--AGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP------------------------ 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 665 lPRPQDMHGPQGMQrhpgphgplgpqgppgpqgssgpqghmgpqgppgpqghigpqgppgpqghlGPQGPPGTQGMQGPP 744
Cdd:NF038329 176 -AGKDGEAGAKGPA---------------------------------------------------GEKGPQGPRGETGPA 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 745 GPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPlmgLNPRGMQGPPGPRENQGPAPQGmimghppqemrGPHPPGGLLGH-G 823
Cdd:NF038329 204 GEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---QGPDGDPGPTGEDGPQGPDGPA-----------GKDGPRGDRGEaG 269
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 56243590 824 PQEMRGPQEIRGMQGPPPQGSMLGPpqelRGPPGSQSQQG 863
Cdd:NF038329 270 PDGPDGKDGERGPVGPAGKDGQNGK----DGLPGKDGKDG 305
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
564-932 |
3.92e-07 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 54.63 E-value: 3.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 564 LAQKQVEQIQPPPSSGT--PLLGPQPFPGQGPMsqiPQGFQQPHPSQQMPMNmAQMGPPGPQG--QFRPPGPQGQMGPQG 639
Cdd:pfam09606 63 PQGGQGNGGMGGGQQGMpdPINALQNLAGQGTR---PQMMGPMGPGPGGPMG-QQMGGPGTASnlLASLGRPQMPMGGAG 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 640 PPlhqggGGPQGFMGPQGPQGPPQGLPRPQDMHGPQGMQRHPGPHGPLGPQGPPGPQGSSGPQGHMGPQGPPGPQGHIGP 719
Cdd:pfam09606 139 FP-----SQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPA 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 720 QGPPGPQGHLGPQGPPGTQGMQGPPGPRGMQGPPHPHGI----------------QGGPGSQGIQGPVSQGPLMGLNPRG 783
Cdd:pfam09606 214 DAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGqqsqlgmginqmqqmpQGVGGGAGQGGPGQPMGPPGQQPGA 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 784 M------QGPPGPRENQGPAPQ----GMIMGHPPQEMRGPHPPGGLLGHGPQEM---------RGPQEIRGMQGPPPQgs 844
Cdd:pfam09606 294 MpnvmsiGDQNNYQQQQTRQQQqqqgGNHPAAHQQQMNQSVGQGGQVVALGGLNhletwnpgnFGGLGANPMQRGQPG-- 371
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 845 MLGPPQELRG-------PPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAP 917
Cdd:pfam09606 372 MMSSPSPVPGqqvrqvtPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQRTIGQDSPGGS 451
|
410
....*....|....*
gi 56243590 918 FNQEGQSTGPPPLIP 932
Cdd:pfam09606 452 LNTPGQSAVNSPLNP 466
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
238-274 |
5.82e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 5.82e-07
10 20 30
....*....|....*....|....*....|....*..
gi 56243590 238 RILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWD 274
Cdd:smart00320 6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGT--IKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
238-274 |
3.35e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 45.03 E-value: 3.35e-06
10 20 30
....*....|....*....|....*....|....*..
gi 56243590 238 RILRGHGADVKCVDWHPTKGLVVSGSKDSQqpIKFWD 274
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLASGSDDGT--VKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
321-360 |
4.26e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.61 E-value: 4.26e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 56243590 321 KEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFWH 360
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
730-950 |
5.87e-06 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 50.78 E-value: 5.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 730 GPQGPPGTQgmQGPPG--------------PRGMQGPPHPHGIQGGPGSQGIQGPVSQgPLMGLNPRGMQGPPGPRENQG 795
Cdd:pfam09606 105 GPGGPMGQQ--MGGPGtasnllaslgrpqmPMGGAGFPSQMSRVGRMQPGGQAGGMMQ-PSSGQPGSGTPNQMGPNGGPG 181
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 796 --------PAPQGMIMGHPPQEMRGPHPPGGLLGHGpqemRGPQEIRGMQGPPPQGSMLGPPQELRGppGSQSQQGPPQG 867
Cdd:pfam09606 182 qgqaggmnGGQQGPMGGQMPPQMGVPGMPGPADAGA----QMGQQAQANGGMNPQQMGGAPNQVAMQ--QQQPQQQGQQS 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 868 SLGPPPQGGMQGPPGPQGQQnPARGPHPSQGPIPFQQQKTPLLGdGPRAPFNQEGQSTGPPplipglgQQGAQGRIPPLN 947
Cdd:pfam09606 256 QLGMGINQMQQMPQGVGGGA-GQGGPGQPMGPPGQQPGAMPNVM-SIGDQNNYQQQQTRQQ-------QQQQGGNHPAAH 326
|
...
gi 56243590 948 PGQ 950
Cdd:pfam09606 327 QQQ 329
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
373-402 |
7.17e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.23 E-value: 7.17e-06
10 20 30
....*....|....*....|....*....|
gi 56243590 373 AHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:smart00320 10 GHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
277-316 |
1.15e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.46 E-value: 1.15e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 56243590 277 TGQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFD 316
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
278-316 |
1.40e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.10 E-value: 1.40e-05
10 20 30
....*....|....*....|....*....|....*....
gi 56243590 278 GQSLATLHAHKNTVMEVKLNLNGNWLLTASRDHLCKLFD 316
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
612-989 |
1.50e-05 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 49.62 E-value: 1.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 612 MNMAQMGPPGPQGQFRPPGPQGQMGPQGPPLHQGGGGPQGFMGPQGPQgppqglPRPQDMHGPQGMQRHPGPHGPLGPQG 691
Cdd:pfam09606 53 MSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMG------PMGPGPGGPMGQQMGGPGTASNLLAS 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 692 PPGPQGSSGpqghmgpqgppgpqghiGPQGPPGPQGHLGPQGPPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGPV 771
Cdd:pfam09606 127 LGRPQMPMG-----------------GAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMN 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 772 --SQGPLMGLNPRGM--QGPPGPRE--NQGPAPQGMIMGHPPQEMRGPHP-------PGGLLGHGPQEMRGPQEIRGMQG 838
Cdd:pfam09606 190 ggQQGPMGGQMPPQMgvPGMPGPADagAQMGQQAQANGGMNPQQMGGAPNqvamqqqQPQQQGQQSQLGMGINQMQQMPQ 269
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 839 PPPQGSMLGPPQELRGPPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQqnpARGPHPSQGPipfQQQKTPLLGDGPRAPF 918
Cdd:pfam09606 270 GVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQ---QGGNHPAAHQ---QQMNQSVGQGGQVVAL 343
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 56243590 919 NQEGQS-TGPPPLIPGLGQQGAQGRIPPLNPGQGPGPNKGDSRGPPNHHMGPMSERRHEQSGGP-EHGPERGP 989
Cdd:pfam09606 344 GGLNHLeTWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPQGPgSQPPQSHP 416
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
556-676 |
2.16e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.65 E-value: 2.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 556 LEQLKIERLAQKQVE--QIQPP---PSSGTPLLG---PQPFPGQGPMSQIPQ-----GFQQPHPSQQMPMnmaqmGPPGP 622
Cdd:TIGR01628 360 LAQRKEQRRAHLQDQfmQLQPRmrqLPMGSPMGGamgQPPYYGQGPQQQFNGqplgwPRMSMMPTPMGPG-----GPLRP 434
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 56243590 623 QGqFRPPGPQGQMGPQGPPL-HQGGGGPQGFMGPQGPQGPPQGLPRPQDMHGPQG 676
Cdd:TIGR01628 435 NG-LAPMNAVRAPSRNAQNAaQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
373-402 |
5.50e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.56 E-value: 5.50e-05
10 20 30
....*....|....*....|....*....|
gi 56243590 373 AHEGMIWSLAWHPLGHILCSGSNDHTSKFW 402
Cdd:pfam00400 9 GHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
796-946 |
5.52e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.72 E-value: 5.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 796 PAPQGMIMGHPPQEmrGPHPPGGLLGHGPQEMRGPQEIRGMQGPPPQGSMLG-PPQELRGPPGSQSQQGPPQGSlgpppq 874
Cdd:pfam09770 211 AQQPAPAPAQPPAA--PPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGhPVTILQRPQSPQPDPAQPSIQ------ 282
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 56243590 875 ggmqgppgpqGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAPFNQEGQ-STGPPPLIPGLGQQGAQGRIPPL 946
Cdd:pfam09770 283 ----------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQpGVQPAPAHQAHRQQGSFGRQAPI 345
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
288-398 |
6.50e-05 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 47.20 E-value: 6.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 288 KNTVMEVKLN-LNGNWLLTASRDHLCKLFDI------RNLKEELQVFRGHKKEATAVAWHPVHEGLFASGGSDGSLLFWH 360
Cdd:PTZ00421 75 EGPIIDVAFNpFDPQKLFTASEDGTIMGWGIpeegltQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWD 154
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 56243590 361 V--GVEKEVggmEMAHEGMIWSLAWHPLGHILCSGSNDHT 398
Cdd:PTZ00421 155 VerGKAVEV---IKCHSDQITSLEWNLDGSLLCTTSKDKK 191
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
735-984 |
6.57e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.37 E-value: 6.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 735 PGTQGMQGPPGPRGMQGPPHPHGIQGGPGsqgiqgPVSQGPLMGLNPRGMQGPPGPRENQGPAPQGMIMGHPPQEMRGPH 814
Cdd:PHA03378 598 PVPHPSQTPEPPTTQSHIPETSAPRQWPM------PLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGH 671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 815 PPGGLLGHGPQEMRGPQEIRGMQGPPPQGSMLGPPQElrGPPGSQSqqgPPQGSLGPPPQGGMQGPPGPQGQQNPARGPH 894
Cdd:PHA03378 672 IPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPA--APPGRAQ---RPAAATGRARPPAAAPGRARPPAAAPGRARP 746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 895 PSQGPIPFQQqktPLLGDGP-RAPFNQEGQST-GPPPLIPGLGQQGAQGRIPPLNPGQ-GPGPNKGDSRGPPNHHMGPMS 971
Cdd:PHA03378 747 PAAAPGRARP---PAAAPGRaRPPAAAPGAPTpQPPPQAPPAPQQRPRGAPTPQPPPQaGPTSMQLMPRAAPGQQGPTKQ 823
|
250
....*....|...
gi 56243590 972 ERRHEQSGGPEHG 984
Cdd:PHA03378 824 ILRQLLTGGVKRG 836
|
|
| SPT5 |
COG5164 |
Transcription elongation factor SPT5 [Transcription]; |
666-949 |
8.07e-05 |
|
Transcription elongation factor SPT5 [Transcription];
Pssm-ID: 444063 [Multi-domain] Cd Length: 495 Bit Score: 46.95 E-value: 8.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 666 PRPQDMHGPQGMQRHPGPHGPLGPQGPPGPQGSSGPQGHMgpqgppgpqghigpqgppgpqghlGPQGPPGTQGMQGPPG 745
Cdd:COG5164 12 SDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQ------------------------GSTTPAGNTGGTRPAG 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 746 PRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLMG-LNPRGMQGPPGPRENQGPAPQGMIMGHPPQEMRGPHPPGGLLGHGP 824
Cdd:COG5164 68 NQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTpAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGP 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 825 qemrGPQEIRGMQGPPPQGSMLGPPQE--LRGPPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPF 902
Cdd:COG5164 148 ----GSTGPGGSTTPPGDGGSTTPPGPggSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 56243590 903 QQQKTPllGDGPRAPFNQEGQS--TGPPPLIPGLGQQGAQGRIPPLNPG 949
Cdd:COG5164 224 RGGKTG--PKDQRPKTNPIERRgpERPEAAALPAELTALEAENRAANPE 270
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
321-359 |
1.60e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.41 E-value: 1.60e-04
10 20 30
....*....|....*....|....*....|....*....
gi 56243590 321 KEELQVFRGHKKEATAVAWHPvHEGLFASGGSDGSLLFW 359
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSP-DGKLLASGSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
150-188 |
2.25e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.25e-04
10 20 30
....*....|....*....|....*....|....*....
gi 56243590 150 TFNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYW 188
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
731-969 |
2.38e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 2.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 731 PQGPPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLMGLNPRGMQGPPGPRENQGPAPQGMI------MG 804
Cdd:PHA03247 2757 PARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppptsaQP 2836
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 805 HPPQEMRGPHPP-----GGLLGHGPQEMRGPQeirGMQGPPPQGSMLGPPQELRGPPGSQSQQGPPQgslgppPQGGMQG 879
Cdd:PHA03247 2837 TAPPPPPGPPPPslplgGSVAPGGDVRRRPPS---RSPAAKPAAPARPPVRRLARPAVSRSTESFAL------PPDQPER 2907
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 880 PPGPQGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAPFNQEGQSTGPPPLIPGlGQQGA---------QGRIPPLNPgq 950
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ-PWLGAlvpgrvavpRFRVPQPAP-- 2984
|
250
....*....|....*....
gi 56243590 951 gPGPNKGDSRGPPNHHMGP 969
Cdd:PHA03247 2985 -SREAPASSTPPLTGHSLS 3002
|
|
| PRK13729 |
PRK13729 |
conjugal transfer pilus assembly protein TraB; Provisional |
509-629 |
3.08e-04 |
|
conjugal transfer pilus assembly protein TraB; Provisional
Pssm-ID: 184281 [Multi-domain] Cd Length: 475 Bit Score: 44.81 E-value: 3.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 509 MQNKVPIPAPNEV-----------LNDRKEDIKLEEKKKTQAEIEQEMATLQ-----YTNPQLLEQLKIERLAQ------ 566
Cdd:PRK13729 38 MSGNGEAVAEQEPvpdmtgvvdttFDDKVRQHATTEMQVTAAQMQKQYEEIRreldvLNKQRGDDQRRIEKLGQdnaala 117
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 56243590 567 KQVEQI---------QPPPSSGTPLLGPQ--PFPGQGPMSQIPQGFQQPHPSQQMPMNMAQMGPPGPQGQFRPP 629
Cdd:PRK13729 118 EQVKALganpvtatgEPVPQMPASPPGPEgePQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVPV 191
|
|
| Collagen |
pfam01391 |
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ... |
730-791 |
9.01e-04 |
|
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.
Pssm-ID: 460189 [Multi-domain] Cd Length: 57 Bit Score: 38.63 E-value: 9.01e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 56243590 730 GPQGPPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGpvsqgplmglnPRGMQGPPGPR 791
Cdd:pfam01391 7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG-----------PPGAPGAPGPP 57
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
284-397 |
1.42e-03 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 43.01 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 284 LHAHKNTVMEVKLN-LNGNWLLTASRDHLCKLFDIRN-------LKEELQVFRGHKKEATAVAWHPVHEGLFASGGSDGS 355
Cdd:PTZ00420 70 LKGHTSSILDLQFNpCFSEILASGSEDLTIRVWEIPHndesvkeIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSF 149
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 56243590 356 LLFWHVGVEKEVGGMEMAHEgmIWSLAWHPLGHIL---CSGSNDH 397
Cdd:PTZ00420 150 VNIWDIENEKRAFQINMPKK--LSSLKWNIKGNLLsgtCVGKHMH 192
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
198-274 |
1.76e-03 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 42.63 E-value: 1.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 198 FQAHKEAIREASFSPTDNK-FATCSDDGTVRIWDfLRCHEER---------ILRGHGADVKCVDWHPTKGLVVSGSK-DS 266
Cdd:PTZ00420 70 LKGHTSSILDLQFNPCFSEiLASGSEDLTIRVWE-IPHNDESvkeikdpqcILKGHKKKISIIDWNPMNYYIMCSSGfDS 148
|
....*...
gi 56243590 267 QqpIKFWD 274
Cdd:PTZ00420 149 F--VNIWD 154
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
151-188 |
2.52e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.94 E-value: 2.52e-03
10 20 30
....*....|....*....|....*....|....*...
gi 56243590 151 FNFETILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYW 188
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
533-643 |
2.90e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.95 E-value: 2.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 533 EKKKTQAEIEQEMATLQYTNPQL------LEQLKIERLAQKQVEQIQPPPSSGTPLLGPQPFPGQGPMSQIPQGFQQPHP 606
Cdd:pfam09770 167 PKKAAAPAPAPQPAAQPASLPAPsrkmmsLEEVEAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQP 246
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 56243590 607 sQQMPMNMAQMGPPGP-------QGQFRPPGPQGQMGPQGPPLH 643
Cdd:pfam09770 247 -QQQPQQPQQHPGQGHpvtilqrPQSPQPDPAQPSIQPQAQQFH 289
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
541-643 |
5.56e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 5.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 541 IEQEMATLQYTNPQLLEQLKIERLA-QKQVEQIQPPPSSGTPLLGPQPFPGQGPMSQIPQGFQQPHPSQQMPMNmaqmgP 619
Cdd:PRK10263 749 VEPVQQPQQPVAPQQQYQQPQQPVApQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ-----P 823
|
90 100
....*....|....*....|....
gi 56243590 620 PGPQGQFRPPGPQGQMGPQGPPLH 643
Cdd:PRK10263 824 VAPQPQYQQPQQPVAPQPQDTLLH 847
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
733-1027 |
5.67e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 5.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 733 GPPGTQGMQGPPGPRGMQGPPH--PHGIQGGPGsqgiQGPVSQGPLMGLNPRGMQG----------PPGPRENQGPAPQG 800
Cdd:PHA03247 2639 DPHPPPTVPPPERPRDDPAPGRvsRPRRARRLG----RAAQASSPPQRPRRRAARPtvgsltsladPPPPPPTPEPAPHA 2714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 801 MIMGHP----PQEMRGPHPP------------GGLLGHGPQEMRGPQEIRGMQGP-PPQGSMLGPPQELRGPPGSQSQQG 863
Cdd:PHA03247 2715 LVSATPlppgPAAARQASPAlpaapappavpaGPATPGGPARPARPPTTAGPPAPaPPAAPAAGPPRRLTRPAVASLSES 2794
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 864 PPQGSLGPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPFQQQKTPLLGDGPRAPFNQEGQSTGP-------PPLIPGLGQ 936
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrPPSRSPAAK 2874
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 937 QGAQGRI-------PPLNPGQGPGPNKGDSRGPPNHHMGPMSERRHEQSGGPEHgPERGPFRGGQdcrgPPDRRGPHPDf 1009
Cdd:PHA03247 2875 PAAPARPpvrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPR----PQPPLAPTTD- 2948
|
330
....*....|....*...
gi 56243590 1010 PDDFSRPDDFHPDKRFGH 1027
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGA 2966
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
734-917 |
8.97e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 40.35 E-value: 8.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 734 PPGTQGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGPVSQGPLMGLNPRGmQGPPGPRENQGPAPQGMIMGHPPQEMRGP 813
Cdd:PRK07764 615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD-GGDGWPAKAGGAAPAAPPPAPAPAAPAAP 693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56243590 814 HPPGGllghGPQEMRGPQEIRGMQGPPPQGSMLGPPQELRGPPGSQSQQGPPQGSLGPPPQGGMQGPPGPQGQQNPARGP 893
Cdd:PRK07764 694 AGAAP----AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769
|
170 180
....*....|....*....|....
gi 56243590 894 HPSQGPIPFQQQKTPLLGDGPRAP 917
Cdd:PRK07764 770 PAAAPPPSPPSEEEEMAEDDAPSM 793
|
|
|