|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
429-849 |
9.12e-17 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 86.92 E-value: 9.12e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 429 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 506
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 507 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 586
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 587 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 662
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 663 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 731
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 732 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 800
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 2462588874 801 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 849
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1090-1181 |
2.63e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.63e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1090 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1167
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 2462588874 1168 LGEGPVSNTVAFST 1181
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1091-1171 |
4.78e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 4.78e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1091 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1168
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 2462588874 1169 GEG 1171
Cdd:smart00060 81 GEG 83
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
729-1022 |
7.67e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 7.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 729 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 779
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 780 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 858
Cdd:PHA03247 2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 859 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 935
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 936 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1015
Cdd:PHA03247 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798
|
....*..
gi 2462588874 1016 KQPTVPA 1022
Cdd:PHA03247 2799 PSPWDPA 2805
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1091-1174 |
1.64e-05 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 44.33 E-value: 1.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1091 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1167
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 2462588874 1168 LGEGPVS 1174
Cdd:pfam00041 79 GGEGPPS 85
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
395-816 |
1.30e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 395 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 466
Cdd:pfam05109 310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 467 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 536
Cdd:pfam05109 390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 537 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 615
Cdd:pfam05109 469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 616 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 693
Cdd:pfam05109 549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 694 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 772
Cdd:pfam05109 626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 2462588874 773 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 816
Cdd:pfam05109 706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
907-1186 |
1.46e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.15 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 907 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 986
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 987 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1063
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1064 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1142
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2462588874 1143 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1186
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1085-1229 |
1.87e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 45.76 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1085 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1162
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588874 1163 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1229
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
124-202 |
2.13e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 2462588874 200 GVK 202
Cdd:smart00060 73 RVR 75
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
123-202 |
2.20e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.25 E-value: 2.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 2462588874 200 GVK 202
Cdd:pfam00041 72 RVQ 74
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
548-781 |
7.98e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.60 E-value: 7.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 548 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 623
Cdd:NF033839 284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 624 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 703
Cdd:NF033839 362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588874 704 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 781
Cdd:NF033839 428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
123-202 |
8.81e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.79 E-value: 8.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 2462588874 201 VK 202
Cdd:cd00063 74 VR 75
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
741-811 |
3.80e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 41.54 E-value: 3.80e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588874 741 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 811
Cdd:NF033838 418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
722-1093 |
8.76e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 8.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 722 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 796
Cdd:pfam03154 189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 797 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 872
Cdd:pfam03154 267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 873 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 951
Cdd:pfam03154 342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 952 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1031
Cdd:pfam03154 410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588874 1032 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1093
Cdd:pfam03154 470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
429-849 |
9.12e-17 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 86.92 E-value: 9.12e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 429 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 506
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 507 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 586
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 587 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 662
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 663 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 731
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 732 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 800
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 2462588874 801 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 849
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
419-824 |
2.31e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 78.83 E-value: 2.31e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 419 PPKTSRTLEQPRATLAPSETPFvpqkleiftSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISK 498
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPP---------SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 499 SPE---PTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTIGPETPGTKPSTTLAP--RKTKRPGRRPRP 573
Cdd:PHA03247 2679 PPQrprRRAARPTVGSLTSLADPPPPPPTPE-----PAPHALVSATPLPPGPAAARQASPALPAAPapPAVPAGPATPGG 2753
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 574 RPRPKTTPSPEVPKSK--PALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTK 651
Cdd:PHA03247 2754 PARPARPPTTAGPPAPapPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 652 SVSEPVPFETEAPSMTIVPTTDIEP-VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA 730
Cdd:PHA03247 2834 AQPTAPPPPPGPPPPSLPLGGSVAPgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA 2913
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 731 PTTTTKRTRRPHPkPKTTPHPEVPQTKLAPKQtPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 810
Cdd:PHA03247 2914 PPPPQPQPQPPPP-PQPQPPPPPPPRPQPPLA-PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPAS 2991
|
410
....*....|....
gi 2462588874 811 KDVLLPHKPYPEVS 824
Cdd:PHA03247 2992 STPPLTGHSLSRVS 3005
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
586-1013 |
3.40e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 74.97 E-value: 3.40e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 586 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQ--------PGSKPPKQLLPKPQTTAEPDMPPTKSV 653
Cdd:PHA03247 2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSarprapvdDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 654 SEPVPFETEAPSMTIVPttdiEPVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTkldfgPITPGTSSA-- 730
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVP----PPERPRDDPAPGRVSrPRRARRLGRAAQASSPPQRPRRRAARP-----TVGSLTSLAdp 2701
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 731 PTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPR-------PRIPQTQPVPKVPQRVT--AKPKTSPSP 801
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpggPARPARPPTTAGPPAPAppAAPAAGPPR 2781
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 802 EVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELA---- 877
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggd 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 878 -----KTTQAPHRFYTTVRPRTSDKPhiRPVLNRTTT-------RPTRPKPSGMPSGNGVGTGVKQAPRPSGADRnvsvd 945
Cdd:PHA03247 2862 vrrrpPSRSPAAKPAAPARPPVRRLA--RPAVSRSTEsfalppdQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP----- 2934
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462588874 946 sTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGII-----SSGPITTPPLRSTPRPTGTPLERIET 1013
Cdd:PHA03247 2935 -PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1090-1181 |
2.63e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.63e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1090 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1167
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 2462588874 1168 LGEGPVSNTVAFST 1181
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
331-751 |
3.23e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 65.34 E-value: 3.23e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 331 SARPTTVTPETVPRSTKPTTSSALDVSETTLASSEKPWIVPT-AKISEDSKVLQPQTATYDvfSSPTTSDEPEISDSYTA 409
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPApGRVSRPRRARRLGRAAQA--SSPPQRPRRRAARPTVG 2693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 410 TSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTT 487
Cdd:PHA03247 2694 SLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 488 SAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAPRKTKRP 567
Cdd:PHA03247 2772 PAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPPPTSAQP 2836
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 568 GRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPPKQLLPK 638
Cdd:PHA03247 2837 TAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQPQAPPP 2916
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 639 PQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPETLQTKL 718
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPSGAVPQP 2962
|
410 420 430
....*....|....*....|....*....|...
gi 2462588874 719 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 751
Cdd:PHA03247 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1091-1171 |
4.78e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 4.78e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1091 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1168
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 2462588874 1169 GEG 1171
Cdd:smart00060 81 GEG 83
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
652-852 |
6.39e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 54.32 E-value: 6.39e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 652 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 731
Cdd:PRK10263 315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 732 TTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYttpAPK 811
Cdd:PRK10263 383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF---APQ 459
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 2462588874 812 DVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQ 852
Cdd:PRK10263 460 STYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
729-1022 |
7.67e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 7.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 729 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 779
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 780 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 858
Cdd:PHA03247 2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 859 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 935
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 936 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1015
Cdd:PHA03247 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798
|
....*..
gi 2462588874 1016 KQPTVPA 1022
Cdd:PHA03247 2799 PSPWDPA 2805
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1091-1174 |
1.64e-05 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 44.33 E-value: 1.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1091 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1167
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 2462588874 1168 LGEGPVS 1174
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
548-921 |
4.41e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.15 E-value: 4.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 548 PETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEpATIQPEPlvpTTASKPSERPKTTHRPDAPQIQP 627
Cdd:PTZ00449 511 PEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE-VGKKPGP---AKEHKPSKIPTLSKKPEFPKDPK 586
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 628 GSKPPKQllPK----PQTTAEPDMPPTKSVSEPV-----PFETEAPSMTIVPTTDIEPVTVR----TEATVTTLAPKTSQ 694
Cdd:PTZ00449 587 HPKDPEE--PKkpkrPRSAQRPTRPKSPKLPELLdipksPKRPESPKSPKRPPPPQRPSSPErpegPKIIKSPKPPKSPK 664
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 695 RTRTRRPRPK------HKTTPRPETLQT-KLDFGPITPGTSSAPTTTTKRTRRPHPKPkttphPEVPQTKLAPKQTPRAP 767
Cdd:PTZ00449 665 PPFDPKFKEKfyddylDAAAKSKETKTTvVLDESFESILKETLPETPGTPFTTPRPLP-----PKLPRDEEFPFEPIGDP 739
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 768 PKPKTSP----------RPRIPQTQPVPKVPQRVTAKPKTspsPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRgiP 837
Cdd:PTZ00449 740 DAEQPDDiefftppeeeRTFFHETPADTPLPDILAEEFKE---EDIHAETGEPDE---AMKRPDSPSEHEDKPPGDH--P 811
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 838 FIPMISPSPSQEELQTTLEETD------QSTQEPFTTKIPRT-TELAKTTQAPH-------------------------- 884
Cdd:PTZ00449 812 SLPKKRHRLDGLALSTTDLESDagriakDASGKIVKLKRSKSfDDLTTVEEAEEmgaearkivvdddgteaddedthppe 891
|
410 420 430
....*....|....*....|....*....|....*...
gi 2462588874 885 -RFYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS 921
Cdd:PTZ00449 892 eKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPS 929
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
741-827 |
5.53e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 47.50 E-value: 5.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 741 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 820
Cdd:PRK14950 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445
|
....*..
gi 2462588874 821 PEVSQSE 827
Cdd:PRK14950 446 PPAPPKE 452
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
754-839 |
1.01e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 46.86 E-value: 1.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 754 PQTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEvSYTTPAPKDVLLPHKPYPEVSQSEPAPLE 832
Cdd:PRK14954 376 NDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPA-SAPTPEQQPPVARSAPLPPSPQASAPRNV 454
|
....*..
gi 2462588874 833 TRGIPFI 839
Cdd:PRK14954 455 ASGKPGV 461
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
583-801 |
1.03e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 46.46 E-value: 1.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 583 PEVPKSKPALEPATIQPEPLVPTTASKPSER--PK---TTHRPDAPQIQ-PGSKPPKQLLPKPQTTAEPDMPPTKSVSEP 656
Cdd:PLN03209 328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEepPQpkaVVPRPLSPYTAyEDLKPPTSPIPTPPSSSPASSKSVDAVAKP 407
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 657 VPFETEAPSMTIVPTTDIEPVTVRTE--------ATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTklDFGPITPGTS 728
Cdd:PLN03209 408 AEPDVVPSPGSASNVPEVEPAQVEAKktrplspyARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP--DTAPATAATD 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 729 SA-PTTTTKRTRRPHP-----KPKTTPHPEVPQTKLAPKQTPRAPP----KPKTSPRPRIPQTQPVPK----VPQRVTAK 794
Cdd:PLN03209 486 AAaPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKvgnsAPPTALADEQHHAQPKPRplspYTMYEDLK 565
|
....*..
gi 2462588874 795 PKTSPSP 801
Cdd:PLN03209 566 PPTSPTP 572
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
395-816 |
1.30e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 395 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 466
Cdd:pfam05109 310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 467 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 536
Cdd:pfam05109 390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 537 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 615
Cdd:pfam05109 469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 616 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 693
Cdd:pfam05109 549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 694 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 772
Cdd:pfam05109 626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 2462588874 773 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 816
Cdd:pfam05109 706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
907-1186 |
1.46e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.15 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 907 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 986
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 987 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1063
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1064 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1142
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2462588874 1143 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1186
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1085-1229 |
1.87e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 45.76 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 1085 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1162
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588874 1163 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1229
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
607-1005 |
1.97e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.93 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 607 ASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPS--MTIVPTTDIEPVTVRTEAT 684
Cdd:PHA03307 17 GGEFFPRPPATPGDAADDLLSGS--QGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPgpGTEAPANESRSTPTWSLST 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 685 VTTLAPKTSQRTRTRRPRPKhKTTPRPETlqtkldfgPITPGTSSAPttttkrtrrPHPKPKTTPHPEVPQTKLAPKQTP 764
Cdd:PHA03307 95 LAPASPAREGSPTPPGPSSP-DPPPPTPP--------PASPPPSPAP---------DLSEMLRPVGSPGPPPAASPPAAG 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 765 RAPPKPKTSPRPRIPQTQPVPKVPQrvTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS--QSEPAPLETRGIPFIPMI 842
Cdd:PHA03307 157 ASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISasASSPAPAPGRSAADDAGA 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 843 SPSPSqeelqttLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPvlnRTTTRPTRPKPSGMPSG 922
Cdd:PHA03307 235 SSSDS-------SSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASS---SSSPRERSPSPSPSSPG 304
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 923 NGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITTPPLRSTPR 1002
Cdd:PHA03307 305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384
|
...
gi 2462588874 1003 PTG 1005
Cdd:PHA03307 385 PTR 387
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
719-804 |
2.01e-04 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 46.04 E-value: 2.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 719 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPriPQTQPVPKVPQRVTAKPKTS 798
Cdd:PRK12270 35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA--AAAPAAPPAAAAAAAPAAAA 112
|
....*.
gi 2462588874 799 PSPEVS 804
Cdd:PRK12270 113 VEDEVT 118
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
124-202 |
2.13e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 2462588874 200 GVK 202
Cdd:smart00060 73 RVR 75
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
123-202 |
2.20e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.25 E-value: 2.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 2462588874 200 GVK 202
Cdd:pfam00041 72 RVQ 74
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
663-846 |
4.30e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 44.87 E-value: 4.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 663 APSMTIVPTTDIEPVTVRTEAT--VTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTtttkrtrr 740
Cdd:PRK12323 380 APVAQPAPAAAAPAAAAPAPAAppAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA-------- 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 741 phPKPKTTPHPEVPqtklAPKQTPRAPPKPKTSPRPR---IPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPH 817
Cdd:PRK12323 452 --PAPAAAPAAAAR----PAAAGPRPVAAAAAAAPARaapAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES 525
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 2462588874 818 KPYPEVSQSEP-----------APLETRGIPFIPMISPSP 846
Cdd:PRK12323 526 IPDPATADPDDafetlapapaaAPAPRAAAATEPVVAPRP 565
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
590-658 |
4.60e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.42 E-value: 4.60e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588874 590 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 658
Cdd:PRK14950 362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
320-776 |
5.32e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 5.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 320 AESKTPEVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSD 399
Cdd:pfam03154 87 GASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQI 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 400 EPEISDSYTATSDRILDSIPPKTSRTleQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPP 479
Cdd:pfam03154 167 LQTQPPVLQAQSGAASPPSPPPPGTT--QAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPS 244
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 480 RTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTL 559
Cdd:pfam03154 245 PHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQR 324
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 560 AprKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKpsERPKTTHRPDAPQIQPGSKPPKQLlpKP 639
Cdd:pfam03154 325 I--HTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSH--KHPPHLSGPSPFQMNSNLPPPPAL--KP 398
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 640 QTTAEPDMPPTksvSEPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLA----PKTSQRTRTRRPRPKHKTTPRPETLQ 715
Cdd:pfam03154 399 LSSLSTHHPPS---AHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAshppTSGLHQVPSQSPFPQHPFVPGGPPPI 475
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462588874 716 TKldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT----------PRAPPKPKTSPRP 776
Cdd:pfam03154 476 TP----PSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIkeealdeaeePESPPPPPRSPSP 542
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
420-787 |
5.42e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 44.68 E-value: 5.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 420 PKTSRTLEQPRATLAP-----SETPFVPQKLEIFTSPE-----MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSA 489
Cdd:PTZ00449 597 PKRPRSAQRPTRPKSPklpelLDIPKSPKRPESPKSPKrppppQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFY 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 490 GTITPKISKSPEpTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPepqtlLPSQSTIGPETPGTKPSTTLAPRktkrpgr 569
Cdd:PTZ00449 677 DDYLDAAAKSKE-TKTTVVLDESFESILKETLPETPGTPFTTPRP-----LPPKLPRDEEFPFEPIGDPDAEQ------- 743
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 570 rprprPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPqTTAEPDMPP 649
Cdd:PTZ00449 744 -----PDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKP-PGDHPSLPK 815
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 650 TKSVSE-----PVPFETEAPSMTIVPTTdiEPVTVRTEATVTTLApktsqrtrtrrprpkhkttprpeTLQTKLDFGPIT 724
Cdd:PTZ00449 816 KRHRLDglalsTTDLESDAGRIAKDASG--KIVKLKRSKSFDDLT-----------------------TVEEAEEMGAEA 870
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462588874 725 PGTSSAPTTTTKRTRRPHPkPKTTPHPEVPQTKlaPKQTPRAPPKPKTSPRPRIPQTQPVPKV 787
Cdd:PTZ00449 871 RKIVVDDDGTEADDEDTHP-PEEKHKSEVRRRR--PPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
548-781 |
7.98e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.60 E-value: 7.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 548 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 623
Cdd:NF033839 284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 624 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 703
Cdd:NF033839 362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588874 704 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 781
Cdd:NF033839 428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
718-850 |
8.17e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.92 E-value: 8.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 718 LDFGPITP--GTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSP-RPRIPQTQPV----PKVPQR 790
Cdd:PRK10263 736 LDDGPHEPlfTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPqQPVAPQPQYQqpqqPVAPQP 815
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588874 791 VTAKPKTSPSPEVSY------TTPAPKDVLLpHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEE 850
Cdd:PRK10263 816 QYQQPQQPVAPQPQYqqpqqpVAPQPQDTLL-HPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
722-811 |
8.26e-04 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 42.30 E-value: 8.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 722 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPqtkLAPKQTPRAPPKPKtsPRPRiPQTQPVPKVPQRVTAKPKTSPSP 801
Cdd:PRK11633 64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP---VEPEPAPVEPPKPK--PVEK-PKPKPKPQQKVEAPPAPKPEPKP 137
|
90
....*....|
gi 2462588874 802 EVSyTTPAPK 811
Cdd:PRK11633 138 VVE-EKAAPT 146
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
123-202 |
8.81e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.79 E-value: 8.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 2462588874 201 VK 202
Cdd:cd00063 74 VR 75
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
317-553 |
2.06e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 42.25 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 317 ALPAESKTPEVEKISArPTTVTPETvPRSTKPTTSSALDVSETTLASSEKPWIVPT-AKISEDSKVLQPQTATYDVFSS- 394
Cdd:pfam17823 129 SLPAAIAALPSEAFSA-PRAAACRA-NASAAPRAAIAAASAPHAASPAPRTAASSTtAASSTTAASSAPTTAASSAPATl 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 395 -PTTSDEPEISDSYTATSDRILDSIPPKTS--RTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPK 471
Cdd:pfam17823 207 tPARGISTAATATGHPAAGTALAAVGNSSPaaGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHM 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 472 RRPRPKPPRTKPERTTSAGTIT------PKISKSPEPT------------------------WTTPAPGKTQFISLKPKI 521
Cdd:pfam17823 287 PSDTMARNPAAPMGAQAQGPIIqvstdqPVHNTAGEPTpspsnttlepntpksvastnlavvTTTKAQAKEPSASPVPVL 366
|
250 260 270
....*....|....*....|....*....|....
gi 2462588874 522 PLS--PEVTHTKPAPEPQTLLPSQSTIGPETPGT 553
Cdd:pfam17823 367 HTSmiPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
589-799 |
2.06e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.75 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 589 KPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQ---PGSKPPKQ----LLPKPQTTAE-------------PDMP 648
Cdd:PHA03378 575 QPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQshiPETSAPRQwpmpLRPIPMRPLRmqpitfnvlvfptPHQP 654
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 649 PTKSVSEPVPFETEAPSMTIVPTT---------DIEPVTVRTEATVTT-LAPKTSQRTRTRRPRPKHKTTPRPETLQTKL 718
Cdd:PHA03378 655 PQVEITPYKPTWTQIGHIPYQPSPtgantmlpiQWAPGTMQPPPRAPTpMRPPAAPPGRAQRPAAATGRARPPAAAPGRA 734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 719 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTS 798
Cdd:PHA03378 735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAA 814
|
.
gi 2462588874 799 P 799
Cdd:PHA03378 815 P 815
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
519-745 |
2.32e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.17 E-value: 2.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 519 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 598
Cdd:PRK12323 374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 599 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 678
Cdd:PRK12323 439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462588874 679 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 745
Cdd:PRK12323 519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
743-830 |
2.36e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 42.14 E-value: 2.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 743 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPripQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPE 822
Cdd:COG3266 265 SAPATTSLGEQQEVSLPPAVAAQPAAAAAAQPSA---VALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTETAAPAAPAPE 341
|
....*...
gi 2462588874 823 VSQSEPAP 830
Cdd:COG3266 342 AAAAAAAP 349
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
491-725 |
2.53e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 491 TITPKIskSPEPTWTTPAPGKTQfislkpkiplsPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRR 570
Cdd:PRK10263 368 TGEPVI--APAPEGYPQQSQYAQ-----------PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 571 PRPRPRPKTTPSPEVPKSKPALEP-ATIQPEPLV--PTTASKPSERPKTTHRPDAPQIQPG------SKPP--------- 632
Cdd:PRK10263 435 APAPEQPVAGNAWQAEEQQSTFAPqSTYQTEQTYqqPAAQEPLYQQPQPVEQQPVVEPEPVveetkpARPPlyyfeevee 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 633 ------KQLLPKPQTTAEPDMPPtksvsEPVPFETEAPSMTIVPTTDIEPVTVRTEATV--TTLAPKTSQRTRTRRPRPK 704
Cdd:PRK10263 515 krarerEQLAAWYQPIPEPVKEP-----EPIKSSLKAPSVAAVPPVEAAAAVSPLASGVkkATLATGAAATVAAPVFSLA 589
|
250 260
....*....|....*....|.
gi 2462588874 705 HKTTPRPetlQTKLDFGPITP 725
Cdd:PRK10263 590 NSGGPRP---QVKEGIGPQLP 607
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
722-810 |
2.64e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.10 E-value: 2.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 722 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTklAPKQTPRAPPKPKTSPRPRiPQTQPVPKVPQRVTAKPKTSPSP 801
Cdd:PRK14950 362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPR-PVAPPVPHTPESAPKLTRAAIPV 438
|
....*....
gi 2462588874 802 EVSYTTPAP 810
Cdd:PRK14950 439 DEKPKYTPP 447
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
317-783 |
3.34e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 317 ALPAESKTPEVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLASsekpwivPTAKISEDSKVLQPQTATYDVFSSPT 396
Cdd:pfam05109 395 GLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA-------PNTTTGLPSSTHVPTNLTAPASTGPT 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 397 TSDEPEISDSYTATSDRILDSIPPKTsrtleqPRATLAPSETPFVpqkleifTSPEMQPTTPAPQQTTSIPSTPKRRPRP 476
Cdd:pfam05109 468 VSTADVTSPTPAGTTSGASPVTPSPS------PRDNGTESKAPDM-------TSPTSAVTTPTPNATSPTPAVTTPTPNA 534
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 477 KPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTqFISLKPKIPLSPEVTHTKPAPEPQTLLPS-QSTIGPETPGTKP 555
Cdd:pfam05109 535 TSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPTSAVTTPTPNATSPTVGETSpQANTTNHTLGGTS 613
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 556 STTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKP---SERPktTHRPDAPQIQPGSKPP 632
Cdd:pfam05109 614 STPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHP--TGGENITQVTPASTST 691
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 633 KQL-----LPKPQTTAEPDMPPTKSVS-EPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKH- 705
Cdd:pfam05109 692 HHVstsspAPRPGTTSQASGPGNSSTStKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHg 771
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 706 -KTTPRPETlqtklDFGpitpGTSSAPTTTTKRtrrphpkpkTTPHPEVPQTKLAPKQTPRAPP-KPKTSPRPRIPQTQP 783
Cdd:pfam05109 772 aRTSTEPTT-----DYG----GDSTTPRTRYNA---------TTYLPPSTSSKLRPRWTFTSPPvTTAQATVPVPPTSQP 833
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
725-830 |
3.59e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 41.62 E-value: 3.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 725 PGTSSAPTTTTKRTRRPHPKPKTT----PHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVtakpktsPS 800
Cdd:PRK14951 375 PAEKKTPARPEAAAPAAAPVAQAAaapaPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV-------AL 447
|
90 100 110
....*....|....*....|....*....|..
gi 2462588874 801 PEVSYTTPAPKDVLLPHK--PYPEVSQSEPAP 830
Cdd:PRK14951 448 APAPPAQAAPETVAIPVRvaPEPAVASAAPAP 479
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
741-811 |
3.80e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 41.54 E-value: 3.80e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588874 741 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 811
Cdd:NF033838 418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
593-694 |
4.44e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 40.69 E-value: 4.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 593 EPATIQP---EPLVPTTASKPSERPKTTHRPDAPQIQPGSKppkqllpKPQTTAE-PDMPPTKSVSEPVPFETEAPSMTI 668
Cdd:PRK10905 126 EPATVAPvrnGNASRQTAKTQTAERPATTRPARKQAVIEPK-------KPQATAKtEPKPVAQTPKRTEPAAPVASTKAP 198
|
90 100
....*....|....*....|....*.
gi 2462588874 669 VPTTDIEPVTVRTEATVTTLAPKTSQ 694
Cdd:PRK10905 199 AATSTPAPKETATTAPVQTASPAQTT 224
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
582-776 |
5.67e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.12 E-value: 5.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 582 SPEVPKSKPALEPATIQPEPLVPTTASKPsERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVS-EPVPFE 660
Cdd:PRK07764 596 GGEGPPAPASSGPPEEAARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDgWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 661 TEAPSMTIVPTTDIEPVTVRTEATvttlapktsqrtrtRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRR 740
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAP--------------AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
|
170 180 190
....*....|....*....|....*....|....*.
gi 2462588874 741 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRP 776
Cdd:PRK07764 741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
|
|
| Orthopox_A5L |
pfam06193 |
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The ... |
755-875 |
7.19e-03 |
|
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The vaccinia virus WR A5L open reading frame (corresponding to open reading frame A4L in vaccinia virus Copenhagen) encodes an immunodominant late protein found in the core of the vaccinia virion. The A5 protein appears to be required for the immature virion to form the brick-shaped intracellular mature virion.
Pssm-ID: 283778 [Multi-domain] Cd Length: 216 Bit Score: 39.58 E-value: 7.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 755 QTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLET 833
Cdd:pfam06193 61 DNMLAASRQPIQPLQPTIHITPiEIPTPAPTPKPRQQELGTPSTSCTQNSDASIACSTDIVTPPQPPIVATVCTPTPTDG 140
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 2462588874 834 RgIPFIPMISPSP---SQEELQTTLEETDQSTQEPFTTKIPRTTE 875
Cdd:pfam06193 141 R-ICTTADQNPNPgatIQKELDNMALKDLMSSVEKDMCQLQAESE 184
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
722-1093 |
8.76e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 8.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 722 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 796
Cdd:pfam03154 189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 797 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 872
Cdd:pfam03154 267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 873 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 951
Cdd:pfam03154 342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588874 952 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1031
Cdd:pfam03154 410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588874 1032 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1093
Cdd:pfam03154 470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
|
|
|