| Name | Accession | Description | Interval | E-value |
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 447-867 | 1.01e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 86.53 E-value: 1.01e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 447 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 524
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 525 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 604
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 605 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 680
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 681 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 749
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 750 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1770726339 819 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 867
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1108-1199 | 2.85e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.85e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1108 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1185
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1770726339 1186 LGEGPVSNTVAFST 1199
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 1109-1189 | 5.29e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 5.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1186
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1770726339 1187 GEG 1189
Cdd:smart00060 81 GEG 83
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 747-1040 | 8.62e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 8.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 747 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 797
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 798 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 876
Cdd:PHA03247 2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 877 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 953
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 954 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1033
Cdd:PHA03247 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798
|
....*..
gi 1770726339 1034 KQPTVPA 1040
Cdd:PHA03247 2799 PSPWDPA 2805
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 1109-1192 | 1.86e-05 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 44.33 E-value: 1.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1185
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1770726339 1186 LGEGPVS 1192
Cdd:pfam00041 79 GGEGPPS 85
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 413-834 | 1.26e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 413 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 484
Cdd:pfam05109 310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 485 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 554
Cdd:pfam05109 390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 555 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 633
Cdd:pfam05109 469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 634 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 711
Cdd:pfam05109 549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 712 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 790
Cdd:pfam05109 626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1770726339 791 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 834
Cdd:pfam05109 706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 925-1204 | 1.56e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.15 E-value: 1.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 925 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1004
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1005 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1081
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1082 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1160
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1770726339 1161 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1204
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 1103-1247 | 2.05e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 45.76 E-value: 2.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1103 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1180
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 1181 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1247
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 117-195 | 2.34e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 192
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 1770726339 193 GVK 195
Cdd:smart00060 73 RVR 75
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 116-195 | 2.46e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.25 E-value: 2.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1770726339 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| PspC_subgroup_2 | NF033839 | pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... | 566-799 | 8.32e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.60 E-value: 8.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 566 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 641
Cdd:NF033839 284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 642 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 721
Cdd:NF033839 362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 722 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 799
Cdd:NF033839 428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
|
|
| DUF5585 | pfam17823 | Family of unknown function (DUF5585); This is a family of unknown function found in chordata. | 296-571 | 8.49e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.41 E-value: 8.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 296 SDALKTQLAKNETLALPAESKTpeVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTI--LIPQFE 373
Cdd:pfam17823 134 IAALPSEAFSAPRAAACRANAS--AAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPatLTPARG 211
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 374 LPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP 453
Cdd:pfam17823 212 ISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTM 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 454 SETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTwTTPAPgktqf 532
Cdd:pfam17823 292 ARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPS-ASPVP----- 364
|
250 260 270
....*....|....*....|....*....|....*....
gi 1770726339 533 islKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 571
Cdd:pfam17823 365 ---VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 116-195 | 9.56e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.79 E-value: 9.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 193
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 1770726339 194 VK 195
Cdd:cd00063 74 VR 75
|
|
| PspC_subgroup_1 | NF033838 | pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... | 759-829 | 3.48e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 41.92 E-value: 3.48e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339 759 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 829
Cdd:NF033838 418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 257-556 | 4.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 257 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 336
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 337 RSTKPTTSSALDVSEttlvLSKRTPETLQTILIPQFELPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTS 416
Cdd:PHA03247 2776 AAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 417 DEPEISDSyTATSDRILDSIPPKTSRTLEQPRATLAPSetPFVPQKLEIFTSPEMQPT---TPAPQQTTSIPSTPKRRPR 493
Cdd:PHA03247 2852 LGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRLAR--PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPQ 2928
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 494 PKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIP--LSPEVTHTKPAPEPQT 556
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfRVPQPAPSREAPASST 2993
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 740-1111 | 9.05e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 9.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 814
Cdd:pfam03154 189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 815 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 890
Cdd:pfam03154 267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 891 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 969
Cdd:pfam03154 342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 970 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1049
Cdd:pfam03154 410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 1050 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1111
Cdd:pfam03154 470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
|
|
|
| Name | Accession | Description | Interval | E-value |
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 447-867 | 1.01e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 86.53 E-value: 1.01e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 447 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 524
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 525 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 604
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 605 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 680
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 681 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 749
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 750 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1770726339 819 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 867
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 437-842 | 1.21e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 76.52 E-value: 1.21e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 437 PPKTSRTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPK--- 513
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRrra 2687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 514 ---------ISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPePQTLLPSQSTIGPETPGT-----KPSTTLAP 579
Cdd:PHA03247 2688 arptvgsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL-PAAPAPPAVPAGPATPGGparpaRPPTTAGP 2766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 580 rkTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQT 659
Cdd:PHA03247 2767 --PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 660 TAEPDMPPTKSVSEPVPFETEAPSMTIVPTtdiepVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTKldf 738
Cdd:PHA03247 2845 PPPPSLPLGGSVAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLArPAVSRSTESFALPPDQPERPPQPQAPPP--- 2916
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 739 gPITPGTSSAPttttkrtRRPHPKPKTTPHPEVPqtklapkQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247 2917 -PQPQPQPPPP-------PQPQPPPPPPPRPQPP-------LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
|
410 420
....*....|....*....|....
gi 1770726339 819 PEVSYTTPAPKDVLLPHKPYPEVS 842
Cdd:PHA03247 2982 PAPSREAPASSTPPLTGHSLSRVS 3005
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 604-1031 | 3.86e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 74.97 E-value: 3.86e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 604 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQ--------PGSKPPKQLLPKPQTTAEPDMPPTKSV 671
Cdd:PHA03247 2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSarprapvdDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 672 SEPVPFETEAPSMTIVPttdiEPVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTkldfgPITPGTSSA-- 748
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVP----PPERPRDDPAPGRVSrPRRARRLGRAAQASSPPQRPRRRAARP-----TVGSLTSLAdp 2701
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 749 PTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPR-------PRIPQTQPVPKVPQRVT--AKPKTSPSP 819
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpggPARPARPPTTAGPPAPAppAAPAAGPPR 2781
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 820 EVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELA---- 895
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggd 2861
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 896 -----KTTQAPHRFYTTVRPRTSDKPhiRPVLNRTTT-------RPTRPKPSGMPSGNGVGTGVKQAPRPSGADRnvsvd 963
Cdd:PHA03247 2862 vrrrpPSRSPAAKPAAPARPPVRRLA--RPAVSRSTEsfalppdQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP----- 2934
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1770726339 964 sTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGII-----SSGPITTPPLRSTPRPTGTPLERIET 1031
Cdd:PHA03247 2935 -PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1108-1199 | 2.85e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.85e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1108 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1185
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1770726339 1186 LGEGPVSNTVAFST 1199
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 1109-1189 | 5.29e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 5.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1186
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1770726339 1187 GEG 1189
Cdd:smart00060 81 GEG 83
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 670-870 | 7.07e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 53.94 E-value: 7.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 670 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 749
Cdd:PRK10263 315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 750 TTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYttpAPK 829
Cdd:PRK10263 383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF---APQ 459
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1770726339 830 DVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQ 870
Cdd:PRK10263 460 STYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 747-1040 | 8.62e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 8.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 747 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 797
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 798 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 876
Cdd:PHA03247 2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 877 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 953
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 954 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1033
Cdd:PHA03247 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798
|
....*..
gi 1770726339 1034 KQPTVPA 1040
Cdd:PHA03247 2799 PSPWDPA 2805
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1109-1192 |
1.86e-05 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 44.33 E-value: 1.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1185
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1770726339 1186 LGEGPVS 1192
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
566-939 |
5.35e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 47.76 E-value: 5.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 566 PETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEpATIQPEPlvpTTASKPSERPKTTHRPDAPQIQP 645
Cdd:PTZ00449 511 PEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE-VGKKPGP---AKEHKPSKIPTLSKKPEFPKDPK 586
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 646 GSKPPKQllPK----PQTTAEPDMPPTKSVSE--PVPFETEAPSMTIVPTTDIEPV-TVRTEATVTTLAPKTSQRTRTRR 718
Cdd:PTZ00449 587 HPKDPEE--PKkpkrPRSAQRPTRPKSPKLPEllDIPKSPKRPESPKSPKRPPPPQrPSSPERPEGPKIIKSPKPPKSPK 664
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 719 PRPKHKTTPR---------PETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPhPEVPQTKLAPKQTPRAPPKPK 789
Cdd:PTZ00449 665 PPFDPKFKEKfyddyldaaAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLP-PKLPRDEEFPFEPIGDPDAEQ 743
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 790 TSP----------RPRIPQTQPVPKVPQRVTAKPKTspsPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRgiPFIPM 859
Cdd:PTZ00449 744 PDDiefftppeeeRTFFHETPADTPLPDILAEEFKE---EDIHAETGEPDE---AMKRPDSPSEHEDKPPGDH--PSLPK 815
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 860 ISPSPSQEELQTTLEETD------QSTQEPFTTKIPRT-TELAKTTQAPH---------------------------RFY 905
Cdd:PTZ00449 816 KRHRLDGLALSTTDLESDagriakDASGKIVKLKRSKSfDDLTTVEEAEEmgaearkivvdddgteaddedthppeeKHK 895
|
410 420 430
....*....|....*....|....*....|....
gi 1770726339 906 TTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS 939
Cdd:PTZ00449 896 SEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPS 929
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
759-845 |
6.07e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 47.50 E-value: 6.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 759 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 838
Cdd:PRK14950 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445
|
....*..
gi 1770726339 839 PEVSQSE 845
Cdd:PRK14950 446 PPAPPKE 452
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
772-857 |
1.08e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 46.47 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 772 PQTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEvSYTTPAPKDVLLPHKPYPEVSQSEPAPLE 850
Cdd:PRK14954 376 NDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPA-SAPTPEQQPPVARSAPLPPSPQASAPRNV 454
|
....*..
gi 1770726339 851 TRGIPFI 857
Cdd:PRK14954 455 ASGKPGV 461
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
601-819 |
1.16e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 46.46 E-value: 1.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 601 PEVPKSKPALEPATIQPEPLVPTTASKPSER--PK---TTHRPDAPQIQ-PGSKPPKQLLPKPQTTAEPDMPPTKSVSEP 674
Cdd:PLN03209 328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEepPQpkaVVPRPLSPYTAyEDLKPPTSPIPTPPSSSPASSKSVDAVAKP 407
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 675 VPFETEAPSMTIVPTTDIEPVTVRTE--------ATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTklDFGPITPGTS 746
Cdd:PLN03209 408 AEPDVVPSPGSASNVPEVEPAQVEAKktrplspyARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP--DTAPATAATD 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 747 SA-PTTTTKRTRRPHP-----KPKTTPHPEVPQTKLAPKQTPRAPP----KPKTSPRPRIPQTQPVPK----VPQRVTAK 812
Cdd:PLN03209 486 AAaPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKvgnsAPPTALADEQHHAQPKPRplspYTMYEDLK 565
|
....*..
gi 1770726339 813 PKTSPSP 819
Cdd:PLN03209 566 PPTSPTP 572
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
413-834 |
1.26e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 413 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 484
Cdd:pfam05109 310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 485 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 554
Cdd:pfam05109 390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 555 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 633
Cdd:pfam05109 469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 634 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 711
Cdd:pfam05109 549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 712 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 790
Cdd:pfam05109 626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1770726339 791 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 834
Cdd:pfam05109 706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
925-1204 |
1.56e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.15 E-value: 1.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 925 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1004
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1005 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1081
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1082 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1160
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1770726339 1161 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1204
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
625-1023 |
2.00e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.93 E-value: 2.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 625 ASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPS--MTIVPTTDIEPVTVRTEAT 702
Cdd:PHA03307 17 GGEFFPRPPATPGDAADDLLSGS--QGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPgpGTEAPANESRSTPTWSLST 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 703 VTTLAPKTSQRTRTRRPRPKhKTTPRPETlqtkldfgPITPGTSSAPttttkrtrrPHPKPKTTPHPEVPQTKLAPKQTP 782
Cdd:PHA03307 95 LAPASPAREGSPTPPGPSSP-DPPPPTPP--------PASPPPSPAP---------DLSEMLRPVGSPGPPPAASPPAAG 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 783 RAPPKPKTSPRPRIPQTQPVPKVPQrvTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS--QSEPAPLETRGIPFIPMI 860
Cdd:PHA03307 157 ASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISasASSPAPAPGRSAADDAGA 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 861 SPSPSqeelqttLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPvlnRTTTRPTRPKPSGMPSG 940
Cdd:PHA03307 235 SSSDS-------SSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASS---SSSPRERSPSPSPSSPG 304
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 941 NGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITTPPLRSTPR 1020
Cdd:PHA03307 305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384
|
...
gi 1770726339 1021 PTG 1023
Cdd:PHA03307 385 PTR 387
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
737-822 |
2.04e-04 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 46.04 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 737 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPriPQTQPVPKVPQRVTAKPKTS 816
Cdd:PRK12270 35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA--AAAPAAPPAAAAAAAPAAAA 112
|
....*.
gi 1770726339 817 PSPEVS 822
Cdd:PRK12270 113 VEDEVT 118
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1103-1247 |
2.05e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 45.76 E-value: 2.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1103 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1180
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 1181 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1247
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
117-195 |
2.34e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 192
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 1770726339 193 GVK 195
Cdd:smart00060 73 RVR 75
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
2.46e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.25 E-value: 2.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1770726339 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
681-864 |
4.60e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 44.48 E-value: 4.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 681 APSMTIVPTTDIEPVTVRTEAT--VTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTtttkrtrr 758
Cdd:PRK12323 380 APVAQPAPAAAAPAAAAPAPAAppAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA-------- 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 759 phPKPKTTPHPEVPqtklAPKQTPRAPPKPKTSPRPR---IPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPH 835
Cdd:PRK12323 452 --PAPAAAPAAAAR----PAAAGPRPVAAAAAAAPARaapAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES 525
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1770726339 836 KPYPEVSQSEP-----------APLETRGIPFIPMISPSP 864
Cdd:PRK12323 526 IPDPATADPDDafetlapapaaAPAPRAAAATEPVVAPRP 565
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
534-870 |
4.72e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.76 E-value: 4.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 534 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 612
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 613 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPT--------KSVSEPVPFE----TE 680
Cdd:pfam03154 223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMphslqtgpSHMQHPVPPQpfplTP 302
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 681 APSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPH 760
Cdd:pfam03154 303 QSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPS 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 761 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQR--VTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 838
Cdd:pfam03154 383 PFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
|
330 340 350
....*....|....*....|....*....|..
gi 1770726339 839 PEVSQSEPAPLETRGiPFIPMISPSPSQEELQ 870
Cdd:pfam03154 463 PQHPFVPGGPPPITP-PSGPPTSTSSAMPGIQ 493
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
608-676 |
4.95e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.42 E-value: 4.95e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339 608 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 676
Cdd:PRK14950 362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
438-805 |
6.85e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 44.30 E-value: 6.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 438 PKTSRTLEQPRATLAP-----SETPFVPQKLEIFTSPE-----MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSA 507
Cdd:PTZ00449 597 PKRPRSAQRPTRPKSPklpelLDIPKSPKRPESPKSPKrppppQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFY 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 508 GTITPKISKSPEpTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPepqtlLPSQSTIGPETPGTKPSTTLAPRktkrpgr 587
Cdd:PTZ00449 677 DDYLDAAAKSKE-TKTTVVLDESFESILKETLPETPGTPFTTPRP-----LPPKLPRDEEFPFEPIGDPDAEQ------- 743
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 588 rprprPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPqTTAEPDMPP 667
Cdd:PTZ00449 744 -----PDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKP-PGDHPSLPK 815
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 668 TKSVSE-----PVPFETEAPSMTIVPTTdiEPVTVRTEATVTTLApktsqrtrtrrprpkhkttprpeTLQTKLDFGPIT 742
Cdd:PTZ00449 816 KRHRLDglalsTTDLESDAGRIAKDASG--KIVKLKRSKSFDDLT-----------------------TVEEAEEMGAEA 870
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1770726339 743 PGTSSAPTTTTKRTRRPHPkPKTTPHPEVPQTKlaPKQTPRAPPKPKTSPRPRIPQTQPVPKV 805
Cdd:PTZ00449 871 RKIVVDDDGTEADDEDTHP-PEEKHKSEVRRRR--PPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
566-799 |
8.32e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.60 E-value: 8.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 566 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 641
Cdd:NF033839 284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 642 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 721
Cdd:NF033839 362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 722 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 799
Cdd:NF033839 428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
296-571 |
8.49e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.41 E-value: 8.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 296 SDALKTQLAKNETLALPAESKTpeVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTI--LIPQFE 373
Cdd:pfam17823 134 IAALPSEAFSAPRAAACRANAS--AAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPatLTPARG 211
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 374 LPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP 453
Cdd:pfam17823 212 ISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTM 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 454 SETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTwTTPAPgktqf 532
Cdd:pfam17823 292 ARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPS-ASPVP----- 364
|
250 260 270
....*....|....*....|....*....|....*....
gi 1770726339 533 islKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 571
Cdd:pfam17823 365 ---VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
740-829 |
8.86e-04 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 42.30 E-value: 8.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPqtkLAPKQTPRAPPKPKtsPRPRiPQTQPVPKVPQRVTAKPKTSPSP 819
Cdd:PRK11633 64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP---VEPEPAPVEPPKPK--PVEK-PKPKPKPQQKVEAPPAPKPEPKP 137
|
90
....*....|
gi 1770726339 820 EVSyTTPAPK 829
Cdd:PRK11633 138 VVE-EKAAPT 146
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
436-794 |
9.53e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 9.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 436 IPPKTSRTLEQPRATLAP-SETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKI 514
Cdd:pfam03154 182 SPPSPPPPGTTQAATAGPtPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV 261
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 515 SKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAprKTKRPGRRPRPRPR 594
Cdd:pfam03154 262 SPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI--HTPPSQSQLQSQQP 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 595 PKTTPSPEVPKSKPALEPATIQPEPLVPTTASKpsERPKTTHRPDAPQIQPGSKPPKQLlpKPQTTAEPDMPPTksvSEP 674
Cdd:pfam03154 340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSH--KHPPHLSGPSPFQMNSNLPPPPAL--KPLSSLSTHHPPS---AHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 675 VPFETEAPSMTIVPTTDIEPVTVRTEATVTTLA----PKTSQRTRTRRPRPKHKTTPRPETLQTKldfgPITPGTSSAPT 750
Cdd:pfam03154 413 PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAshppTSGLHQVPSQSPFPQHPFVPGGPPPITP----PSGPPTSTSSA 488
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1770726339 751 TTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT----------PRAPPKPKTSPRP 794
Cdd:pfam03154 489 MPGIQPPSSASVSSSGPVPAAVSCPLPPVQIkeealdeaeePESPPPPPRSPSP 542
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
116-195 |
9.56e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.79 E-value: 9.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 193
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 1770726339 194 VK 195
Cdd:cd00063 74 VR 75
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
736-868 |
9.58e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.92 E-value: 9.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 736 LDFGPITP--GTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSP-RPRIPQTQPV----PKVPQR 808
Cdd:PRK10263 736 LDDGPHEPlfTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPqQPVAPQPQYQqpqqPVAPQP 815
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1770726339 809 VTAKPKTSPSPEVSY------TTPAPKDVLLpHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEE 868
Cdd:PRK10263 816 QYQQPQQPVAPQPQYqqpqqpVAPQPQDTLL-HPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
607-817 |
2.18e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 607 KPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQ---PGSKPPKQ----LLPKPQTTAE-------------PDMP 666
Cdd:PHA03378 575 QPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQshiPETSAPRQwpmpLRPIPMRPLRmqpitfnvlvfptPHQP 654
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 667 PTKSVSEPVPFETEAPSMTIVPTT---------DIEPVTVRTEATVTT-LAPKTSQRTRTRRPRPKHKTTPRPETLQTKL 736
Cdd:PHA03378 655 PQVEITPYKPTWTQIGHIPYQPSPtgantmlpiQWAPGTMQPPPRAPTpMRPPAAPPGRAQRPAAATGRARPPAAAPGRA 734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 737 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTS 816
Cdd:PHA03378 735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAA 814
|
.
gi 1770726339 817 P 817
Cdd:PHA03378 815 P 815
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
537-763 |
2.41e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.17 E-value: 2.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 537 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 616
Cdd:PRK12323 374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 617 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 696
Cdd:PRK12323 439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1770726339 697 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 763
Cdd:PRK12323 519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
740-828 |
2.85e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.10 E-value: 2.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTklAPKQTPRAPPKPKTSPRPRiPQTQPVPKVPQRVTAKPKTSPSP 819
Cdd:PRK14950 362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPR-PVAPPVPHTPESAPKLTRAAIPV 438
|
....*....
gi 1770726339 820 EVSYTTPAP 828
Cdd:PRK14950 439 DEKPKYTPP 447
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
761-848 |
3.09e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 41.76 E-value: 3.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 761 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPripQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPE 840
Cdd:COG3266 265 SAPATTSLGEQQEVSLPPAVAAQPAAAAAAQPSA---VALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTETAAPAAPAPE 341
|
....*...
gi 1770726339 841 VSQSEPAP 848
Cdd:COG3266 342 AAAAAAAP 349
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
509-743 |
3.20e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 509 TITPKIskSPEPTWTTPAPGKTQfislkpkiplsPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRR 588
Cdd:PRK10263 368 TGEPVI--APAPEGYPQQSQYAQ-----------PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 589 PRPRPRPKTTPSPEVPKSKPALEP-ATIQPEPLV--PTTASKPSERPKTTHRPDAPQIQPG------SKPP--------- 650
Cdd:PRK10263 435 APAPEQPVAGNAWQAEEQQSTFAPqSTYQTEQTYqqPAAQEPLYQQPQPVEQQPVVEPEPVveetkpARPPlyyfeevee 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 651 ------KQLLPKPQTTAEPDMPPtksvsEPVPFETEAPSMTIVPTTDIEPVTVRTEATV--TTLAPKTSQRTRTRRPRPK 722
Cdd:PRK10263 515 krarerEQLAAWYQPIPEPVKEP-----EPIKSSLKAPSVAAVPPVEAAAAVSPLASGVkkATLATGAAATVAAPVFSLA 589
|
250 260
....*....|....*....|.
gi 1770726339 723 HKTTPRPetlQTKLDFGPITP 743
Cdd:PRK10263 590 NSGGPRP---QVKEGIGPQLP 607
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
759-829 |
3.48e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 41.92 E-value: 3.48e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339 759 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 829
Cdd:NF033838 418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
743-848 |
4.22e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 41.62 E-value: 4.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 743 PGTSSAPTTTTKRTRRPHPKPKTT----PHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVtakpktsPS 818
Cdd:PRK14951 375 PAEKKTPARPEAAAPAAAPVAQAAaapaPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV-------AL 447
|
90 100 110
....*....|....*....|....*....|..
gi 1770726339 819 PEVSYTTPAPKDVLLPHK--PYPEVSQSEPAP 848
Cdd:PRK14951 448 APAPPAQAAPETVAIPVRvaPEPAVASAAPAP 479
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
257-556 |
4.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 257 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 336
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 337 RSTKPTTSSALDVSEttlvLSKRTPETLQTILIPQFELPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTS 416
Cdd:PHA03247 2776 AAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 417 DEPEISDSyTATSDRILDSIPPKTSRTLEQPRATLAPSetPFVPQKLEIFTSPEMQPT---TPAPQQTTSIPSTPKRRPR 493
Cdd:PHA03247 2852 LGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRLAR--PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPQ 2928
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 494 PKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIP--LSPEVTHTKPAPEPQT 556
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfRVPQPAPSREAPASST 2993
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
611-712 |
4.63e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 40.69 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 611 EPATIQP---EPLVPTTASKPSERPKTTHRPDAPQIQPGSKppkqllpKPQTTAE-PDMPPTKSVSEPVPFETEAPSMTI 686
Cdd:PRK10905 126 EPATVAPvrnGNASRQTAKTQTAERPATTRPARKQAVIEPK-------KPQATAKtEPKPVAQTPKRTEPAAPVASTKAP 198
|
90 100
....*....|....*....|....*.
gi 1770726339 687 VPTTDIEPVTVRTEATVTTLAPKTSQ 712
Cdd:PRK10905 199 AATSTPAPKETATTAPVQTASPAQTT 224
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
600-794 |
5.91e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.12 E-value: 5.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 600 SPEVPKSKPALEPATIQPEPLVPTTASKPsERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVS-EPVPFE 678
Cdd:PRK07764 596 GGEGPPAPASSGPPEEAARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDgWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 679 TEAPSMTIVPTTDIEPVTVRTEATvttlapktsqrtrtRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRR 758
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAP--------------AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
|
170 180 190
....*....|....*....|....*....|....*.
gi 1770726339 759 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRP 794
Cdd:PRK07764 741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
|
|
| Orthopox_A5L |
pfam06193 |
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The ... |
773-893 |
8.83e-03 |
|
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The vaccinia virus WR A5L open reading frame (corresponding to open reading frame A4L in vaccinia virus Copenhagen) encodes an immunodominant late protein found in the core of the vaccinia virion. The A5 protein appears to be required for the immature virion to form the brick-shaped intracellular mature virion.
Pssm-ID: 283778 [Multi-domain] Cd Length: 216 Bit Score: 39.19 E-value: 8.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 773 QTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLET 851
Cdd:pfam06193 61 DNMLAASRQPIQPLQPTIHITPiEIPTPAPTPKPRQQELGTPSTSCTQNSDASIACSTDIVTPPQPPIVATVCTPTPTDG 140
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1770726339 852 RgIPFIPMISPSP---SQEELQTTLEETDQSTQEPFTTKIPRTTE 893
Cdd:pfam06193 141 R-ICTTADQNPNPgatIQKELDNMALKDLMSSVEKDMCQLQAESE 184
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
740-1111 |
9.05e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 9.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 814
Cdd:pfam03154 189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 815 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 890
Cdd:pfam03154 267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 891 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 969
Cdd:pfam03154 342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 970 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1049
Cdd:pfam03154 410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 1050 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1111
Cdd:pfam03154 470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
413-801 |
9.30e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 9.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 413 PTTSDEPEISDSYTATSDriLDSIPPKTSRTLEQPratLAPSETPF---VPQKLEIFTSPEMQPTTPAPQQTTSIPSTPK 489
Cdd:pfam05109 455 PTNLTAPASTGPTVSTAD--VTSPTPAGTTSGASP---VTPSPSPRdngTESKAPDMTSPTSAVTTPTPNATSPTPAVTT 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 490 RRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTqFISLKPKIPLSPEVTHTKPAPEPQTLLPS-QSTIGPET 568
Cdd:pfam05109 530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPTSAVTTPTPNATSPTVGETSpQANTTNHT 608
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 569 PGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKP---SERPktTHRPDAPQIQP 645
Cdd:pfam05109 609 LGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHP--TGGENITQVTP 686
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 646 GSKPPKQL-----LPKPQTTAEPDMPPTKSVS-EPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRP 719
Cdd:pfam05109 687 ASTSTHHVstsspAPRPGTTSQASGPGNSSTStKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKH 766
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 720 RPKH--KTTPRPETlqtklDFGpitpGTSSAPTTTTKRtrrphpkpkTTPHPEVPQTKLAPKQTPRAPP-KPKTSPRPRI 796
Cdd:pfam05109 767 TTGHgaRTSTEPTT-----DYG----GDSTTPRTRYNA---------TTYLPPSTSSKLRPRWTFTSPPvTTAQATVPVP 828
|
....*
gi 1770726339 797 PQTQP 801
Cdd:pfam05109 829 PTSQP 833
|
|
|