|
Name |
Accession |
Description |
Interval |
E-value |
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
889-980 |
9.76e-11 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 59.43 E-value: 9.76e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 889 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 966
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 2462588888 967 LGEGPVSNTVAFST 980
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
509-716 |
1.01e-09 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 62.78 E-value: 1.01e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 509 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVthtkPAPKQTPRAPPKPKTSPRPRIPQ---T 580
Cdd:PTZ00449 533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEF----PKDPKHPKDPEEPKKPKRPRSAQrptR 608
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 581 QPVPKVPQ--RVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMI--------SPSPSQEEL 650
Cdd:PTZ00449 609 PKSPKLPEllDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFkekfyddyLDAAAKSKE 688
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 651 QTTLEETDQSTQEPFTTKIPRTTELAKTTQAPhrfYTTVRPRTSDKPHIRPvlnrttTRPTRPKPS 716
Cdd:PTZ00449 689 TKTTVVLDESFESILKETLPETPGTPFTTPRP---LPPKLPRDEEFPFEPI------GDPDAEQPD 745
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
319-623 |
4.47e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.47e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 319 PAESKTPEvekisARPttvtPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTILIPQFELPLSTLASSEKPW--IVP 396
Cdd:PHA03247 2702 PPPPPTPE-----PAP----HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapAPP 2772
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 397 TAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAPSETPFVPQKLEIFTSP 476
Cdd:PHA03247 2773 AAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 477 E--MQP-----TTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTItPKISKSPEPTWTTPAPGKTQFISLKPKIPLS 549
Cdd:PHA03247 2853 GgsVAPggdvrRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-PPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462588888 550 PEVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 623
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
890-970 |
2.08e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 52.62 E-value: 2.08e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 890 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 967
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 2462588888 968 GEG 970
Cdd:smart00060 81 GEG 83
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
543-892 |
6.16e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 543 KPKIPLSPEVTHTKPAPKQTPRAPPKPKTSP----RPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKP 618
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPpdthAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRA 2667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 619 YPEVSQSEP-APLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKP 697
Cdd:PHA03247 2668 RRLGRAAQAsSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG 2747
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 698 HIRPVLNRTTTRPTRPK--PSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnn 775
Cdd:PHA03247 2748 PATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP--- 2824
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 776 VTGKPGSAGIISSGPITTPPLRSTPRPT------GTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPrfk 849
Cdd:PHA03247 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP--- 2901
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 2462588888 850 gphvryiqkPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 892
Cdd:PHA03247 2902 ---------PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP 2935
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
329-609 |
2.33e-07 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.97 E-value: 2.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 329 KISARPTTVTPETVpRSTKPTTSSALdvseTTLVLSKRTPETLQTILIPQFELPL----STLASSEKPWIVPTAKISEDS 404
Cdd:pfam17823 113 RALAAAASSSPSSA-AQSLPAAIAAL----PSEAFSAPRAAACRANASAAPRAAIaaasAPHAASPAPRTAASSTTAASS 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 405 KVLQPQTATYDVFSSPTT-SDEPEISDSYTAT----SDRILDSIPPKTS--RTLEQPRATLAPSETPFVPQKLEIFTSPE 477
Cdd:pfam17823 188 TTAASSAPTTAASSAPATlTPARGISTAATATghpaAGTALAAVGNSSPaaGTVTAAVGTVTPAALATLAAAAGTVASAA 267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 478 MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTIT------PKISKSPEPTwttPAPGKTQFISLKPKIPLSPE 551
Cdd:pfam17823 268 GTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIqvstdqPVHNTAGEPT---PSPSNTTLEPNTPKSVASTN 344
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 552 ---VTHTKPAPKQTPRAP-PKPKTSPRPRI----PQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 609
Cdd:pfam17823 345 lavVTTTKAQAKEPSASPvPVLHTSMIPEVeatsPTTQPSPLLPTQGAAGPGILLAPEQVATEATA 410
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
890-973 |
8.35e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 45.10 E-value: 8.35e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 890 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 966
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 2462588888 967 LGEGPVS 973
Cdd:pfam00041 79 GGEGPPS 85
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
706-985 |
5.93e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 47.30 E-value: 5.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 706 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 785
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 786 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 862
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 863 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 941
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2462588888 942 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 985
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
884-1028 |
8.35e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 8.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 884 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 961
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588888 962 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1028
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
124-202 |
1.06e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.83 E-value: 1.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 2462588888 200 GVK 202
Cdd:smart00060 73 RVR 75
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
123-202 |
1.13e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 42.02 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 2462588888 200 GVK 202
Cdd:pfam00041 72 RVQ 74
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
525-655 |
2.86e-04 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 44.36 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 525 PEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSY 604
Cdd:pfam16072 146 PGSVTTTSAGSGTTVINAGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPGAPQTPLAPLNPVAAAPAAAAGAAAAPVV 225
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2462588888 605 TTPAPKDVlLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLE 655
Cdd:pfam16072 226 AAAAPAAA-APPPPAPAAPPADAAPPAPGGIICVPVRVPEPDPKDATKTIE 275
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
123-202 |
3.75e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 40.56 E-value: 3.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 2462588888 201 VK 202
Cdd:cd00063 74 VR 75
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
522-610 |
2.76e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 41.76 E-value: 2.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 522 SKSPEPTWTTPAPGKT-----QFISLKPKIPLSP-------EVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQR 589
Cdd:COG3266 255 LKAPSQASSASAPATTslgeqQEVSLPPAVAAQPaaaaaaqPSAVALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTETAA 334
|
90 100
....*....|....*....|.
gi 2462588888 590 VTAkpktsPSPEVSYTTPAPK 610
Cdd:COG3266 335 PAA-----PAPEAAAAAAAPA 350
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
889-980 |
9.76e-11 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 59.43 E-value: 9.76e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 889 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 966
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 2462588888 967 LGEGPVSNTVAFST 980
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
509-716 |
1.01e-09 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 62.78 E-value: 1.01e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 509 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVthtkPAPKQTPRAPPKPKTSPRPRIPQ---T 580
Cdd:PTZ00449 533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEF----PKDPKHPKDPEEPKKPKRPRSAQrptR 608
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 581 QPVPKVPQ--RVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMI--------SPSPSQEEL 650
Cdd:PTZ00449 609 PKSPKLPEllDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFkekfyddyLDAAAKSKE 688
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 651 QTTLEETDQSTQEPFTTKIPRTTELAKTTQAPhrfYTTVRPRTSDKPHIRPvlnrttTRPTRPKPS 716
Cdd:PTZ00449 689 TKTTVVLDESFESILKETLPETPGTPFTTPRP---LPPKLPRDEEFPFEPI------GDPDAEQPD 745
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
444-812 |
1.53e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 62.65 E-value: 1.53e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 444 PPKTSRTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPK--- 520
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRrra 2687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 521 ---------ISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPKQTPRAPPKPKTSPRP---RIPQTQPVPKVPQ 588
Cdd:PHA03247 2688 arptvgsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggpARPARPPTTAGPP 2767
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 589 RVT--AKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFT 666
Cdd:PHA03247 2768 APAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 667 TKIPRTTELA---------KTTQAPHRFYTTVRPRTSDKPhiRPVLNRTTT-------RPTRPKPSGMPSGNGVGTGVKQ 730
Cdd:PHA03247 2848 PSLPLGGSVApggdvrrrpPSRSPAAKPAAPARPPVRRLA--RPAVSRSTEsfalppdQPERPPQPQAPPPPQPQPQPPP 2925
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 731 APRPSGADRnvsvdsTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGII-----SSGPITTPPLRSTPRPTGT 805
Cdd:PHA03247 2926 PPQPQPPPP------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPASSTPPLTGH 2999
|
....*..
gi 2462588888 806 PLERIET 812
Cdd:PHA03247 3000 SLSRVSS 3006
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
319-623 |
4.47e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.47e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 319 PAESKTPEvekisARPttvtPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTILIPQFELPLSTLASSEKPW--IVP 396
Cdd:PHA03247 2702 PPPPPTPE-----PAP----HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapAPP 2772
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 397 TAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAPSETPFVPQKLEIFTSP 476
Cdd:PHA03247 2773 AAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 477 E--MQP-----TTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTItPKISKSPEPTWTTPAPGKTQFISLKPKIPLS 549
Cdd:PHA03247 2853 GgsVAPggdvrRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-PPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462588888 550 PEVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 623
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
890-970 |
2.08e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 52.62 E-value: 2.08e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 890 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 967
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 2462588888 968 GEG 970
Cdd:smart00060 81 GEG 83
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
444-734 |
3.03e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 58.41 E-value: 3.03e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 444 PPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKI 521
Cdd:PHA03247 2701 PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP 2780
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 522 SKSPEP---------------------TWTTPAPGKTQFISLKPKIPLSPEVTHTkPAPKQTPRAPPKPKTSP------- 573
Cdd:PHA03247 2781 RRLTRPavaslsesreslpspwdpadpPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLPLggsvapg 2859
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 574 ---RPRIPQTQPVPKV------PQRVTAKPKTSPSPEvSYTTPAPKdvllPHKPYPEVSQSEPAPLETRGIPFIPMISPs 644
Cdd:PHA03247 2860 gdvRRRPPSRSPAAKPaaparpPVRRLARPAVSRSTE-SFALPPDQ----PERPPQPQAPPPPQPQPQPPPPPQPQPPP- 2933
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 645 PSQEELQTTLEETDQSTQEPFTTKIPRTTELAKTtqAPHRfYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS-GNG 723
Cdd:PHA03247 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL--VPGR-VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSwASS 3010
|
330
....*....|.
gi 2462588888 724 VGTGVKQAPRP 734
Cdd:PHA03247 3011 LALHEETDPPP 3021
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
454-869 |
5.03e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 5.03e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 454 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 531
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 532 PAPGKTQFISLKPKIPLSPEVTHTKPAPkqtPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKD 611
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAP---GRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPT 2707
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 612 VLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTelakttqAPHRFYTTVRP 691
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP-------APPAAPAAGPP 2780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 692 RTSDKPHIRPVLNRTTTRPTRPKPSGMPSgngVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPL 771
Cdd:PHA03247 2781 RRLTRPAVASLSESRESLPSPWDPADPPA---AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 772 PPNNVT--GKPGSAGIISSGPiTTPPLRSTPRPtgtPLERIETDIKQPtvPASGEELEniTDFSSSPTRETDPLGKPRFK 849
Cdd:PHA03247 2858 PGGDVRrrPPSRSPAAKPAAP-ARPPVRRLARP---AVSRSTESFALP--PDQPERPP--QPQAPPPPQPQPQPPPPPQP 2929
|
410 420
....*....|....*....|
gi 2462588888 850 GPHVRYIQKPDNSPCSITDS 869
Cdd:PHA03247 2930 QPPPPPPPRPQPPLAPTTDP 2949
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
543-892 |
6.16e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 543 KPKIPLSPEVTHTKPAPKQTPRAPPKPKTSP----RPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKP 618
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPpdthAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRA 2667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 619 YPEVSQSEP-APLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKP 697
Cdd:PHA03247 2668 RRLGRAAQAsSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG 2747
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 698 HIRPVLNRTTTRPTRPK--PSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnn 775
Cdd:PHA03247 2748 PATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP--- 2824
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 776 VTGKPGSAGIISSGPITTPPLRSTPRPT------GTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPrfk 849
Cdd:PHA03247 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP--- 2901
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 2462588888 850 gphvryiqkPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 892
Cdd:PHA03247 2902 ---------PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP 2935
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
329-609 |
2.33e-07 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.97 E-value: 2.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 329 KISARPTTVTPETVpRSTKPTTSSALdvseTTLVLSKRTPETLQTILIPQFELPL----STLASSEKPWIVPTAKISEDS 404
Cdd:pfam17823 113 RALAAAASSSPSSA-AQSLPAAIAAL----PSEAFSAPRAAACRANASAAPRAAIaaasAPHAASPAPRTAASSTTAASS 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 405 KVLQPQTATYDVFSSPTT-SDEPEISDSYTAT----SDRILDSIPPKTS--RTLEQPRATLAPSETPFVPQKLEIFTSPE 477
Cdd:pfam17823 188 TTAASSAPTTAASSAPATlTPARGISTAATATghpaAGTALAAVGNSSPaaGTVTAAVGTVTPAALATLAAAAGTVASAA 267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 478 MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTIT------PKISKSPEPTwttPAPGKTQFISLKPKIPLSPE 551
Cdd:pfam17823 268 GTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIqvstdqPVHNTAGEPT---PSPSNTTLEPNTPKSVASTN 344
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 552 ---VTHTKPAPKQTPRAP-PKPKTSPRPRI----PQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 609
Cdd:pfam17823 345 lavVTTTKAQAKEPSASPvPVLHTSMIPEVeatsPTTQPSPLLPTQGAAGPGILLAPEQVATEATA 410
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
890-973 |
8.35e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 45.10 E-value: 8.35e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 890 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 966
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 2462588888 967 LGEGPVS 973
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
543-638 |
4.02e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 47.63 E-value: 4.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 543 KPKIPLSPEVTHTKPAPKQTPraPPKPKTSPRpripqtQPVPKVPQRVTAKPKTSPSPEvSYTTPAPKDVLLPHKPYPEV 622
Cdd:PRK14954 375 RNDGGVAPSPAGSPDVKKKAP--EPDLPQPDR------HPGPAKPEAPGARPAELPSPA-SAPTPEQQPPVARSAPLPPS 445
|
90
....*....|....*.
gi 2462588888 623 SQSEPAPLETRGIPFI 638
Cdd:PRK14954 446 PQASAPRNVASGKPGV 461
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
706-985 |
5.93e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 47.30 E-value: 5.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 706 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 785
Cdd:COG3401 48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 786 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 862
Cdd:COG3401 128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 863 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 941
Cdd:COG3401 208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2462588888 942 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 985
Cdd:COG3401 283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
884-1028 |
8.35e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 8.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 884 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 961
Cdd:COG3401 324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588888 962 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1028
Cdd:COG3401 398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
124-202 |
1.06e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.83 E-value: 1.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72
|
...
gi 2462588888 200 GVK 202
Cdd:smart00060 73 RVR 75
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
123-202 |
1.13e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 42.02 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 2462588888 200 GVK 202
Cdd:pfam00041 72 RVQ 74
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
510-626 |
1.26e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 45.96 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 510 RTTSAGT-------ITPKISKSPEPTWTTPAPGktqfislkpkiplSPEVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQP 582
Cdd:PRK14950 342 RTTSYGQlplelavIEALLVPVPAPQPAKPTAA-------------APSPVRPTPAPSTRPKAAAAANIPPKEPVRETAT 408
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 2462588888 583 VPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSE 626
Cdd:PRK14950 409 PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKE 452
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
516-651 |
1.29e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.23 E-value: 1.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 516 TITPKISKSPEPTWTTPAPGKTQFISLKP-KIPLSPEVTHTKPAPKQTPRAPPKPKTSPRPrIPQTQPVPKVPQRVTAKP 594
Cdd:PRK10263 368 TGEPVIAPAPEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQP-AQQPYYAPAPEQPVAGNA 446
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462588888 595 KTSPSPEVSYttpAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQ 651
Cdd:PRK10263 447 WQAEEQQSTF---APQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
404-735 |
1.46e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.23 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 404 SKVLQPQTATYDVFSSPTTSDEP--------EISDSYTATSDRILDSIP-PKTSRTLEQPRATLAPSETPFVPQKLeIFT 474
Cdd:PRK10263 297 NRATQPEYDEYDPLLNGAPITEPvavaaaatTATQSWAAPVEPVTQTPPvASVDVPPAQPTVAWQPVPGPQTGEPV-IAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 475 SPEM---QPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEptWTTPAPGKTQFISLKPKIPLSPE 551
Cdd:PRK10263 376 APEGypqQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQ--QPYYAPAPEQPVAGNAWQAEEQQ 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 552 vTHTKPAPKQTPRappKPKTSPRPRIPQTQPVPKVPQrvtaKPKTSPSPEVSYTTPAPKdvllPHKPYPEVSQSEPAPLE 631
Cdd:PRK10263 454 -STFAPQSTYQTE---QTYQQPAAQEPLYQQPQPVEQ----QPVVEPEPVVEETKPARP----PLYYFEEVEEKRARERE 521
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 632 TRGIPFIPMisPSPSQEElqttlEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPVLNRTTTRPT 711
Cdd:PRK10263 522 QLAAWYQPI--PEPVKEP-----EPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGP 594
|
330 340
....*....|....*....|....
gi 2462588888 712 RPKPSgmpsgNGVGtgvKQAPRPS 735
Cdd:PRK10263 595 RPQVK-----EGIG---PQLPRPK 610
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
454-864 |
2.63e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.16 E-value: 2.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 454 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTP 532
Cdd:PHA03307 49 ELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPaNESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 533 APGKTQFISLKPKIPLSPEVTHTKPAPKQTPRAPPKPKTSPRPRIPqtqPVPKVPQrvTAKPKTSPSPEVSYTTPAPKDV 612
Cdd:PHA03307 129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAL---PLSSPEE--TARAPSSPPAEPPPSTPPAAAS 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 613 LLPHKPYPEVS--QSEPAPLETRGIPFIPMISPSPSqeelqttLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVR 690
Cdd:PHA03307 204 PRPPRRSSPISasASSPAPAPGRSAADDAGASSSDS-------SSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWN 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 691 PRTSDKPHIRPvlnRTTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPrrkp 770
Cdd:PHA03307 277 GPSSRPGPASS---SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR---- 349
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 771 lPPNNVTGKPGSAGiisSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKG 850
Cdd:PHA03307 350 -SPSPSRPPPPADP---SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGA 425
|
410
....*....|....
gi 2462588888 851 PHVRYiqkPDNSPC 864
Cdd:PHA03307 426 FYARY---PLLTPS 436
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
525-655 |
2.86e-04 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 44.36 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 525 PEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSY 604
Cdd:pfam16072 146 PGSVTTTSAGSGTTVINAGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPGAPQTPLAPLNPVAAAPAAAAGAAAAPVV 225
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2462588888 605 TTPAPKDVlLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLE 655
Cdd:pfam16072 226 AAAAPAAA-APPPPAPAAPPADAAPPAPGGIICVPVRVPEPDPKDATKTIE 275
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
123-202 |
3.75e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 40.56 E-value: 3.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73
|
..
gi 2462588888 201 VK 202
Cdd:cd00063 74 VR 75
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
481-694 |
8.19e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.71 E-value: 8.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 481 TTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPK 560
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 561 QTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPevSYTTPAPKDVLLPHKPYPEVSQSEPApletrgipfipm 640
Cdd:PRK12323 452 PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP--ADDDPPPWEELPPEFASPAPAQPDAA------------ 517
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 641 ispsPSQEELQTTLEETDQSTQEPFTTKIPRTT--ELAKTTQAPHRFYTTVRPRTS 694
Cdd:PRK12323 518 ----PAGWVAESIPDPATADPDDAFETLAPAPAaaPAPRAAAATEPVVAPRPPRAS 569
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
478-892 |
8.23e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 8.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 478 MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKIS--KSPEPTWTTPAPGKTQFISLKPKIPlSPEVTHT 555
Cdd:pfam03154 167 LQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpaTSQPPNQTQSTAAPHTLIQQTPTLH-PQRLPSP 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 556 KPAPKQTPRAPPKPKTSPRPRIPQTQ--PVPKVPQRVTAKPKTSPSPevsyttpapkdvlLPHKPYPEVSQSEpaplETR 633
Cdd:pfam03154 246 HPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQTGPSHMQHP-------------VPPQPFPLTPQSS----QSQ 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 634 GIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPRTTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTR 712
Cdd:pfam03154 309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PREQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSG 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 713 PKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPit 792
Cdd:pfam03154 381 PSPFQMNS---------NLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP-- 448
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 793 tPPLRSTPRPTGTPLErietdiKQPTVPasgeelenitdfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-- 870
Cdd:pfam03154 449 -PTSGLHQVPSQSPFP------QHPFVP-------------GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpa 508
|
410 420 430
....*....|....*....|....*....|...
gi 2462588888 871 -----------KRFPKEEATEGNATSPPQNPPT 892
Cdd:pfam03154 509 avscplppvqiKEEALDEAEEPESPPPPPRSPS 541
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
448-714 |
8.24e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.52 E-value: 8.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 448 SRTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTT-PAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPE 526
Cdd:PTZ00449 537 SKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKiPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPE 616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 527 PTwTTPAPGKTQFISLKPKIPLSPevthTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRV----------TAKPKT 596
Cdd:PTZ00449 617 LL-DIPKSPKRPESPKSPKRPPPP----QRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFyddyldaaakSKETKT 691
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 597 SPSPEVSYTT-------PAPKDVLLPHKPYPEVSQSEPA-PLETRGIPFIPmiSPSPSQ-----EELQTTLEETDQSTQE 663
Cdd:PTZ00449 692 TVVLDESFESilketlpETPGTPFTTPRPLPPKLPRDEEfPFEPIGDPDAE--QPDDIEfftppEEERTFFHETPADTPL 769
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 2462588888 664 P-FTTKIPRTTELAKTTQAPHRfyTTVRPRTSDKPHIRPvlnrTTTRPTRPK 714
Cdd:PTZ00449 770 PdILAEEFKEEDIHAETGEPDE--AMKRPDSPSEHEDKP----PGDHPSLPK 815
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
388-587 |
1.03e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 43.52 E-value: 1.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 388 SSEKPWIVPTAKISEDSKVLQPqtATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTleQPRATLAPSETPfvp 467
Cdd:PHA03378 619 SAPRQWPMPLRPIPMRPLRMQP--ITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPT--GANTMLPIQWAP--- 691
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 468 qkleifTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTwTTPAPGKTQFISLKPKIP 547
Cdd:PHA03378 692 ------GTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRA 764
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 2462588888 548 LSPEVTHTKPAPKQTPRAPPKPKTSPRPRiPQTQPVPKVP 587
Cdd:PHA03378 765 RPPAAAPGAPTPQPPPQAPPAPQQRPRGA-PTPQPPPQAG 803
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
522-610 |
1.18e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 41.91 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 522 SKSPEPTWTTPAPGKTQFISLKPKIPLSPEVThtkPAPKQTPRAPPKPKtsPRPRiPQTQPVPKVPQRVTAKPKTSPSPE 601
Cdd:PRK11633 65 TQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP---VEPEPAPVEPPKPK--PVEK-PKPKPKPQQKVEAPPAPKPEPKPV 138
|
....*....
gi 2462588888 602 VSyTTPAPK 610
Cdd:PRK11633 139 VE-EKAAPT 146
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
420-806 |
1.21e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 1.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 420 PTTSDEPEISDSYTATSDriLDSIPPKTSRTLEQPratLAPSETPF---VPQKLEIFTSPEMQPTTPAPQQTTSIPSTPK 496
Cdd:pfam05109 455 PTNLTAPASTGPTVSTAD--VTSPTPAGTTSGASP---VTPSPSPRdngTESKAPDMTSPTSAVTTPTPNATSPTPAVTT 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 497 RRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTqfisLKPKIPLSPEVTHTKPAPKQTprAPPKPKTSPRPR 576
Cdd:pfam05109 530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT----IPTLGKTSPTSAVTTPTPNAT--SPTVGETSPQAN 603
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 577 IP-QTQPVPKVPQRVTAKPKTSPSP----EVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQ 651
Cdd:pfam05109 604 TTnHTLGGTSSTPVVTSPPKNATSAvttgQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQ 683
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 652 TTLEETdqSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPSGNGVGTGVKQA 731
Cdd:pfam05109 684 VTPAST--STHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANST 761
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588888 732 PRPSGADRNVSVDSTHPTkkPGTRRPPLPPRPTHPRRKPLPPNNvTGKPGSAGIISSGPITTPPLRSTPRPTGTP 806
Cdd:pfam05109 762 TGGKHTTGHGARTSTEPT--TDYGGDSTTPRTRYNATTYLPPST-SSKLRPRWTFTSPPVTTAQATVPVPPTSQP 833
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
528-620 |
2.65e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 41.91 E-value: 2.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 528 TWTTPAPGKTQFISLKPKIPLSPEvTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPkVPQRVTAKPKTSPSPEVSYTTP 607
Cdd:PHA03369 350 TASLTAPSRVLAAAAKVAVIAAPQ-THTGPADRQRPQRPDGIPYSVPARSPMTAYPP-VPQFCGDPGLVSPYNPQSPGTS 427
|
90
....*....|...
gi 2462588888 608 APKDVLLPHKPYP 620
Cdd:PHA03369 428 YGPEPVGPVPPQP 440
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
522-610 |
2.76e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 41.76 E-value: 2.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 522 SKSPEPTWTTPAPGKT-----QFISLKPKIPLSP-------EVTHTKPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQR 589
Cdd:COG3266 255 LKAPSQASSASAPATTslgeqQEVSLPPAVAAQPaaaaaaqPSAVALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTETAA 334
|
90 100
....*....|....*....|.
gi 2462588888 590 VTAkpktsPSPEVSYTTPAPK 610
Cdd:COG3266 335 PAA-----PAPEAAAAAAAPA 350
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
335-751 |
3.24e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 3.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 335 TTVTPETVPRST---KPTTSSALDVSETTLVLSKRTPETLQTIlipQFELPLSTLASSEKPWIVPTAKISEDSKVLQPQT 411
Cdd:pfam17823 84 TEVTAEHTPHGTdlsEPATREGAADGAASRALAAAASSSPSSA---AQSLPAAIAALPSEAFSAPRAAACRANASAAPRA 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 412 ATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPrATLAPSeTPFVPQKLEIFTsPEMQPTTPAPQQTTSI 491
Cdd:pfam17823 161 AIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAP-ATLTPA-RGISTAATATGH-PAAGTALAAVGNSSPA 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 492 PSTPKRRPRPKPPRTKPERTTSAGTIT--PKISKSPEPTWTTPAPGKTqfislkpkiplspevthtkpAPKQTPRAPPKP 569
Cdd:pfam17823 238 AGTVTAAVGTVTPAALATLAAAAGTVAsaAGTINMGDPHARRLSPAKH--------------------MPSDTMARNPAA 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 570 KTSPRPRIPQTQPVPKVPqrvtakpktspspeVSYTTPAPKdvllphkpypevsqsePAPLETRGIPFIPMISPSPSQee 649
Cdd:pfam17823 298 PMGAQAQGPIIQVSTDQP--------------VHNTAGEPT----------------PSPSNTTLEPNTPKSVASTNL-- 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 650 lqTTLEETDQSTQEPFTTKIPRttelakttqaphrfyttvrPRTSDKPHIRpvlnrtTTRPTrPKPSGMPSGNGV-GTGV 728
Cdd:pfam17823 346 --AVVTTTKAQAKEPSASPVPV-------------------LHTSMIPEVE------ATSPT-TQPSPLLPTQGAaGPGI 397
|
410 420
....*....|....*....|...
gi 2462588888 729 KQAPRPSGADRNVSVDSTHPTKK 751
Cdd:pfam17823 398 LLAPEQVATEATAGTASAGPTPR 420
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
456-645 |
3.76e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 3.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 456 ATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPG 535
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 536 KTqfislkpkiPLSPEVTHTKPAPKQTPRAPPKPKTSPRPRIP--QTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVL 613
Cdd:PRK12323 452 PA---------PAAAPAAAARPAAAGPRPVAAAAAAAPARAAPaaAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWV 522
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 2462588888 614 LPHKPYPEVSQSEP-----------APLETRGIPFIPMISPSP 645
Cdd:PRK12323 523 AESIPDPATADPDDafetlapapaaAPAPRAAAATEPVVAPRP 565
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
480-738 |
3.78e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 41.45 E-value: 3.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 480 PTTPAPQQTTSIPSTPKRRprpkpprtkpERTTSAGTITPKISKSPEPTWTTPAPGKTqfislkpkiPLSPEVTHTKPAP 559
Cdd:PLN03209 312 PLTPMEELLAKIPSQRVPP----------KESDAADGPKPVPTKPVTPEAPSPPIEEE---------PPQPKAVVPRPLS 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 560 KQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPApkdvllphkpypeVSQSEPAPLETRGI---- 635
Cdd:PLN03209 373 PYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASN-------------VPEVEPAQVEAKKTrpls 439
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 636 PFI------PMISPSPSQEE-LQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPvlnrttt 708
Cdd:PLN03209 440 PYAryedlkPPTSPSPTAPTgVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSP------- 512
|
250 260 270
....*....|....*....|....*....|.
gi 2462588888 709 RPTRPKPSGMP-SGNGVGTGVKQAPRPSGAD 738
Cdd:PLN03209 513 SPAAPVGKVAPsSTNEVVKVGNSAPPTALAD 543
|
|
| Orthopox_A5L |
pfam06193 |
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The ... |
509-674 |
4.14e-03 |
|
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The vaccinia virus WR A5L open reading frame (corresponding to open reading frame A4L in vaccinia virus Copenhagen) encodes an immunodominant late protein found in the core of the vaccinia virion. The A5 protein appears to be required for the immature virion to form the brick-shaped intracellular mature virion.
Pssm-ID: 283778 [Multi-domain] Cd Length: 216 Bit Score: 39.96 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 509 ERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIP--LSPEVTHTKP-APKQTPRAPPKPKTSPRP-RIPQTQPVP 584
Cdd:pfam06193 13 EDATAKNSAYYSEEDDLDILGKKDEAGEIGFESQERQYQqqLIEQLAKDNMlAASRQPIQPLQPTIHITPiEIPTPAPTP 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 585 KVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRgIPFIPMISPSP---SQEELQTTLEETDQST 661
Cdd:pfam06193 93 KPRQQELGTPSTSCTQNSDASIACSTDIVTPPQPPIVATVCTPTPTDGR-ICTTADQNPNPgatIQKELDNMALKDLMSS 171
|
170
....*....|...
gi 2462588888 662 QEPFTTKIPRTTE 674
Cdd:pfam06193 172 VEKDMCQLQAESE 184
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
512-613 |
5.14e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 40.95 E-value: 5.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 512 TSAGTITPKiSKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPKQTPR--APPKPKTSPRPRIPQTQPVPKVPQR 589
Cdd:PRK14950 364 PAPQPAKPT-AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRpvAPPVPHTPESAPKLTRAAIPVDEKP 442
|
90 100
....*....|....*....|....
gi 2462588888 590 VTAKPktSPSPEVSYTTPAPKDVL 613
Cdd:PRK14950 443 KYTPP--APPKEEEKALIADGDVL 464
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
518-649 |
6.30e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588888 518 TPKISKSPEPTWTTPA-PGKTQFISLKPKIPLSPEVTHTKPAPKQTPR---APPKPKTSPRPRIPQTQPvPKVPQRVTAK 593
Cdd:PRK10263 754 QPQQPVAPQQQYQQPQqPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQpqyQQPQQPVAPQPQYQQPQQ-PVAPQPQYQQ 832
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588888 594 PKTSPSpevsyttPAPKDVLLpHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEE 649
Cdd:PRK10263 833 PQQPVA-------PQPQDTLL-HPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
|
|
|