Conserved domains on [gi|578807234|ref|XP_006713632|]

target of Nesh-SH3 isoform X31 [Homo sapiens]

Protein Classification

FN3 domain-containing protein (domain architecture ID 13885377)

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
778-1275 5.53e-16


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.53e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578807234 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 super family cl33720
large tegument protein UL36; Provisional
478-980 1.71e-12


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.71e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578807234  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
1506-1597 9.58e-10


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.58e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1506 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1583
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 578807234 1584 LGEGPVSNTVAFST 1597
Cdd:cd00063    80 GGESPPSESVTVTT 93
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
1115-1463 3.34e-07


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.47  E-value: 3.34e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1115 PVTSPSLEMTESQPVSDVLESvtlstESPKETIAPAKTDYVYPTAKaplwPEEPKTEVVESITYVSEPPEttletspLPS 1194
Cdd:PTZ00449  520 PPKAPGDKEGEEGEHEDSKES-----DEPKEGGKPGETKEGEVGKK----PGPAKEHKPSKIPTLSKKPE-------FPK 583
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1195 QSitlPSPDEPQtEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA--KPKTSPSPEVSYTTPAPKDVLLPHKPYP 1272
Cdd:PTZ00449  584 DP---KHPKDPE-EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESpkSPKRPPPPQRPSSPERPEGPKIIKSPKP 659
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1273 EVSQSEPVLQPVTFRF------EPPKTTIAPLETRGIPFIPMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGVK 1346
Cdd:PTZ00449  660 PKSPKPPFDPKFKEKFyddyldAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1347 QAPRPSGADRNVSVDSthptKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITtpPLRSTPRPTGTpler 1426
Cdd:PTZ00449  740 DAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKPPGD---- 809
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 578807234 1427 ietdikQPTVPASGEELENI----TDFSSSPTR-ETDPLGKP 1463
Cdd:PTZ00449  810 ------HPSLPKKRHRLDGLalstTDLESDAGRiAKDASGKI 845
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
124-202 5.43e-04


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.43e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 578807234    200 GVK 202
Cdd:smart00060   73 RVR 75
PHA03247 super family cl33720
large tegument protein UL36; Provisional
264-587 3.13e-03


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 578807234  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
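
Each hit above carries a summary line of the same shape (Pssm-ID, hit type, Cd Length, Bit Score, E-value) and an interval such as 778-1275. The following is a minimal sketch of how those lines could be read into records, assuming the layout seen in this text dump; `parse_summary` and `interval_length` are hypothetical helper names and are not part of any NCBI tool.

```python
import re

# Assumption: summary lines look like the ones in this dump, e.g.
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.53e-16"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]*)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def interval_length(interval):
    """Number of query residues in an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1
```

For example, `interval_length("778-1275")` counts the query residues covered by the first hit, and `parse_summary` returns `None` for alignment rows, so it can be applied to every line of the dump to pick out the summaries.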
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1275 5.53e-16


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.53e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578807234 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-980 1.71e-12


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.71e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578807234  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
1506-1597 9.58e-10


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.58e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1506 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1583
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 578807234 1584 LGEGPVSNTVAFST 1597
Cdd:cd00063    80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
1507-1587 1.34e-07


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.34e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   1507 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1584
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 578807234   1585 GEG 1587
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1115-1463 3.34e-07


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.47  E-value: 3.34e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1115 PVTSPSLEMTESQPVSDVLESvtlstESPKETIAPAKTDYVYPTAKaplwPEEPKTEVVESITYVSEPPEttletspLPS 1194
Cdd:PTZ00449  520 PPKAPGDKEGEEGEHEDSKES-----DEPKEGGKPGETKEGEVGKK----PGPAKEHKPSKIPTLSKKPE-------FPK 583
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1195 QSitlPSPDEPQtEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA--KPKTSPSPEVSYTTPAPKDVLLPHKPYP 1272
Cdd:PTZ00449  584 DP---KHPKDPE-EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESpkSPKRPPPPQRPSSPERPEGPKIIKSPKP 659
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1273 EVSQSEPVLQPVTFRF------EPPKTTIAPLETRGIPFIPMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGVK 1346
Cdd:PTZ00449  660 PKSPKPPFDPKFKEKFyddyldAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1347 QAPRPSGADRNVSVDSthptKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITtpPLRSTPRPTGTpler 1426
Cdd:PTZ00449  740 DAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKPPGD---- 809
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 578807234 1427 ietdikQPTVPASGEELENI----TDFSSSPTR-ETDPLGKP 1463
Cdd:PTZ00449  810 ------HPSLPKKRHRLDGLalstTDLESDAGRiAKDASGKI 845
fn3 pfam00041
Fibronectin type III domain;
1507-1590 4.23e-05


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1507 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1583
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 578807234  1584 LGEGPVS 1590
Cdd:pfam00041   79 GGEGPPS 85
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
124-202 5.43e-04


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.43e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 578807234    200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 5.49e-04


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 578807234   200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1501-1645 6.05e-04


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.61  E-value: 6.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1501 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1578
Cdd:COG3401   324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578807234 1579 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1645
Cdd:COG3401   398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
935-1261 1.45e-03


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   935 PEASTTLASKTSQRT-RRPRLRTKTTPRPEAPESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVP 1013
Cdd:pfam17823   66 APAPVTLTKGTSAAHlNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1014 QTKlVPSTdlePGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----SPEVPQNKSVSVTGF 1089
Cdd:pfam17823  146 RAA-ACRA---NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApatlTPARGISTAATATGH 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1090 EPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESP-KETIAPAKTDYVYPTAKAPLWPEEP 1168
Cdd:pfam17823  222 PAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPhARRLSPAKHMPSDTMARNPAAPMGA 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1169 KTEVVESITYVSEPPETTlETSPLPSQSITLPSPDEPQ-----------TEPAPKQTPRAPPKP--KTSPRPRI----PQ 1231
Cdd:pfam17823  302 QAQGPIIQVSTDQPVHNT-AGEPTPSPSNTTLEPNTPKsvastnlavvtTTKAQAKEPSASPVPvlHTSMIPEVeatsPT 380
                          330       340       350
                   ....*....|....*....|....*....|
gi 578807234  1232 TQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 1261
Cdd:pfam17823  381 TQPSPLLPTQGAAGPGILLAPEQVATEATA 410
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
123-202 2.70e-03


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 2.70e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063     3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                  ..
gi 578807234  201 VK 202
Cdd:cd00063    74 VR 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
264-587 3.13e-03


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 578807234  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1275 5.53e-16


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.53e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578807234 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
635-1025 3.83e-14


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.44  E-value: 3.83e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFET 710
Cdd:PHA03247 2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  711 EAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA--PTTTTKRTR 788
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEP 2710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  789 RPHPKPKTTPHPEVPQT--KLVPATILEPVLRTEASGTT--------AAPKVPQRTHRPHPK--PKTTLSPEELQTELVP 856
Cdd:PHA03247 2711 APHALVSATPLPPGPAAarQASPALPAAPAPPAVPAGPAtpggparpARPPTTAGPPAPAPPaaPAAGPPRRLTRPAVAS 2790
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  857 ATIFEPVSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATvlePVTLRPE 936
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG---DVRRRPP 2867
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  937 ASTTlASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTlrtetwvTTQAPKTSQRTRRPRPKTKTTPSPEV---P 1013
Cdd:PHA03247 2868 SRSP-AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP-------QPQAPPPPQPQPQPPPPPQPQPPPPPpprP 2939
                         410
                  ....*....|..
gi 578807234 1014 QTKLVPSTDLEP 1025
Cdd:PHA03247 2940 QPPLAPTTDPAG 2951
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-980 1.71e-12


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.71e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578807234  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
1506-1597 9.58e-10


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.58e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1506 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1583
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 578807234 1584 LGEGPVSNTVAFST 1597
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
382-800 1.03e-09


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 1.03e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247 2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247 2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                         410       420       430
                  ....*....|....*....|....*....|....*....
gi 578807234  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 800
Cdd:PHA03247 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
1507-1587 1.34e-07


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.34e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   1507 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1584
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 578807234   1585 GEG 1587
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1115-1463 3.34e-07


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.47  E-value: 3.34e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1115 PVTSPSLEMTESQPVSDVLESvtlstESPKETIAPAKTDYVYPTAKaplwPEEPKTEVVESITYVSEPPEttletspLPS 1194
Cdd:PTZ00449  520 PPKAPGDKEGEEGEHEDSKES-----DEPKEGGKPGETKEGEVGKK----PGPAKEHKPSKIPTLSKKPE-------FPK 583
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1195 QSitlPSPDEPQtEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA--KPKTSPSPEVSYTTPAPKDVLLPHKPYP 1272
Cdd:PTZ00449  584 DP---KHPKDPE-EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESpkSPKRPPPPQRPSSPERPEGPKIIKSPKP 659
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1273 EVSQSEPVLQPVTFRF------EPPKTTIAPLETRGIPFIPMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGVK 1346
Cdd:PTZ00449  660 PKSPKPPFDPKFKEKFyddyldAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1347 QAPRPSGADRNVSVDSthptKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITtpPLRSTPRPTGTpler 1426
Cdd:PTZ00449  740 DAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKPPGD---- 809
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 578807234 1427 ietdikQPTVPASGEELENI----TDFSSSPTR-ETDPLGKP 1463
Cdd:PTZ00449  810 ------HPSLPKKRHRLDGLalstTDLESDAGRiAKDASGKI 845
PRK10263 PRK10263
DNA translocase FtsK; Provisional
701-1337 5.08e-07


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 55.09  E-value: 5.08e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  701 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 780
Cdd:PRK10263  315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  781 TTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIF 860
Cdd:PRK10263  383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  861 EPVSPIKE--APGTTFVPVTDLEPVTFRTEIPATTlATKTSKRTRPPRPRPKTTPSPQ----APETKPVPATVLEPVTLR 934
Cdd:PRK10263  463 QTEQTYQQpaAQEPLYQQPQPVEQQPVVEPEPVVE-ETKPARPPLYYFEEVEEKRAREreqlAAWYQPIPEPVKEPEPIK 541
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  935 PEASTTLASKTSqrtrrprlrtkttPRPEAPESKPVpTAELKPVTLRTETWVTTQAPKTSQRTR-RPRPKTKTTPSPEVP 1013
Cdd:PRK10263  542 SSLKAPSVAAVP-------------PVEAAAAVSPL-ASGVKKATLATGAAATVAAPVFSLANSgGPRPQVKEGIGPQLP 607
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1014 QtklvpstdlepgtlrteaPKTMVVTTVLEPDTFRTKFPETTLAPKtqrtrrprprpKTTSSPEVPQNKSVSVTGFEPVV 1093
Cdd:PRK10263  608 R------------------PKRIRVPTRRELASYGIKLPSQRAAEE-----------KAREAQRNQYDSGDQYNDDEIDA 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1094 HSTDAPGTTFALTELQTLilkpvtspSLEMTESQPVSDVLESVTLSTESPKETIAPAKTDYV--YPTAKAPLWPEEPKTE 1171
Cdd:PRK10263  659 MQQDELARQFAQTQQQRY--------GEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSgeQPAGANPFSLDDFEFS 730
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1172 VVESItyVSEPPETTLETsPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQ-----PVPKVPQRVTAKP 1246
Cdd:PRK10263  731 PMKAL--LDDGPHEPLFT-PIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPqyqqpQQPVAPQPQYQQP 807
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1247 KTSPSPEVSYTTPAPkdvllPHKPYPEVSQSEPVLQPvtfrfEPPKTTIAPLETRG------------IPFIPMISPSPS 1314
Cdd:PRK10263  808 QQPVAPQPQYQQPQQ-----PVAPQPQYQQPQQPVAP-----QPQDTLLHPLLMRNgdsrplhkpttpLPSLDLLTPPPS 877
                         650       660
                  ....*....|....*....|...
gi 578807234 1315 QEELQTTLAPHRFYTTVRPRTSD 1337
Cdd:PRK10263  878 EVEPVDTFALEQMARLVEARLAD 900
PHA03247 PHA03247
large tegument protein UL36; Provisional
960-1471 6.46e-06


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 6.46e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  960 PRPEAPESKPVPTAEL-------KPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQtKLVPSTDLEPgtlrtEA 1032
Cdd:PHA03247 2504 PDPDAPPAPSRLAPAIlpdepvgEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPD-RSVPPPRPAP-----RP 2577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1033 PKTMVVTTVLEPDTfrtkfPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLI 1112
Cdd:PHA03247 2578 SEPAVTSRARRPDA-----PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP 2652
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1113 LKPVTSPSLEMTESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPL 1192
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1193 PSQSITlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKV-PQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 1271
Cdd:PHA03247 2733 PALPAA-PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgPPRRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1272 PEVSQSEPVLQ-PVTFrfEPPKTTIAPletrgipfipmISPSPSQEELQTTLAPhrfyttvrprtsdKPHIRPGVKQAPR 1350
Cdd:PHA03247 2812 LAPAAALPPAAsPAGP--LPPPTSAQP-----------TAPPPPPGPPPPSLPL-------------GGSVAPGGDVRRR 2865
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1351 PsgadrnvsvdsthptkkpgtrrpPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITTPPLRSTPRPTGTPLERIETD 1430
Cdd:PHA03247 2866 P-----------------------PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|.
gi 578807234 1431 IKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVR 1471
Cdd:PHA03247 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
fn3 pfam00041
Fibronectin type III domain;
1507-1590 4.23e-05


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1507 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1583
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 578807234  1584 LGEGPVS 1590
Cdd:pfam00041   79 GGEGPPS 85
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1085-1352 4.87e-05


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.54  E-value: 4.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1085 SVTgfEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESPkeTIAPAKTDYV-YPTAKAPL 1163
Cdd:PRK10263  315 PIT--EPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEP--VIAPAPEGYPqQSQYAQPA 390
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1164 WP-----EEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPK-----PKTSPRPRIPQTQ 1233
Cdd:PRK10263  391 VQyneplQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQqstfaPQSTYQTEQTYQQ 470
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1234 PVPK-----VPQRVTAKPKTSPSPEVSYTTPAPKDVL----LPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETRGIP 1304
Cdd:PRK10263  471 PAAQeplyqQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVA 550
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1305 FIPMISPSPSQEEL-----QTTLAPHRFYTTVRPRTS------DKPHIRPGV-KQAPRPS 1352
Cdd:PRK10263  551 AVPPVEAAAAVSPLasgvkKATLATGAAATVAAPVFSlansggPRPQVKEGIgPQLPRPK 610
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
533-980 7.19e-05


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 7.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  533 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVTHTKPAP-EPQTLLPSQSTIGPETPGTKPST 606
Cdd:PTZ00449  533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEFPKDPKHPkDPEEPKKPKRPRSAQRPTRPKSP 612
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  607 TLaprktkrpgrrprprprpktTPSPEVPKSKPALEPATIQPEPLVPTtaskpseRPKTTHRPDAPQIQPGSKPPKQllP 686
Cdd:PTZ00449  613 KL--------------------PELLDIPKSPKRPESPKSPKRPPPPQ-------RPSSPERPEGPKIIKSPKPPKS--P 663
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  687 KPqttaepdmpptksvsepvPFEteaPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPrpetLQTK 766
Cdd:PTZ00449  664 KP------------------PFD---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP----FTTP 718
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  767 LDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLS 846
Cdd:PTZ00449  719 RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS 798
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  847 PEELQTElvpATIFEPVSPIKEAPGTTF-VPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVpa 925
Cdd:PTZ00449  799 PSEHEDK---PPGDHPSLPKKRHRLDGLaLSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKI-- 873
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578807234  926 tVLEPVTLRPEASTTLASKTSQ----RTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PTZ00449  874 -VVDDDGTEADDEDTHPPEEKHksevRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1184-1278 8.07e-05


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 8.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1184 ETTLETSPLPSQ-SITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 1262
Cdd:PRK14950  357 EALLVPVPAPQPaKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436
                          90
                  ....*....|....*.
gi 578807234 1263 DVLLPHKPYPEVSQSE 1278
Cdd:PRK14950  437 PVDEKPKYTPPAPPKE 452
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1158-1419 1.75e-04


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 1.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1158 TAKAPLW--PEEPKTEVVESITYVSEPPETTLETSPLPSqsitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPV 1235
Cdd:PRK10263  327 TTATQSWaaPVEPVTQTPPVASVDVPPAQPTVAWQPVPG-----PQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1236 PkvPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQS-----------------EPVLQPVTFRFEPPKTTIAPL 1298
Cdd:PRK10263  402 Q--PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPvagnawqaeeqqstfapQSTYQTEQTYQQPAAQEPLYQ 479
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1299 ETRGIPFIPMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGVKQaPRPSGADRNVSVDSTHPTKKPGTRRPPLPP 1378
Cdd:PRK10263  480 QPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQ-PIPEPVKEPEPIKSSLKAPSVAAVPPVEAA 558
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 578807234 1379 RPTHPRRKPLppNNVTGKPGSAGIISSgPITTPPLRSTPRP 1419
Cdd:PRK10263  559 AAVSPLASGV--KKATLATGAAATVAA-PVFSLANSGGPRP 596
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1138-1509 2.25e-04


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 2.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1138 LSTESPKETIAPAKTDYVYPTAKAPLWPE-EPKTEVV--ESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT 1214
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPpGPGTEAPanESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1215 PRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPypevsqsepvlqpvtfrfEPPKTT 1294
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSP------------------EETARA 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1295 IAPletrgipfiPMISPSPSQEELQTTLAPHRFYTTVRPRTSDkPHIRPGVKQAPRPSGAdrnvSVDSTHPTKKPGTRRP 1374
Cdd:PHA03307  186 PSS---------PPAEPPPSTPPAAASPRPPRRSSPISASASS-PAPAPGRSAADDAGAS----SSDSSSSESSGCGWGP 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1375 PLPPRPTHPRRKPLPPNNVTGKPGSAGiiSSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPT 1454
Cdd:PHA03307  252 ENECPLPRPAPITLPTRIWEASGWNGP--SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSST 329
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 578807234 1455 RETDPlgKPRfkGPHVRYIQKPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 1509
Cdd:PHA03307  330 SSSSE--SSR--GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA 380
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
124-202 5.43e-04


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.43e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 578807234    200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 5.49e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 578807234   200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1501-1645 6.05e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.61  E-value: 6.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1501 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1578
Cdd:COG3401   324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578807234 1579 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1645
Cdd:COG3401   398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
PRK11633 PRK11633
cell division protein DedD; Provisional
1159-1262 8.46e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 8.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1159 AKAPLWP---EEPKTEVVESITYV--SEPPETTLE-----TSPLPSQSITLPSPDEPQTEPAPKqtPRAPPKPKtsPRPR 1228
Cdd:PRK11633   39 AAIPLVPkpgDRDEPDMMPAATQAlpTQPPEGAAEavragDAAAPSLDPATVAPPNTPVEPEPA--PVEPPKPK--PVEK 114
                          90       100       110
                  ....*....|....*....|....*....|....
gi 578807234 1229 iPQTQPVPKVPQRVTAKPKTSPSPEVSyTTPAPK 1262
Cdd:PRK11633  115 -PKPKPKPQQKVEAPPAPKPEPKPVVE-EKAAPT 146
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1500-1602 9.51e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 43.84  E-value: 9.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1500 ATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMTNQTFSTVENLKPNTSYEFQVK 1579
Cdd:COG3401   229 PTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTNGTTYYYRVT 303
                          90       100
                  ....*....|....*....|....
gi 578807234 1580 PKNPLG-EGPVSNTVAFSTESADP 1602
Cdd:COG3401   304 AVDAAGnESAPSNVVSVTTDLTPP 327
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
639-707 1.29e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 1.29e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578807234  639 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 707
Cdd:PRK14950  362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1103-1337 1.39e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 1.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1103 FALTELQTLILKPVTSPSlemtESQPVSDVLESVTLSTE--SPKETIAPAKTDYVYPTAKAPLwPEEPKTEVVESITYVS 1180
Cdd:PRK12323  353 FTMTLLRMLAFRPGQSGG----GAGPATAAAAPVAQPAPaaAAPAAAAPAPAAPPAAPAAAPA-AAAAARAVAAAPARRS 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1181 EPPEttletsplPSQSITLPSPDEPQTEPAPkqTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPA 1260
Cdd:PRK12323  428 PAPE--------ALAAARQASARGPGGAPAP--APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP 497
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578807234 1261 PKDVLLPHKPYPEVSQSEPVLQP-VTFRFEPPKTTIAPLETRGIPFIPMISPSPSQEELQTTLAPHRFYTTVRPRTSD 1337
Cdd:PRK12323  498 PWEELPPEFASPAPAQPDAAPAGwVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
935-1261 1.45e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   935 PEASTTLASKTSQRT-RRPRLRTKTTPRPEAPESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVP 1013
Cdd:pfam17823   66 APAPVTLTKGTSAAHlNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1014 QTKlVPSTdlePGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----SPEVPQNKSVSVTGF 1089
Cdd:pfam17823  146 RAA-ACRA---NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApatlTPARGISTAATATGH 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1090 EPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESP-KETIAPAKTDYVYPTAKAPLWPEEP 1168
Cdd:pfam17823  222 PAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPhARRLSPAKHMPSDTMARNPAAPMGA 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1169 KTEVVESITYVSEPPETTlETSPLPSQSITLPSPDEPQ-----------TEPAPKQTPRAPPKP--KTSPRPRI----PQ 1231
Cdd:pfam17823  302 QAQGPIIQVSTDQPVHNT-AGEPTPSPSNTTLEPNTPKsvastnlavvtTTKAQAKEPSASPVPvlHTSMIPEVeatsPT 380
                          330       340       350
                   ....*....|....*....|....*....|
gi 578807234  1232 TQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 1261
Cdd:pfam17823  381 TQPSPLLPTQGAAGPGILLAPEQVATEATA 410
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
657-1028 1.76e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  657 SKPSERPKTTHRPDAPQIQPGSKPPKqllpkPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTT 736
Cdd:PTZ00449  537 SKESDEPKEGGKPGETKEGEVGKKPG-----PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRS--AQRPTRP 609
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  737 LAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEV-PQTK-LVPATILE 814
Cdd:PTZ00449  610 KSPKLPELLDIPKSPKRPESPKSPKR--------PPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFdPKFKeKFYDDYLD 681
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  815 PVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVP------ATIFEPV-SPIKEAPGTTFVPVTDLEPVTFRT 887
Cdd:PTZ00449  682 AAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPklprdeEFPFEPIgDPDAEQPDDIEFFTPPEEERTFFH 761
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  888 EIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPvpatvLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTPRpEAPES 967
Cdd:PTZ00449  762 ETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRP-----DSPSEHEDKPPGDHPSLPKKRHRLDGLALSTTDL-ESDAG 835
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  968 KPVPTAELKPVTLR-----------------------------------TETWVTTQAPKTSQRTRRPRPKTKTTPSPEV 1012
Cdd:PTZ00449  836 RIAKDASGKIVKLKrsksfddlttveeaeemgaearkivvdddgteaddEDTHPPEEKHKSEVRRRRPPKKPSKPKKPSK 915
                         410
                  ....*....|....*.
gi 578807234 1013 PQTKLVPSTDLEPGTL 1028
Cdd:PTZ00449  916 PKKPKKPDSAFIPSII 931
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
890-1058 2.24e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 2.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  890 PATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEA-----STTLASKTSQRTRRPRLRTKTTPRPEA 964
Cdd:PRK12323  374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAaparrSPAPEALAAARQASARGPGGAPAPAPA 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPS-PEVPQTKLVPS-TDLEPGTLRTEAPKTMVVTTVL 1042
Cdd:PRK12323  454 PAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPwEELPPEFASPApAQPDAAPAGWVAESIPDPATAD 533
                         170
                  ....*....|....*.
gi 578807234 1043 EPDTFRTKFPETTLAP 1058
Cdd:PRK12323  534 PDDAFETLAPAPAAAP 549
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
921-1238 2.68e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  921 KPVPATVLEP-----VTLRPEASTTLAS----KTSQRTRRPRLRTKTT--PRPEAPESKPVPTAELKPvtlrtETWVTTQ 989
Cdd:PTZ00449  560 KPGPAKEHKPskiptLSKKPEFPKDPKHpkdpEEPKKPKRPRSAQRPTrpKSPKLPELLDIPKSPKRP-----ESPKSPK 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  990 APKTSQRTRRPR----PKT-KTTPSPEVPQTKLVPS------TDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAP 1058
Cdd:PTZ00449  635 RPPPPQRPSSPErpegPKIiKSPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP 714
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1059 KTQRTRRPRPRPKTTSSPEVPQNksvsvtgfEPVVHSTDaPGTTFALTELQTLILKpvtspslEMTESQPVSDVL----- 1133
Cdd:PTZ00449  715 FTTPRPLPPKLPRDEEFPFEPIG--------DPDAEQPD-DIEFFTPPEEERTFFH-------ETPADTPLPDILaeefk 778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1134 -ESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPK-----------------------------------TEVVESIT 1177
Cdd:PTZ00449  779 eEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLPKkrhrldglalsttdlesdagriakdasgkivklkrSKSFDDLT 858
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578807234 1178 YVSEPPETTLETSPLPSQSITLPSPDEpQTEPA------------PKQTPRAPPKPKTSPRPRIPQTQPVPKV 1238
Cdd:PTZ00449  859 TVEEAEEMGAEARKIVVDDDGTEADDE-DTHPPeekhksevrrrrPPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 2.70e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 2.70e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063     3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                  ..
gi 578807234  201 VK 202
Cdd:cd00063    74 VR 75
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1182-1259 2.96e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.46  E-value: 2.96e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578807234 1182 PPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTP 1259
Cdd:PRK14971  391 QPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGP 468
PHA03247 PHA03247
large tegument protein UL36; Provisional
264-587 3.13e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 578807234  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
565-1009 4.92e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 4.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   565 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 643
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   644 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTksvsePVPFETEAPSMtivpttdi 723
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM-----PHSLQTGPSHM-------- 289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   724 ePVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgpiTPGTSSAPTTTTKRTRRPHPKPKTTPHPEVP 803
Cdd:pfam03154  290 -QHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHT----------PPSQSQLQSQQPPREQPLPPAPLSMPHIKPP 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   804 QtklvpatilepvlrteasgTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIK-EAPGTTFVPVTDLEP 882
Cdd:pfam03154  359 P-------------------TTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLStHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   883 VTFRTEIPATT--LATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTlRPEASTTLASKTSQRTRRPRLRTKTTP 960
Cdd:pfam03154  420 QSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG-PPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 578807234   961 RPEAPESKP-VPTAELKPVTLRTETWVTTQAPKT---SQRTRRPRPKTKTTPS 1009
Cdd:pfam03154  499 SVSSSGPVPaAVSCPLPPVQIKEEALDEAEEPESpppPPRSPSPEPTVVNTPS 551
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
972-1368 5.51e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.48  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   972 TAELKPVTLRTETWVTT------QAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPSTDLEPGTLRTEAPKTMVVTTVLEPD 1045
Cdd:pfam17823   64 TAAPAPVTLTKGTSAAHlnstevTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1046 TFRTKFPETTLAPKTQRTRRPRPRPKTTSSpeVPQNKSVSVTGFEPVVHSTDAPgTTFALTELQTLILKPVTSPSLEMTE 1125
Cdd:pfam17823  144 APRAAACRANASAAPRAAIAAASAPHAASP--APRTAASSTTAASSTTAASSAP-TTAASSAPATLTPARGISTAATATG 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1126 SQPVSDVLESVTLSTESPKETIAPAKTdyVYPTAKAPLwpeepkTEVVESITYVSEPPETTLETSPLPSQSITLPSpDEP 1205
Cdd:pfam17823  221 HPAAGTALAAVGNSSPAAGTVTAAVGT--VTPAALATL------AAAAGTVASAAGTINMGDPHARRLSPAKHMPS-DTM 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1206 QTEPAPKQTPRAppkpkTSPRPRIPQTQPVpkvpqrVTAKPKTSPSPEVSyttpapkdVLLPHKPYPEVSQSEPVLQPVT 1285
Cdd:pfam17823  292 ARNPAAPMGAQA-----QGPIIQVSTDQPV------HNTAGEPTPSPSNT--------TLEPNTPKSVASTNLAVVTTTK 352
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  1286 FRFEPPKTTIAPletrgipfIPMISPSPSQEELQTTLAPHRFYTTVRPRTsdkphirPGVKQAPRPSGADRNVSVDSTHP 1365
Cdd:pfam17823  353 AQAKEPSASPVP--------VLHTSMIPEVEATSPTTQPSPLLPTQGAAG-------PGILLAPEQVATEATAGTASAGP 417

                   ...
gi 578807234  1366 TKK 1368
Cdd:pfam17823  418 TPR 420
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
1203-1283 5.74e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.47  E-value: 5.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234 1203 DEPQTEPAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYT----TPAPKDVLLPHKPYPEVSQS 1277
Cdd:PRK14954  374 VRNDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPASAPTpeqqPPVARSAPLPPSPQASAPRN 453

                  ....*.
gi 578807234 1278 EPVLQP 1283
Cdd:PRK14954  454 VASGKP 459
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
726-1039 6.95e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 6.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   726 VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKldfGPITPGTSSAPTTTTKRTRRPH-PKPKTTPHPEVPQ 804
Cdd:pfam17823   86 VTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSA---AQSLPAAIAALPSEAFSAPRAAaCRANASAAPRAAI 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   805 TKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTlspeelqtelvPATIFePVSPIKEAPGTTFVPVTDlepvT 884
Cdd:pfam17823  163 AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSA-----------PATLT-PARGISTAATATGHPAAG----T 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   885 FRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVL---EPVTLRPEASTTLASKTSQRTRRPRLRTK---- 957
Cdd:pfam17823  227 ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTInmgDPHARRLSPAKHMPSDTMARNPAAPMGAQaqgp 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   958 ------------TTPRPE-APESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPSTDLE 1024
Cdd:pfam17823  307 iiqvstdqpvhnTAGEPTpSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPL 386
                          330
                   ....*....|....*
gi 578807234  1025 PGTLRTEAPKTMVVT 1039
Cdd:pfam17823  387 LPTQGAAGPGILLAP 401
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
582-1021 7.40e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 7.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   582 APEPQTLLPSQSTIGPETPGTK---PSTTLAPRKTKRPGRRPRPRPRPKTTpSPEVPKSKPALEPATIQPEPLVPTTASK 658
Cdd:pfam05109  424 APESTTTSPTLNTTGFAAPNTTtglPSSTHVPTNLTAPASTGPTVSTADVT-SPTPAGTTSGASPVTPSPSPRDNGTESK 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   659 PSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEP--DMPPTKSVSEPVPFETEAPSMTIVPTTDiepvtvrteATVTT 736
Cdd:pfam05109  503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTlgKTSPTSAVTTPTPNATSPTPAVTTPTPN---------ATIPT 573
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   737 LApKTSqrtrtrrprpkhkttprpetlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPevpqtklVPATILEPV 816
Cdd:pfam05109  574 LG-KTS----------------------------PTSAVTTPTPNATSPTVGETSPQANTTNHT-------LGGTSSTPV 617
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   817 LRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIkeaPGTTFVPVTDLEPVTFRTEIPATTLAT 896
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHM---PLLTSAHPTGGENITQVTPASTSTHHV 694
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234   897 KTSkRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTK------------TTPRPEA 964
Cdd:pfam05109  695 STS-SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTggkansttggkhTTGHGAR 773
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 578807234   965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPST 1021
Cdd:pfam05109  774 TSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPT 830
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
568-794 7.66e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  568 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 647
Cdd:PRK12323  374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578807234  648 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 727
Cdd:PRK12323  439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578807234  728 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 794
Cdd:PRK12323  519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
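
Each domain hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters report an E-value threshold of 0.01. The following is a minimal Python sketch of how such summary lines could be parsed and checked against that threshold, assuming the page has been saved as plain text. The names `SUMMARY_RE`, `parse_hit_summary`, and `E_VALUE_THRESHOLD` are illustrative assumptions and are not part of any NCBI tool or API.

```python
import re

# Hypothetical pattern for CD-Search summary lines such as:
# "Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.43e-04"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_hit_summary(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }


# Threshold as reported under "Blast search parameters" on this page.
E_VALUE_THRESHOLD = 0.01

# Two summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.43e-04",
    "Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.66e-03",
]

# Keep only lines that parse, then only hits at or below the threshold.
hits = [h for h in (parse_hit_summary(l) for l in lines) if h is not None]
passing = [h for h in hits if h["e_value"] <= E_VALUE_THRESHOLD]
```

Because the page only lists hits that already passed the threshold, every parsed hit is expected to remain in `passing`; the filter is mainly useful when re-screening the same hits with a stricter cutoff.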

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.