Conserved domains on  [gi|530374302|ref|XP_005247358|]

target of Nesh-SH3 isoform X34 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
778-1275 5.51e-16

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.51e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 super family cl33720
large tegument protein UL36; Provisional
478-980 1.76e-12

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.76e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 530374302  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1490-1581 9.04e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.



Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.04e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1490 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1567
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 530374302 1568 LGEGPVSNTVAFST 1581
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
1138-1493 5.13e-07

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.79  E-value: 5.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1138 LSTESPKETIAPAKTDYVYPTAKAPLWPE-EPKTEVV--ESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT 1214
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPpGPGTEAPanESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1215 PRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPyPEVSQSEPAPLETRGipfipmis 1294
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSP-EETARAPSSPPAEPP-------- 194
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1295 psPSQEELQTTLAPHRFYTTVRPRTSDkPHIRPGVKQAPRPSGAdrnvSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPP 1374
Cdd:PHA03307  195 --PSTPPAAASPRPPRRSSPISASASS-PAPAPGRSAADDAGAS----SSDSSSSESSGCGWGPENECPLPRPAPITLPT 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1375 NNVTGKPGSAGiiSSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPlgKPRfkGPHV 1454
Cdd:PHA03307  268 RIWEASGWNGP--SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE--SSR--GAAV 341
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 530374302 1455 RYIQKPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 1493
Cdd:PHA03307  342 SPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA 380
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 5.32e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.



Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.32e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 530374302    200 GVK 202
Cdd:smart00060   73 RVR 75
PHA03247 super family cl33720
large tegument protein UL36; Provisional
264-587 3.07e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 530374302  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
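
The per-hit summary lines in the list above (Pssm-ID, Cd Length, Bit Score, E-value) and the interval lines (for example "778-1275 5.51e-16") follow a regular pattern, so they can be read into structured records. What follows is a minimal Python sketch, not an NCBI tool: the names `HitSummary`, `parse_hit_summary`, and `parse_interval` are hypothetical, and the regular expressions assume the field labels and ordering seen on this page.

```python
import re
from typing import NamedTuple, Optional, Tuple

# Assumption: the summary line always lists these fields in this order,
# with an optional bracketed flag such as "[Multi-domain]".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Assumption: the interval line is "<start>-<end> <e-value>" and nothing else.
INTERVAL_RE = re.compile(r"^\s*(\d+)-(\d+)\s+([\d.eE+-]+)\s*$")


class HitSummary(NamedTuple):
    """Hypothetical record for one per-hit summary line."""
    pssm_id: int
    flag: Optional[str]
    cd_length: int
    bit_score: float
    e_value: float


def parse_hit_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary fields, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm_id"]),
        flag=m["flag"],
        cd_length=int(m["cd_length"]),
        bit_score=float(m["bit_score"]),
        e_value=float(m["e_value"]),
    )


def parse_interval(line: str) -> Optional[Tuple[int, int, float]]:
    """Return (start, end, e_value) for an interval line, or None."""
    m = INTERVAL_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


if __name__ == "__main__":
    line = ("Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  "
            "Bit Score: 84.60  E-value: 5.51e-16")
    print(parse_hit_summary(line))
    print(parse_interval("778-1275 5.51e-16"))
```

Lines that do not match either pattern, such as alignment rows or column headers, return `None`, so a caller can feed every line of the page through both functions and keep only the matches.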
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1275 5.51e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.51e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-980 1.76e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.76e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 530374302  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1490-1581 9.04e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.04e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1490 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1567
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 530374302 1568 LGEGPVSNTVAFST 1581
Cdd:cd00063    80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1491-1571 1.31e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.31e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   1491 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1568
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 530374302   1569 GEG 1571
Cdd:smart00060   81 GEG 83
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1138-1493 5.13e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.79  E-value: 5.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1138 LSTESPKETIAPAKTDYVYPTAKAPLWPE-EPKTEVV--ESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT 1214
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPpGPGTEAPanESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1215 PRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPyPEVSQSEPAPLETRGipfipmis 1294
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSP-EETARAPSSPPAEPP-------- 194
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1295 psPSQEELQTTLAPHRFYTTVRPRTSDkPHIRPGVKQAPRPSGAdrnvSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPP 1374
Cdd:PHA03307  195 --PSTPPAAASPRPPRRSSPISASASS-PAPAPGRSAADDAGAS----SSDSSSSESSGCGWGPENECPLPRPAPITLPT 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1375 NNVTGKPGSAGiiSSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPlgKPRfkGPHV 1454
Cdd:PHA03307  268 RIWEASGWNGP--SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE--SSR--GAAV 341
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 530374302 1455 RYIQKPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 1493
Cdd:PHA03307  342 SPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA 380
fn3 pfam00041
Fibronectin type III domain;
1491-1574 4.15e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1491 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1567
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 530374302  1568 LGEGPVS 1574
Cdd:pfam00041   79 GGEGPPS 85
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 5.32e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.32e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 530374302    200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 5.33e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 530374302   200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1485-1629 6.25e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.61  E-value: 6.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1485 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1562
Cdd:COG3401   324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530374302 1563 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1629
Cdd:COG3401   398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
935-1261 1.46e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   935 PEASTTLASKTSQRT-RRPRLRTKTTPRPEAPESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVP 1013
Cdd:pfam17823   66 APAPVTLTKGTSAAHlNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1014 QTKlVPSTdlePGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----SPEVPQNKSVSVTGF 1089
Cdd:pfam17823  146 RAA-ACRA---NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApatlTPARGISTAATATGH 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1090 EPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESP-KETIAPAKTDYVYPTAKAPLWPEEP 1168
Cdd:pfam17823  222 PAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPhARRLSPAKHMPSDTMARNPAAPMGA 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1169 KTEVVESITYVSEPPETTlETSPLPSQSITLPSPDEPQ-----------TEPAPKQTPRAPPKP--KTSPRPRI----PQ 1231
Cdd:pfam17823  302 QAQGPIIQVSTDQPVHNT-AGEPTPSPSNTTLEPNTPKsvastnlavvtTTKAQAKEPSASPVPvlHTSMIPEVeatsPT 380
                          330       340       350
                   ....*....|....*....|....*....|
gi 530374302  1232 TQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 1261
Cdd:pfam17823  381 TQPSPLLPTQGAAGPGILLAPEQVATEATA 410
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 2.62e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 2.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063     3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                  ..
gi 530374302  201 VK 202
Cdd:cd00063    74 VR 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
264-587 3.07e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 530374302  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
1087-1281 3.38e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 42.14  E-value: 3.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1087 TGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPE 1166
Cdd:COG3266   159 EEQLLLLALQDIQGTLQALGAVAALLGLRKAEEALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAE 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1167 EPKTEVVESITYV----SEPPETTLETSPLPSQsitLPSPDEPQTEPAPKQTPRAPPKPKTSPrpripQTQPVPKVPQRV 1242
Cdd:COG3266   239 VLTARLVLLLLIIgsalKAPSQASSASAPATTS---LGEQQEVSLPPAVAAQPAAAAAAQPSA-----VALPAAPAAAAA 310
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 530374302 1243 TAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAP 1281
Cdd:COG3266   311 AAAPAEAAAPQPTAAKPVVTETAAPAAPAPEAAAAAAAP 349
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1124-1493 9.67e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 9.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1124 TESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESIT-YVSEPPETTLETSP---LPSQSITL 1199
Cdd:pfam03154  158 SDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAphtLIQQTPTL 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1200 PSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQ--PVPKVPQRVTAKPKTSPSPevsyTTPAPkdvlLPHKPYPEVSQS 1277
Cdd:pfam03154  238 HPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQTGPSHMQHP----VPPQP----FPLTPQSSQSQV 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1278 EPAPLETRGIPFIPMISPSPSQEELQTTLAPHRfyTTVRPRTSDKPHIRPG----VKQAPRPSGADRNVSVDSTHPTKKP 1353
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPRE--QPLPPAPLSMPHIKPPpttpIPQLPNPQSHKHPPHLSGPSPFQMN 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1354 GTRRPPLPPRPTHPRRKPLPPNnvTGKPGSAGIISSGPITTPPLRS---TPRPTGTPLERIETDIKQPTVPASGEELENI 1430
Cdd:pfam03154  388 SNLPPPPALKPLSSLSTHHPPS--AHPPPLQLMPQSQQLPPPPAQPpvlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQH 465
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530374302  1431 TDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1493
Cdd:pfam03154  466 PFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1275 5.51e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 5.51e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247 2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTP-----RPEAPESKPVPtaelkpvtlRTETW 985
Cdd:PHA03247 2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslaDPPPPPPTPEP---------APHAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  986 VT-TQAPKTSQRTRRPRPKTKTTPSPevpqtklvPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTR 1064
Cdd:PHA03247 2716 VSaTPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1065 RPRPRPKTTS--SPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESqPVSDVLESVTLSTES 1142
Cdd:PHA03247 2788 VASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP-LGGSVAPGGDVRRRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1143 PKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPK 1222
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302 1223 TSPRPRIPQTQPVPK------VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1275
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
635-1025 3.78e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.44  E-value: 3.78e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFET 710
Cdd:PHA03247 2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  711 EAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA--PTTTTKRTR 788
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEP 2710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  789 RPHPKPKTTPHPEVPQT--KLVPATILEPVLRTEASGTT--------AAPKVPQRTHRPHPK--PKTTLSPEELQTELVP 856
Cdd:PHA03247 2711 APHALVSATPLPPGPAAarQASPALPAAPAPPAVPAGPAtpggparpARPPTTAGPPAPAPPaaPAAGPPRRLTRPAVAS 2790
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  857 ATIFEPVSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATvlePVTLRPE 936
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG---DVRRRPP 2867
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  937 ASTTlASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTlrtetwvTTQAPKTSQRTRRPRPKTKTTPSPEV---P 1013
Cdd:PHA03247 2868 SRSP-AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP-------QPQAPPPPQPQPQPPPPPQPQPPPPPpprP 2939
                         410
                  ....*....|..
gi 530374302 1014 QTKLVPSTDLEP 1025
Cdd:PHA03247 2940 QPPLAPTTDPAG 2951
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-980 1.76e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.76e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247 2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247 2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247 2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247 2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 530374302  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247 3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1490-1581 9.04e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.12  E-value: 9.04e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1490 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1567
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 530374302 1568 LGEGPVSNTVAFST 1581
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
382-800 1.05e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 1.05e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247 2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247 2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                         410       420       430
                  ....*....|....*....|....*....|....*....
gi 530374302  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 800
Cdd:PHA03247 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PRK10263 PRK10263
DNA translocase FtsK; Provisional
701-1321 4.32e-08

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 58.56  E-value: 4.32e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  701 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 780
Cdd:PRK10263  315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  781 TTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIF 860
Cdd:PRK10263  383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  861 EPVSPIKE--APGTTFVPVTDLEPVTFRTEIPATTlATKTSKRTRPPRPRPKTTPSPQ----APETKPVPATVLEPVTLR 934
Cdd:PRK10263  463 QTEQTYQQpaAQEPLYQQPQPVEQQPVVEPEPVVE-ETKPARPPLYYFEEVEEKRAREreqlAAWYQPIPEPVKEPEPIK 541
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  935 PEASTTLASKTSqrtrrprlrtkttPRPEAPESKPVpTAELKPVTLRTETWVTTQAPKTSQRTR-RPRPKTKTTPSPEVP 1013
Cdd:PRK10263  542 SSLKAPSVAAVP-------------PVEAAAAVSPL-ASGVKKATLATGAAATVAAPVFSLANSgGPRPQVKEGIGPQLP 607
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1014 QtklvpstdlepgtlrteaPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----------SPEVPQNKS 1083
Cdd:PRK10263  608 R------------------PKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGDQynddeidamqQDELARQFA 669
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1084 VSVTGFEPVVHSTDAPGTT--------------FALTELQTLILKPVTSP---SLEMTESQPVSDVLEsvtlstESPKEt 1146
Cdd:PRK10263  670 QTQQQRYGEQYQHDVPVNAedadaaaeaelarqFAQTQQQRYSGEQPAGAnpfSLDDFEFSPMKALLD------DGPHE- 742
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1147 iaPAKTDYVYPTAKaPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQsitlPSPDEPQTEPAPKQTPRAPPKPkTSPR 1226
Cdd:PRK10263  743 --PLFTPIVEPVQQ-PQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQ----PQYQQPQQPVAPQPQYQQPQQP-VAPQ 814
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1227 PRIPQTQPvPKVPQRVTAKPKTSPSpevsyttPAPKDVLLpHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTL 1306
Cdd:PRK10263  815 PQYQQPQQ-PVAPQPQYQQPQQPVA-------PQPQDTLL-HPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTF 885
                         650
                  ....*....|....*
gi 530374302 1307 APHRFYTTVRPRTSD 1321
Cdd:PRK10263  886 ALEQMARLVEARLAD 900
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1491-1571 1.31e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.31e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   1491 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1568
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 530374302   1569 GEG 1571
Cdd:smart00060   81 GEG 83
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1138-1493 5.13e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.79  E-value: 5.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1138 LSTESPKETIAPAKTDYVYPTAKAPLWPE-EPKTEVV--ESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT 1214
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPpGPGTEAPanESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1215 PRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPyPEVSQSEPAPLETRGipfipmis 1294
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSP-EETARAPSSPPAEPP-------- 194
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1295 psPSQEELQTTLAPHRFYTTVRPRTSDkPHIRPGVKQAPRPSGAdrnvSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPP 1374
Cdd:PHA03307  195 --PSTPPAAASPRPPRRSSPISASASS-PAPAPGRSAADDAGAS----SSDSSSSESSGCGWGPENECPLPRPAPITLPT 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1375 NNVTGKPGSAGiiSSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPlgKPRfkGPHV 1454
Cdd:PHA03307  268 RIWEASGWNGP--SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE--SSR--GAAV 341
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 530374302 1455 RYIQKPDNSPCSITDSVKRFPKEEATEGNATSPPQNPPT 1493
Cdd:PHA03307  342 SPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAA 380
PHA03247 PHA03247
large tegument protein UL36; Provisional
960-1455 7.95e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 7.95e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  960 PRPEAPESKPVPTAEL-------KPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQtKLVPSTDLEPgtlrtEA 1032
Cdd:PHA03247 2504 PDPDAPPAPSRLAPAIlpdepvgEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPD-RSVPPPRPAP-----RP 2577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1033 PKTMVVTTVLEPDTfrtkfPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSVSVTGFEPVVHSTDAPGTTFALTELQTLI 1112
Cdd:PHA03247 2578 SEPAVTSRARRPDA-----PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP 2652
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1113 LKPVTSPSLEMTESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPL 1192
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1193 PSQSITlPSPdepqtePAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPqrvtAKPKTSPSPEVSYTTPAPKDVLLPHKPYP 1272
Cdd:PHA03247 2733 PALPAA-PAP------PAVPAGPATPGGPARPARPPTTAGPPAPAPP----AAPAAGPPRRLTRPAVASLSESRESLPSP 2801
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1273 EVSQSEPAPLETRGIPFIPMISPSPsqeelqtTLAPHrfyTTVRPRTSDKPhirPGVKQAPRPSGAdrnvSVDSTHPTKK 1352
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPAG-------PLPPP---TSAQPTAPPPP---PGPPPPSLPLGG----SVAPGGDVRR 2864
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1353 PGTRRPplppRPTHPRRKPLPPNNVTGKPGSAGIISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITD 1432
Cdd:PHA03247 2865 RPPSRS----PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                         490       500
                  ....*....|....*....|...
gi 530374302 1433 FSSSPTRETDPLGKPRFKGPHVR 1455
Cdd:PHA03247 2941 PPLAPTTDPAGAGEPSGAVPQPW 2963
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1115-1447 2.04e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.77  E-value: 2.04e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1115 PVTSPSLEMTESQPVSDVLESvtlstESPKETIAPAKTDYVYPTAKaplwPEEPKTEVVESITYVSEPPEttletspLPS 1194
Cdd:PTZ00449  520 PPKAPGDKEGEEGEHEDSKES-----DEPKEGGKPGETKEGEVGKK----PGPAKEHKPSKIPTLSKKPE-------FPK 583
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1195 QSitlPSPDEPQtEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA--KPKTSPSPEVSYTTPAPKDVLLPHKPYP 1272
Cdd:PTZ00449  584 DP---KHPKDPE-EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESpkSPKRPPPPQRPSSPERPEGPKIIKSPKP 659
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1273 EVSQSEP------------------APLETRGIPFIPMISPSPSQEELQTT----LAPHRFYTTVRPRTSDKPHIRPGVK 1330
Cdd:PTZ00449  660 PKSPKPPfdpkfkekfyddyldaaaKSKETKTTVVLDESFESILKETLPETpgtpFTTPRPLPPKLPRDEEFPFEPIGDP 739
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1331 QAPRPSGADRNVSVDSthptKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITtpPLRSTPRPTGTpler 1410
Cdd:PTZ00449  740 DAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKPPGD---- 809
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 530374302 1411 ietdikQPTVPASGEELENI----TDFSSSPTR-ETDPLGKP 1447
Cdd:PTZ00449  810 ------HPSLPKKRHRLDGLalstTDLESDAGRiAKDASGKI 845
fn3 pfam00041
Fibronectin type III domain;
1491-1574 4.15e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1491 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1567
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 530374302  1568 LGEGPVS 1574
Cdd:pfam00041   79 GGEGPPS 85
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1085-1343 5.33e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.16  E-value: 5.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1085 SVTgfEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESPkeTIAPAKTDYV-YPTAKAPL 1163
Cdd:PRK10263  315 PIT--EPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEP--VIAPAPEGYPqQSQYAQPA 390
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1164 WP-----EEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPK-----PKTSPRPRIPQTQ 1233
Cdd:PRK10263  391 VQyneplQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQqstfaPQSTYQTEQTYQQ 470
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1234 PVPK-----VPQRVTAKPKTSPSPEVSYTTPAPKdvllPHKPYPEVSQSEPAPLETRGIPFIPMisPSPSQEELQTTLAP 1308
Cdd:PRK10263  471 PAAQeplyqQPQPVEQQPVVEPEPVVEETKPARP----PLYYFEEVEEKRAREREQLAAWYQPI--PEPVKEPEPIKSSL 544
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 530374302 1309 HRFYTTVRPRTSDKPHIRP---GVKQAPRPSGADRNVS 1343
Cdd:PRK10263  545 KAPSVAAVPPVEAAAAVSPlasGVKKATLATGAAATVA 582
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
533-980 7.30e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 7.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  533 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVTHTKPAP-EPQTLLPSQSTIGPETPGTKPST 606
Cdd:PTZ00449  533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEFPKDPKHPkDPEEPKKPKRPRSAQRPTRPKSP 612
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  607 TLaprktkrpgrrprprprpktTPSPEVPKSKPALEPATIQPEPLVPTtaskpseRPKTTHRPDAPQIQPGSKPPKQllP 686
Cdd:PTZ00449  613 KL--------------------PELLDIPKSPKRPESPKSPKRPPPPQ-------RPSSPERPEGPKIIKSPKPPKS--P 663
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  687 KPqttaepdmpptksvsepvPFEteaPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPrpetLQTK 766
Cdd:PTZ00449  664 KP------------------PFD---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP----FTTP 718
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  767 LDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLS 846
Cdd:PTZ00449  719 RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS 798
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  847 PEELQTElvpATIFEPVSPIKEAPGTTF-VPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVpa 925
Cdd:PTZ00449  799 PSEHEDK---PPGDHPSLPKKRHRLDGLaLSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKI-- 873
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302  926 tVLEPVTLRPEASTTLASKTSQ----RTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PTZ00449  874 -VVDDDGTEADDEDTHPPEEKHksevRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1184-1278 7.98e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 7.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1184 ETTLETSPLPSQ-SITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 1262
Cdd:PRK14950  357 EALLVPVPAPQPaKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436
                          90
                  ....*....|....*.
gi 530374302 1263 DVLLPHKPYPEVSQSE 1278
Cdd:PRK14950  437 PVDEKPKYTPPAPPKE 452
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
1203-1290 1.50e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 46.47  E-value: 1.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1203 DEPQTEPAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEvSYTTPAPKDVLLPHKPYPEVSQSEPAP 1281
Cdd:PRK14954  374 VRNDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPA-SAPTPEQQPPVARSAPLPPSPQASAPR 452

                  ....*....
gi 530374302 1282 LETRGIPFI 1290
Cdd:PRK14954  453 NVASGKPGV 461
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 5.32e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 5.32e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302    124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 530374302    200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 5.33e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 530374302   200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1485-1629 6.25e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.61  E-value: 6.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1485 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1562
Cdd:COG3401   324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530374302 1563 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1629
Cdd:COG3401   398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1200-1335 7.04e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 7.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1200 PSPDEPQTEPAPKQTPRAPPKPKTSPRPRiPQTQPVPkVPQRVTAKPKTSPSPEVSYTTPAPKDvllphkpypEVSQSEP 1279
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAAPAAAPV-AQAAAAP-APAAAPAAAASAPAAPPAAAPPAPVA---------APAAAAP 434
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 530374302 1280 APLETRGIPFIPMISPSPSQEELQTTLAPHRfyTTVRPRTSDKPHIRPGVKQAPRP 1335
Cdd:PRK14951  435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVR--VAPEPAVASAAPAPAAAPAAARL 488
PRK11633 PRK11633
cell division protein DedD; Provisional
1159-1262 8.22e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 8.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1159 AKAPLWP---EEPKTEVVESITYV--SEPPETTLE-----TSPLPSQSITLPSPDEPQTEPAPKqtPRAPPKPKtsPRPR 1228
Cdd:PRK11633   39 AAIPLVPkpgDRDEPDMMPAATQAlpTQPPEGAAEavragDAAAPSLDPATVAPPNTPVEPEPA--PVEPPKPK--PVEK 114
                          90       100       110
                  ....*....|....*....|....*....|....
gi 530374302 1229 iPQTQPVPKVPQRVTAKPKTSPSPEVSyTTPAPK 1262
Cdd:PRK11633  115 -PKPKPKPQQKVEAPPAPKPEPKPVVE-EKAAPT 146
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1484-1586 9.74e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 43.84  E-value: 9.74e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1484 ATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMTNQTFSTVENLKPNTSYEFQVK 1563
Cdd:COG3401   229 PTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTNGTTYYYRVT 303
                          90       100
                  ....*....|....*....|....
gi 530374302 1564 PKNPLG-EGPVSNTVAFSTESADP 1586
Cdd:COG3401   304 AVDAAGnESAPSNVVSVTTDLTPP 327
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
639-707 1.27e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 1.27e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302  639 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 707
Cdd:PRK14950  362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
935-1261 1.46e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   935 PEASTTLASKTSQRT-RRPRLRTKTTPRPEAPESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVP 1013
Cdd:pfam17823   66 APAPVTLTKGTSAAHlNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1014 QTKlVPSTdlePGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----SPEVPQNKSVSVTGF 1089
Cdd:pfam17823  146 RAA-ACRA---NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApatlTPARGISTAATATGH 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1090 EPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESP-KETIAPAKTDYVYPTAKAPLWPEEP 1168
Cdd:pfam17823  222 PAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPhARRLSPAKHMPSDTMARNPAAPMGA 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1169 KTEVVESITYVSEPPETTlETSPLPSQSITLPSPDEPQ-----------TEPAPKQTPRAPPKP--KTSPRPRI----PQ 1231
Cdd:pfam17823  302 QAQGPIIQVSTDQPVHNT-AGEPTPSPSNTTLEPNTPKsvastnlavvtTTKAQAKEPSASPVPvlHTSMIPEVeatsPT 380
                          330       340       350
                   ....*....|....*....|....*....|
gi 530374302  1232 TQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 1261
Cdd:pfam17823  381 TQPSPLLPTQGAAGPGILLAPEQVATEATA 410
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1103-1334 1.52e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1103 FALTELQTLILKPVTSPSlemtESQPVSDVLESVTLSTE--SPKETIAPAKTDYVYPTAKAPLwPEEPKTEVVESITYVS 1180
Cdd:PRK12323  353 FTMTLLRMLAFRPGQSGG----GAGPATAAAAPVAQPAPaaAAPAAAAPAPAAPPAAPAAAPA-AAAAARAVAAAPARRS 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1181 EPPEttletsplPSQSITLPSPDEPQTEPAPkqTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPA 1260
Cdd:PRK12323  428 PAPE--------ALAAARQASARGPGGAPAP--APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP 497
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530374302 1261 PKDVLLPHKPYPEVSQSEPAPLETRGIPFI-PMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGVKQAPR 1334
Cdd:PRK12323  498 PWEELPPEFASPAPAQPDAAPAGWVAESIPdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
657-1028 1.74e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  657 SKPSERPKTTHRPDAPQIQPGSKPPKqllpkPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTT 736
Cdd:PTZ00449  537 SKESDEPKEGGKPGETKEGEVGKKPG-----PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRS--AQRPTRP 609
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  737 LAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEV-PQTK-LVPATILE 814
Cdd:PTZ00449  610 KSPKLPELLDIPKSPKRPESPKSPKR--------PPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFdPKFKeKFYDDYLD 681
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  815 PVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVP------ATIFEPV-SPIKEAPGTTFVPVTDLEPVTFRT 887
Cdd:PTZ00449  682 AAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPklprdeEFPFEPIgDPDAEQPDDIEFFTPPEEERTFFH 761
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  888 EIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPvpatvLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTPRpEAPES 967
Cdd:PTZ00449  762 ETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRP-----DSPSEHEDKPPGDHPSLPKKRHRLDGLALSTTDL-ESDAG 835
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  968 KPVPTAELKPVTLR-----------------------------------TETWVTTQAPKTSQRTRRPRPKTKTTPSPEV 1012
Cdd:PTZ00449  836 RIAKDASGKIVKLKrsksfddlttveeaeemgaearkivvdddgteaddEDTHPPEEKHKSEVRRRRPPKKPSKPKKPSK 915
                         410
                  ....*....|....*.
gi 530374302 1013 PQTKLVPSTDLEPGTL 1028
Cdd:PTZ00449  916 PKKPKKPDSAFIPSII 931
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
972-1352 2.06e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 42.64  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   972 TAELKPVTLRTETWVTT------QAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPSTDLEPGTLRTEAPKTMVVTTVLEPD 1045
Cdd:pfam17823   64 TAAPAPVTLTKGTSAAHlnstevTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1046 TFRTKFPETTLAPKTQRTRRPRPRPKTTSSpeVPQNKSVSVTGFEPVVHSTDAPgTTFALTELQTLILKPVTSPSLEMTE 1125
Cdd:pfam17823  144 APRAAACRANASAAPRAAIAAASAPHAASP--APRTAASSTTAASSTTAASSAP-TTAASSAPATLTPARGISTAATATG 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1126 SQPVSDVLESVTLSTESPKETIAPAKTdyVYPTAKAPLwpeepkTEVVESITYVSEPPETTLETSPLPSQSITLPSpDEP 1205
Cdd:pfam17823  221 HPAAGTALAAVGNSSPAAGTVTAAVGT--VTPAALATL------AAAAGTVASAAGTINMGDPHARRLSPAKHMPS-DTM 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1206 QTEPAPKQTPRAppkpkTSPRPRIPQTQPVpkvpqrVTAKPKTSPSPEVSYTTP-APKDVLLPHKPYPEVSQSEPAPLET 1284
Cdd:pfam17823  292 ARNPAAPMGAQA-----QGPIIQVSTDQPV------HNTAGEPTPSPSNTTLEPnTPKSVASTNLAVVTTTKAQAKEPSA 360
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530374302  1285 RGIPfIPMISPSPSQEELQTTLAPHRFYTTVRPRTsdkphirPGVKQAPRPSGADRNVSVDSTHPTKK 1352
Cdd:pfam17823  361 SPVP-VLHTSMIPEVEATSPTTQPSPLLPTQGAAG-------PGILLAPEQVATEATAGTASAGPTPR 420
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
890-1058 2.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 2.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  890 PATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEA-----STTLASKTSQRTRRPRLRTKTTPRPEA 964
Cdd:PRK12323  374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAaparrSPAPEALAAARQASARGPGGAPAPAPA 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPS-PEVPQTKLVPS-TDLEPGTLRTEAPKTMVVTTVL 1042
Cdd:PRK12323  454 PAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPwEELPPEFASPApAQPDAAPAGWVAESIPDPATAD 533
                         170
                  ....*....|....*.
gi 530374302 1043 EPDTFRTKFPETTLAP 1058
Cdd:PRK12323  534 PDDAFETLAPAPAAAP 549
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1158-1351 2.35e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 2.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1158 TAKAPLW--PEEPKTEVVESITYVSEPPETTLETSPLPSqsitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPV 1235
Cdd:PRK10263  327 TTATQSWaaPVEPVTQTPPVASVDVPPAQPTVAWQPVPG-----PQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1236 PkvPQRVTAKPKTSPSPEVSYTTPAPKdvllphkpyPEVSQSEPAPLETRGIPFIPMISPSPSQE-ELQTTLAPHRFYtt 1314
Cdd:PRK10263  402 Q--PQQPYYAPAAEQPAQQPYYAPAPE---------QPAQQPYYAPAPEQPVAGNAWQAEEQQSTfAPQSTYQTEQTY-- 468
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 530374302 1315 VRPRTSDKPHIRPgvKQAPRPSGADRNVSVDSTHPTK 1351
Cdd:PRK10263  469 QQPAAQEPLYQQP--QPVEQQPVVEPEPVVEETKPAR 503
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1182-1290 2.56e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.46  E-value: 2.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1182 PPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAp 1261
Cdd:PRK14971  391 QPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPS- 469
                          90       100
                  ....*....|....*....|....*....
gi 530374302 1262 kdVLLPHKPYPEVSQSEPAPLETRGIPFI 1290
Cdd:PRK14971  470 --TLRPIQEKAEQATGNIKEAPTGTQKEI 496
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 2.62e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 2.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063     3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                  ..
gi 530374302  201 VK 202
Cdd:cd00063    74 VR 75
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
921-1238 2.68e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  921 KPVPATVLEP-----VTLRPEASTTLAS----KTSQRTRRPRLRTKTT--PRPEAPESKPVPTAELKPvtlrtETWVTTQ 989
Cdd:PTZ00449  560 KPGPAKEHKPskiptLSKKPEFPKDPKHpkdpEEPKKPKRPRSAQRPTrpKSPKLPELLDIPKSPKRP-----ESPKSPK 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  990 APKTSQRTRRPR----PKT-KTTPSPEVPQTKLVPS------TDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAP 1058
Cdd:PTZ00449  635 RPPPPQRPSSPErpegPKIiKSPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP 714
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1059 KTQRTRRPRPRPKTTSSPEVPQNksvsvtgfEPVVHSTDaPGTTFALTELQTLILKpvtspslEMTESQPVSDVL----- 1133
Cdd:PTZ00449  715 FTTPRPLPPKLPRDEEFPFEPIG--------DPDAEQPD-DIEFFTPPEEERTFFH-------ETPADTPLPDILaeefk 778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1134 -ESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPK-----------------------------------TEVVESIT 1177
Cdd:PTZ00449  779 eEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLPKkrhrldglalsttdlesdagriakdasgkivklkrSKSFDDLT 858
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530374302 1178 YVSEPPETTLETSPLPSQSITLPSPDEpQTEPA------------PKQTPRAPPKPKTSPRPRIPQTQPVPKV 1238
Cdd:PTZ00449  859 TVEEAEEMGAEARKIVVDDDGTEADDE-DTHPPeekhksevrrrrPPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1184-1329 2.72e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.61  E-value: 2.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1184 ETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA------------KPKTSPS 1251
Cdd:PLN03209  308 ETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPrplspytayedlKPPTSPI 387
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530374302 1252 P-EVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETrgipfIPMISPSPSQEELQTTLAPHRFYTTVRPRTSDKPHIRPGV 1329
Cdd:PLN03209  388 PtPPSSSPASSKSVDAVAKPAEPDVVPSPGSASN-----VPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGV 461
PHA03247 PHA03247
large tegument protein UL36; Provisional
264-587 3.07e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247 2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247 2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                         330
                  ....*....|
gi 530374302  578 HTKPAPEPQT 587
Cdd:PHA03247 2984 PSREAPASST 2993
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
1087-1281 3.38e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 42.14  E-value: 3.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1087 TGFEPVVHSTDAPGTTFALTELQTLILKPVTSPSLEMTESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPE 1166
Cdd:COG3266   159 EEQLLLLALQDIQGTLQALGAVAALLGLRKAEEALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAE 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1167 EPKTEVVESITYV----SEPPETTLETSPLPSQsitLPSPDEPQTEPAPKQTPRAPPKPKTSPrpripQTQPVPKVPQRV 1242
Cdd:COG3266   239 VLTARLVLLLLIIgsalKAPSQASSASAPATTS---LGEQQEVSLPPAVAAQPAAAAAAQPSA-----VALPAAPAAAAA 310
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 530374302 1243 TAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAP 1281
Cdd:COG3266   311 AAAPAEAAAPQPTAAKPVVTETAAPAAPAPEAAAAAAAP 349
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1074-1336 3.99e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 3.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1074 SSPEVPQNKSVSVtgfEPVVHSTDAPGTTFAltelqtlilKPVTSPSLEMTESQPVsdvLESVTLSTESPKETIAPAKTD 1153
Cdd:PRK10263  342 QTPPVASVDVPPA---QPTVAWQPVPGPQTG---------EPVIAPAPEGYPQQSQ---YAQPAVQYNEPLQQPVQPQQP 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1154 YVYPTAKAPLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT---PRAPPKPKTSPRPRIP 1230
Cdd:PRK10263  407 YYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTyqqPAAQEPLYQQPQPVEQ 486
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1231 Q--TQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP-HKPYPEVSQsEPAP----LETRGIPFIPMISPSPSQEEL- 1302
Cdd:PRK10263  487 QpvVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAwYQPIPEPVK-EPEPikssLKAPSVAAVPPVEAAAAVSPLa 565
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 530374302 1303 ----QTTLAPHRFYTTVRPRTS------DKPHIRPGV-KQAPRPS 1336
Cdd:PRK10263  566 sgvkKATLATGAAATVAAPVFSlansggPRPQVKEGIgPQLPRPK 610
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
565-1009 5.72e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   565 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 643
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   644 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTksvsePVPFETEAPSMtivpttdi 723
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM-----PHSLQTGPSHM-------- 289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   724 ePVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgpiTPGTSSAPTTTTKRTRRPHPKPKTTPHPEVP 803
Cdd:pfam03154  290 -QHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHT----------PPSQSQLQSQQPPREQPLPPAPLSMPHIKPP 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   804 QtklvpatilepvlrteasgTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIK-EAPGTTFVPVTDLEP 882
Cdd:pfam03154  359 P-------------------TTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLStHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   883 VTFRTEIPATT--LATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTlRPEASTTLASKTSQRTRRPRLRTKTTP 960
Cdd:pfam03154  420 QSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG-PPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 530374302   961 RPEAPESKP-VPTAELKPVTLRTETWVTTQAPKT---SQRTRRPRPKTKTTPS 1009
Cdd:pfam03154  499 SVSSSGPVPaAVSCPLPPVQIKEEALDEAEEPESpppPPRSPSPEPTVVNTPS 551
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1188-1317 6.38e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.24  E-value: 6.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302 1188 ETSPLPSQSITLPSPdePQTEPAPKqtPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSyTTPAPKDVLLP 1267
Cdd:PRK14951  370 AEAAAPAEKKTPARP--EAAAPAAA--PVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAA-PAAAPAAAPAA 444
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 530374302 1268 HKPYPevSQSEPAPLETRGIPfiPMISPSPSQEELQTTLAPHRFYTTVRP 1317
Cdd:PRK14951  445 VALAP--APPAQAAPETVAIP--VRVAPEPAVASAAPAPAAAPAAARLTP 490
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
726-1039 6.88e-03

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 6.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   726 VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKldfGPITPGTSSAPTTTTKRTRRPH-PKPKTTPHPEVPQ 804
Cdd:pfam17823   86 VTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSA---AQSLPAAIAALPSEAFSAPRAAaCRANASAAPRAAI 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   805 TKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTlspeelqtelvPATIFePVSPIKEAPGTTFVPVTDlepvT 884
Cdd:pfam17823  163 AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSA-----------PATLT-PARGISTAATATGHPAAG----T 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   885 FRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVL---EPVTLRPEASTTLASKTSQRTRRPRLRTK---- 957
Cdd:pfam17823  227 ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTInmgDPHARRLSPAKHMPSDTMARNPAAPMGAQaqgp 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   958 ------------TTPRPE-APESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPSTDLE 1024
Cdd:pfam17823  307 iiqvstdqpvhnTAGEPTpSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPL 386
                          330
                   ....*....|....*
gi 530374302  1025 PGTLRTEAPKTMVVT 1039
Cdd:pfam17823  387 LPTQGAAGPGILLAP 401
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
582-1021 7.26e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 7.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   582 APEPQTLLPSQSTIGPETPGTK---PSTTLAPRKTKRPGRRPRPRPRPKTTpSPEVPKSKPALEPATIQPEPLVPTTASK 658
Cdd:pfam05109  424 APESTTTSPTLNTTGFAAPNTTtglPSSTHVPTNLTAPASTGPTVSTADVT-SPTPAGTTSGASPVTPSPSPRDNGTESK 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   659 PSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEP--DMPPTKSVSEPVPFETEAPSMTIVPTTDiepvtvrteATVTT 736
Cdd:pfam05109  503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTlgKTSPTSAVTTPTPNATSPTPAVTTPTPN---------ATIPT 573
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   737 LApKTSqrtrtrrprpkhkttprpetlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPevpqtklVPATILEPV 816
Cdd:pfam05109  574 LG-KTS----------------------------PTSAVTTPTPNATSPTVGETSPQANTTNHT-------LGGTSSTPV 617
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   817 LRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIkeaPGTTFVPVTDLEPVTFRTEIPATTLAT 896
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHM---PLLTSAHPTGGENITQVTPASTSTHHV 694
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302   897 KTSkRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTK------------TTPRPEA 964
Cdd:pfam05109  695 STS-SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTggkansttggkhTTGHGAR 773
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 530374302   965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPST 1021
Cdd:pfam05109  774 TSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPT 830
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
568-794 7.65e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  568 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 647
Cdd:PRK12323  374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  648 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 727
Cdd:PRK12323  439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530374302  728 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 794
Cdd:PRK12323  519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1124-1493 9.67e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 9.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1124 TESQPVSDVLESVTLSTESPKETIAPAKTDYVYPTAKAPLWPEEPKTEVVESIT-YVSEPPETTLETSP---LPSQSITL 1199
Cdd:pfam03154  158 SDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAphtLIQQTPTL 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1200 PSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQ--PVPKVPQRVTAKPKTSPSPevsyTTPAPkdvlLPHKPYPEVSQS 1277
Cdd:pfam03154  238 HPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQTGPSHMQHP----VPPQP----FPLTPQSSQSQV 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1278 EPAPLETRGIPFIPMISPSPSQEELQTTLAPHRfyTTVRPRTSDKPHIRPG----VKQAPRPSGADRNVSVDSTHPTKKP 1353
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPRE--QPLPPAPLSMPHIKPPpttpIPQLPNPQSHKHPPHLSGPSPFQMN 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530374302  1354 GTRRPPLPPRPTHPRRKPLPPNnvTGKPGSAGIISSGPITTPPLRS---TPRPTGTPLERIETDIKQPTVPASGEELENI 1430
Cdd:pfam03154  388 SNLPPPPALKPLSSLSTHHPPS--AHPPPLQLMPQSQQLPPPPAQPpvlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQH 465
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530374302  1431 TDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1493
Cdd:pfam03154  466 PFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
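
Each alignment row above carries a label, a start coordinate, the aligned residues, and an end coordinate. The following is a minimal sketch, not part of the CD-Search output, of how such a row could be split into those fields and sanity-checked. It assumes that "-" marks a gap and that letters of either case count as residues; the names `AlignmentRow`, `parse_row`, `residue_count`, and `is_consistent` are hypothetical.

```python
import re
from typing import NamedTuple


class AlignmentRow(NamedTuple):
    label: str     # "gi <number>" for the query, "Cdd:<accession>" for the domain model
    start: int     # coordinate of the first residue on this row
    sequence: str  # aligned residues, with "-" for gaps
    end: int       # coordinate of the last residue on this row


# Assumed row layout: label, start, gapped sequence, end, separated by whitespace.
_ROW_RE = re.compile(r"^\s*(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line: str) -> AlignmentRow:
    """Split one alignment row into its four fields."""
    m = _ROW_RE.match(line)
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return AlignmentRow(m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))


def residue_count(row: AlignmentRow) -> int:
    """Count residues, ignoring gap characters."""
    return sum(1 for ch in row.sequence if ch.isalpha())


def is_consistent(row: AlignmentRow) -> bool:
    """Check that the residue count matches the coordinate span of the row."""
    return residue_count(row) == row.end - row.start + 1
```

A row whose residue count disagrees with its coordinates is a hint that the text was damaged during copying, which is worth checking before any alignment is reused.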
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
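
Every hit in this report has a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search used an E-value threshold of 0.01. The sketch below shows one way such a line could be parsed and compared against that threshold. It is an illustration only: `HitSummary`, `parse_summary`, and `passes_threshold` are hypothetical names, and treating a hit exactly at the threshold as passing is an assumption.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Matches the summary line layout seen in this report; the bracketed
# "[Multi-domain]" tag between the first two fields is skipped.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line: str) -> HitSummary:
    """Extract the numeric fields from one hit summary line."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m.group(1)),
        cd_length=int(m.group(2)),
        bit_score=float(m.group(3)),
        e_value=float(m.group(4)),  # float() accepts forms like "9.67e-03"
    )


def passes_threshold(hit: HitSummary, threshold: float = 0.01) -> bool:
    """Assumed rule: a hit is kept when its E-value does not exceed the threshold."""
    return hit.e_value <= threshold


line = "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 9.67e-03"
hit = parse_summary(line)
print(hit.pssm_id, hit.e_value, passes_threshold(hit))
```

All hits listed in this section have E-values between 5.72e-03 and 9.67e-03, so each falls under the 0.01 threshold, though only narrowly.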

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.