Conserved domains on  [gi|1770726339|ref|NP_001352571|]

target of Nesh-SH3 isoform 6 precursor [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
447-867 1.01e-16

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.53  E-value: 1.01e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  447 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 524
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  525 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 604
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  605 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 680
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  681 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 749
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  750 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1770726339  819 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 867
Cdd:PHA03247  2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1108-1199 2.85e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.



Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.85e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1108 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1185
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1770726339 1186 LGEGPVSNTVAFST 1199
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
747-1040 8.62e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 8.62e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  747 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 797
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  798 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 876
Cdd:PHA03247  2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  877 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 953
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  954 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1033
Cdd:PHA03247  2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798

                   ....*..
gi 1770726339 1034 KQPTVPA 1040
Cdd:PHA03247  2799 PSPWDPA 2805
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
117-195 2.34e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.



Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.34e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339   117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 192
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1770726339   193 GVK 195
Cdd:smart00060   73 RVR 75
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
296-571 8.49e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 8.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  296 SDALKTQLAKNETLALPAESKTpeVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTI--LIPQFE 373
Cdd:pfam17823  134 IAALPSEAFSAPRAAACRANAS--AAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPatLTPARG 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  374 LPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP 453
Cdd:pfam17823  212 ISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTM 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  454 SETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTwTTPAPgktqf 532
Cdd:pfam17823  292 ARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPS-ASPVP----- 364
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1770726339  533 islKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 571
Cdd:pfam17823  365 ---VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
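
Each hit in this listing carries a one-line statistics record (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how such a line could be pulled into structured fields. The pattern is inferred only from the lines shown in this listing, and the names STAT_LINE and parse_stat_line are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Pattern inferred from the statistics lines in this listing (an assumption,
# not an official format specification). The bracketed tag is optional.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "tag": match.group("tag"),  # e.g. "Multi-domain", or None when absent
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    sample = "Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 8.49e-04"
    print(parse_stat_line(sample))
```

Lines that do not match the pattern return None, so alignment rows and ruler lines can be passed through the same function and skipped.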
 
Name Accession Description Interval E-value
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1109-1189 5.29e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.46  E-value: 5.29e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  1109 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1186
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1770726339  1187 GEG 1189
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1109-1192 1.86e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 44.33  E-value: 1.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1185
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1770726339 1186 LGEGPVS 1192
Cdd:pfam00041   79 GGEGPPS 85
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
413-834 1.26e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  413 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 484
Cdd:pfam05109  310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  485 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 554
Cdd:pfam05109  390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  555 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 633
Cdd:pfam05109  469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  634 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 711
Cdd:pfam05109  549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  712 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 790
Cdd:pfam05109  626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1770726339  791 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 834
Cdd:pfam05109  706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
925-1204 1.56e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.15  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  925 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1004
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1005 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1081
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1082 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1160
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1770726339 1161 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1204
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1103-1247 2.05e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 45.76  E-value: 2.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1103 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1180
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 1181 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1247
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 2.46e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.25  E-value: 2.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1770726339  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
566-799 8.32e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 8.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  566 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 641
Cdd:NF033839   284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  642 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 721
Cdd:NF033839   362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339  722 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 799
Cdd:NF033839   428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
116-195 9.56e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.79  E-value: 9.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 193
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1770726339  194 VK 195
Cdd:cd00063     74 VR 75
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
759-829 3.48e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 3.48e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339  759 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 829
Cdd:NF033838   418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
PHA03247 PHA03247
large tegument protein UL36; Provisional
257-556 4.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  257 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 336
Cdd:PHA03247  2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  337 RSTKPTTSSALDVSEttlvLSKRTPETLQTILIPQFELPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTS 416
Cdd:PHA03247  2776 AAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  417 DEPEISDSyTATSDRILDSIPPKTSRTLEQPRATLAPSetPFVPQKLEIFTSPEMQPT---TPAPQQTTSIPSTPKRRPR 493
Cdd:PHA03247  2852 LGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRLAR--PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPQ 2928
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339  494 PKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIP--LSPEVTHTKPAPEPQT 556
Cdd:PHA03247  2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfRVPQPAPSREAPASST 2993
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
740-1111 9.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 814
Cdd:pfam03154  189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  815 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 890
Cdd:pfam03154  267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  891 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 969
Cdd:pfam03154  342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  970 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1049
Cdd:pfam03154  410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 1050 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1111
Cdd:pfam03154  470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
447-867 1.01e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.53  E-value: 1.01e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  447 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 524
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  525 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 604
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  605 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 680
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  681 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 749
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  750 TTTTKRTRRPHPKPKTTPHPEV-----------PQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPVrrlarpavsrsTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1770726339  819 PEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLEtrgIPFIPMISPSPSQE 867
Cdd:PHA03247  2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA---VPRFRVPQPAPSRE 2987
PHA03247 PHA03247
large tegument protein UL36; Provisional
437-842 1.21e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 76.52  E-value: 1.21e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  437 PPKTSRTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPK--- 513
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRrra 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  514 ---------ISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPePQTLLPSQSTIGPETPGT-----KPSTTLAP 579
Cdd:PHA03247  2688 arptvgsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL-PAAPAPPAVPAGPATPGGparpaRPPTTAGP 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  580 rkTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQT 659
Cdd:PHA03247  2767 --PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  660 TAEPDMPPTKSVSEPVPFETEAPSMTIVPTtdiepVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTKldf 738
Cdd:PHA03247  2845 PPPPSLPLGGSVAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLArPAVSRSTESFALPPDQPERPPQPQAPPP--- 2916
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  739 gPITPGTSSAPttttkrtRRPHPKPKTTPHPEVPqtklapkQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPS 818
Cdd:PHA03247  2917 -PQPQPQPPPP-------PQPQPPPPPPPRPQPP-------LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
                          410       420
                   ....*....|....*....|....
gi 1770726339  819 PEVSYTTPAPKDVLLPHKPYPEVS 842
Cdd:PHA03247  2982 PAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
604-1031 3.86e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.97  E-value: 3.86e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  604 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQ--------PGSKPPKQLLPKPQTTAEPDMPPTKSV 671
Cdd:PHA03247  2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSarprapvdDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  672 SEPVPFETEAPSMTIVPttdiEPVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTkldfgPITPGTSSA-- 748
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVP----PPERPRDDPAPGRVSrPRRARRLGRAAQASSPPQRPRRRAARP-----TVGSLTSLAdp 2701
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  749 PTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPR-------PRIPQTQPVPKVPQRVT--AKPKTSPSP 819
Cdd:PHA03247  2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpggPARPARPPTTAGPPAPAppAAPAAGPPR 2781
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  820 EVSYTTPAPKDVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELA---- 895
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggd 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  896 -----KTTQAPHRFYTTVRPRTSDKPhiRPVLNRTTT-------RPTRPKPSGMPSGNGVGTGVKQAPRPSGADRnvsvd 963
Cdd:PHA03247  2862 vrrrpPSRSPAAKPAAPARPPVRRLA--RPAVSRSTEsfalppdQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP----- 2934
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1770726339  964 sTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGII-----SSGPITTPPLRSTPRPTGTPLERIET 1031
Cdd:PHA03247  2935 -PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPASSTPPLTGHSLSRVSS 3006
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1108-1199 2.85e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.85e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1108 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1185
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1770726339 1186 LGEGPVSNTVAFST 1199
Cdd:cd00063     80 GGESPPSESVTVTT 93
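
Each hit in this listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, not part of any NCBI tool, of how such a line could be parsed when post-processing a saved copy of this page. The function name `parse_hit_summary` and the field names are hypothetical, and the pattern assumes the layout stays exactly as shown in the summary lines above.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page.
# The bracketed tag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "kind": match.group("kind"),  # None when no bracketed tag is present
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line copied from the FN3 (cd00063) hit above.
example = "Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.85e-10"
hit = parse_hit_summary(example)
```

Collecting the parsed dictionaries for all hits would allow, for instance, sorting by E-value or filtering to hits below a chosen threshold.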
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1109-1189 5.29e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.46  E-value: 5.29e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  1109 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1186
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1770726339  1187 GEG 1189
Cdd:smart00060   81 GEG 83
PRK10263 PRK10263
DNA translocase FtsK; Provisional
670-870 7.07e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.94  E-value: 7.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  670 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 749
Cdd:PRK10263   315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  750 TTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYttpAPK 829
Cdd:PRK10263   383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF---APQ 459
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1770726339  830 DVLLPHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEELQ 870
Cdd:PRK10263   460 STYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1040 8.62e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 8.62e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  747 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT------PR-------------------APPKPKTSPRP----RIP 797
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPvgepvhPRmltwirgleelasddagdpPPPLPPAAPPAapdrSVP 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  798 QTQPVPKVPQ-RVTAKPKTSPSPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRGiPFIPMISPSPSQEELQTTLEET 876
Cdd:PHA03247  2570 PPRPAPRPSEpAVTSRARRPDAPPQSARPRAPVD---DRGDPRGPAPPSPLPPDTHA-PDPPPPSPSPAANEPDPHPPPT 2645
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  877 DQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQAPRP 953
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGP 2725
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  954 SGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIETDI 1033
Cdd:PHA03247  2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESRESL 2798

                   ....*..
gi 1770726339 1034 KQPTVPA 1040
Cdd:PHA03247  2799 PSPWDPA 2805
fn3 pfam00041
Fibronectin type III domain;
1109-1192 1.86e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 44.33  E-value: 1.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1109 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1185
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1770726339 1186 LGEGPVS 1192
Cdd:pfam00041   79 GGEGPPS 85
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
566-939 5.35e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  566 PETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEpATIQPEPlvpTTASKPSERPKTTHRPDAPQIQP 645
Cdd:PTZ00449   511 PEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE-VGKKPGP---AKEHKPSKIPTLSKKPEFPKDPK 586
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  646 GSKPPKQllPK----PQTTAEPDMPPTKSVSE--PVPFETEAPSMTIVPTTDIEPV-TVRTEATVTTLAPKTSQRTRTRR 718
Cdd:PTZ00449   587 HPKDPEE--PKkpkrPRSAQRPTRPKSPKLPEllDIPKSPKRPESPKSPKRPPPPQrPSSPERPEGPKIIKSPKPPKSPK 664
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  719 PRPKHKTTPR---------PETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPhPEVPQTKLAPKQTPRAPPKPK 789
Cdd:PTZ00449   665 PPFDPKFKEKfyddyldaaAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLP-PKLPRDEEFPFEPIGDPDAEQ 743
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  790 TSP----------RPRIPQTQPVPKVPQRVTAKPKTspsPEVSYTTPAPKDvllPHKPYPEVSQSEPAPLETRgiPFIPM 859
Cdd:PTZ00449   744 PDDiefftppeeeRTFFHETPADTPLPDILAEEFKE---EDIHAETGEPDE---AMKRPDSPSEHEDKPPGDH--PSLPK 815
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  860 ISPSPSQEELQTTLEETD------QSTQEPFTTKIPRT-TELAKTTQAPH---------------------------RFY 905
Cdd:PTZ00449   816 KRHRLDGLALSTTDLESDagriakDASGKIVKLKRSKSfDDLTTVEEAEEmgaearkivvdddgteaddedthppeeKHK 895
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1770726339  906 TTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS 939
Cdd:PTZ00449   896 SEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPS 929
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
759-845 6.07e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 6.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  759 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 838
Cdd:PRK14950   366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445

                   ....*..
gi 1770726339  839 PEVSQSE 845
Cdd:PRK14950   446 PPAPPKE 452
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
772-857 1.08e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 46.47  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  772 PQTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEvSYTTPAPKDVLLPHKPYPEVSQSEPAPLE 850
Cdd:PRK14954   376 NDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPA-SAPTPEQQPPVARSAPLPPSPQASAPRNV 454

                   ....*..
gi 1770726339  851 TRGIPFI 857
Cdd:PRK14954   455 ASGKPGV 461
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
601-819 1.16e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.46  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  601 PEVPKSKPALEPATIQPEPLVPTTASKPSER--PK---TTHRPDAPQIQ-PGSKPPKQLLPKPQTTAEPDMPPTKSVSEP 674
Cdd:PLN03209   328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEepPQpkaVVPRPLSPYTAyEDLKPPTSPIPTPPSSSPASSKSVDAVAKP 407
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  675 VPFETEAPSMTIVPTTDIEPVTVRTE--------ATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTklDFGPITPGTS 746
Cdd:PLN03209   408 AEPDVVPSPGSASNVPEVEPAQVEAKktrplspyARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP--DTAPATAATD 485
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  747 SA-PTTTTKRTRRPHP-----KPKTTPHPEVPQTKLAPKQTPRAPP----KPKTSPRPRIPQTQPVPK----VPQRVTAK 812
Cdd:PLN03209   486 AAaPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKvgnsAPPTALADEQHHAQPKPRplspYTMYEDLK 565

                   ....*..
gi 1770726339  813 PKTSPSP 819
Cdd:PLN03209   566 PPTSPTP 572
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
413-834 1.26e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  413 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 484
Cdd:pfam05109  310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  485 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 554
Cdd:pfam05109  390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  555 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 633
Cdd:pfam05109  469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  634 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 711
Cdd:pfam05109  549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  712 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 790
Cdd:pfam05109  626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1770726339  791 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 834
Cdd:pfam05109  706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
925-1204 1.56e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.15  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  925 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1004
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1005 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1081
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1082 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1160
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1770726339 1161 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1204
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
625-1023 2.00e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 2.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  625 ASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPS--MTIVPTTDIEPVTVRTEAT 702
Cdd:PHA03307    17 GGEFFPRPPATPGDAADDLLSGS--QGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPgpGTEAPANESRSTPTWSLST 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  703 VTTLAPKTSQRTRTRRPRPKhKTTPRPETlqtkldfgPITPGTSSAPttttkrtrrPHPKPKTTPHPEVPQTKLAPKQTP 782
Cdd:PHA03307    95 LAPASPAREGSPTPPGPSSP-DPPPPTPP--------PASPPPSPAP---------DLSEMLRPVGSPGPPPAASPPAAG 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  783 RAPPKPKTSPRPRIPQTQPVPKVPQrvTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS--QSEPAPLETRGIPFIPMI 860
Cdd:PHA03307   157 ASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISasASSPAPAPGRSAADDAGA 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  861 SPSPSqeelqttLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPvlnRTTTRPTRPKPSGMPSG 940
Cdd:PHA03307   235 SSSDS-------SSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASS---SSSPRERSPSPSPSSPG 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  941 NGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIISSGPITTPPLRSTPR 1020
Cdd:PHA03307   305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384

                   ...
gi 1770726339 1021 PTG 1023
Cdd:PHA03307   385 PTR 387
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
737-822 2.04e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.04  E-value: 2.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  737 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPriPQTQPVPKVPQRVTAKPKTS 816
Cdd:PRK12270    35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA--AAAPAAPPAAAAAAAPAAAA 112

                   ....*.
gi 1770726339  817 PSPEVS 822
Cdd:PRK12270   113 VEDEVT 118
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1103-1247 2.05e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 45.76  E-value: 2.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339 1103 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1180
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339 1181 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1247
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
117-195 2.34e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.34e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339   117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 192
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1770726339   193 GVK 195
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
116-195 2.46e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.25  E-value: 2.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1770726339  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
681-864 4.60e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 4.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  681 APSMTIVPTTDIEPVTVRTEAT--VTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTtttkrtrr 758
Cdd:PRK12323   380 APVAQPAPAAAAPAAAAPAPAAppAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA-------- 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  759 phPKPKTTPHPEVPqtklAPKQTPRAPPKPKTSPRPR---IPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPH 835
Cdd:PRK12323   452 --PAPAAAPAAAAR----PAAAGPRPVAAAAAAAPARaapAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES 525
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1770726339  836 KPYPEVSQSEP-----------APLETRGIPFIPMISPSP 864
Cdd:PRK12323   526 IPDPATADPDDafetlapapaaAPAPRAAAATEPVVAPRP 565
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
534-870 4.72e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 4.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  534 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 612
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  613 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPT--------KSVSEPVPFE----TE 680
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMphslqtgpSHMQHPVPPQpfplTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  681 APSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPH 760
Cdd:pfam03154  303 QSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPS 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  761 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQR--VTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 838
Cdd:pfam03154  383 PFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1770726339  839 PEVSQSEPAPLETRGiPFIPMISPSPSQEELQ 870
Cdd:pfam03154  463 PQHPFVPGGPPPITP-PSGPPTSTSSAMPGIQ 493
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
608-676 4.95e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.42  E-value: 4.95e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339  608 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 676
Cdd:PRK14950   362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
438-805 6.85e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.30  E-value: 6.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  438 PKTSRTLEQPRATLAP-----SETPFVPQKLEIFTSPE-----MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSA 507
Cdd:PTZ00449   597 PKRPRSAQRPTRPKSPklpelLDIPKSPKRPESPKSPKrppppQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFY 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  508 GTITPKISKSPEpTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPepqtlLPSQSTIGPETPGTKPSTTLAPRktkrpgr 587
Cdd:PTZ00449   677 DDYLDAAAKSKE-TKTTVVLDESFESILKETLPETPGTPFTTPRP-----LPPKLPRDEEFPFEPIGDPDAEQ------- 743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  588 rprprPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPqTTAEPDMPP 667
Cdd:PTZ00449   744 -----PDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKP-PGDHPSLPK 815
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  668 TKSVSE-----PVPFETEAPSMTIVPTTdiEPVTVRTEATVTTLApktsqrtrtrrprpkhkttprpeTLQTKLDFGPIT 742
Cdd:PTZ00449   816 KRHRLDglalsTTDLESDAGRIAKDASG--KIVKLKRSKSFDDLT-----------------------TVEEAEEMGAEA 870
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1770726339  743 PGTSSAPTTTTKRTRRPHPkPKTTPHPEVPQTKlaPKQTPRAPPKPKTSPRPRIPQTQPVPKV 805
Cdd:PTZ00449   871 RKIVVDDDGTEADDEDTHP-PEEKHKSEVRRRR--PPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
566-799 8.32e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 8.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  566 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 641
Cdd:NF033839   284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  642 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 721
Cdd:NF033839   362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1770726339  722 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 799
Cdd:NF033839   428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
296-571 8.49e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 8.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  296 SDALKTQLAKNETLALPAESKTpeVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTI--LIPQFE 373
Cdd:pfam17823  134 IAALPSEAFSAPRAAACRANAS--AAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPatLTPARG 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  374 LPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP 453
Cdd:pfam17823  212 ISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTM 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  454 SETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTwTTPAPgktqf 532
Cdd:pfam17823  292 ARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPS-ASPVP----- 364
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1770726339  533 islKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 571
Cdd:pfam17823  365 ---VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
PRK11633 PRK11633
cell division protein DedD; Provisional
740-829 8.86e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.30  E-value: 8.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPqtkLAPKQTPRAPPKPKtsPRPRiPQTQPVPKVPQRVTAKPKTSPSP 819
Cdd:PRK11633    64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP---VEPEPAPVEPPKPK--PVEK-PKPKPKPQQKVEAPPAPKPEPKP 137
                           90
                   ....*....|
gi 1770726339  820 EVSyTTPAPK 829
Cdd:PRK11633   138 VVE-EKAAPT 146
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
436-794 9.53e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 9.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  436 IPPKTSRTLEQPRATLAP-SETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKI 514
Cdd:pfam03154  182 SPPSPPPPGTTQAATAGPtPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV 261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  515 SKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAprKTKRPGRRPRPRPR 594
Cdd:pfam03154  262 SPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI--HTPPSQSQLQSQQP 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  595 PKTTPSPEVPKSKPALEPATIQPEPLVPTTASKpsERPKTTHRPDAPQIQPGSKPPKQLlpKPQTTAEPDMPPTksvSEP 674
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSH--KHPPHLSGPSPFQMNSNLPPPPAL--KPLSSLSTHHPPS---AHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  675 VPFETEAPSMTIVPTTDIEPVTVRTEATVTTLA----PKTSQRTRTRRPRPKHKTTPRPETLQTKldfgPITPGTSSAPT 750
Cdd:pfam03154  413 PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAshppTSGLHQVPSQSPFPQHPFVPGGPPPITP----PSGPPTSTSSA 488
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1770726339  751 TTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT----------PRAPPKPKTSPRP 794
Cdd:pfam03154  489 MPGIQPPSSASVSSSGPVPAAVSCPLPPVQIkeealdeaeePESPPPPPRSPSP 542
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
116-195 9.56e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.79  E-value: 9.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 193
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1770726339  194 VK 195
Cdd:cd00063     74 VR 75
PRK10263 PRK10263
DNA translocase FtsK; Provisional
736-868 9.58e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 9.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  736 LDFGPITP--GTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSP-RPRIPQTQPV----PKVPQR 808
Cdd:PRK10263   736 LDDGPHEPlfTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPqQPVAPQPQYQqpqqPVAPQP 815
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1770726339  809 VTAKPKTSPSPEVSY------TTPAPKDVLLpHKPYPEVSQSEPAPLETRGIPFIPMISPSPSQEE 868
Cdd:PRK10263   816 QYQQPQQPVAPQPQYqqpqqpVAPQPQDTLL-HPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
PHA03378 PHA03378
EBNA-3B; Provisional
607-817 2.18e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 2.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  607 KPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQ---PGSKPPKQ----LLPKPQTTAE-------------PDMP 666
Cdd:PHA03378   575 QPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQshiPETSAPRQwpmpLRPIPMRPLRmqpitfnvlvfptPHQP 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  667 PTKSVSEPVPFETEAPSMTIVPTT---------DIEPVTVRTEATVTT-LAPKTSQRTRTRRPRPKHKTTPRPETLQTKL 736
Cdd:PHA03378   655 PQVEITPYKPTWTQIGHIPYQPSPtgantmlpiQWAPGTMQPPPRAPTpMRPPAAPPGRAQRPAAATGRARPPAAAPGRA 734
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  737 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTS 816
Cdd:PHA03378   735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAA 814

                   .
gi 1770726339  817 P 817
Cdd:PHA03378   815 P 815
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
537-763 2.41e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  537 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 616
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  617 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 696
Cdd:PRK12323   439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1770726339  697 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 763
Cdd:PRK12323   519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
740-828 2.85e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.10  E-value: 2.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTklAPKQTPRAPPKPKTSPRPRiPQTQPVPKVPQRVTAKPKTSPSP 819
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPR-PVAPPVPHTPESAPKLTRAAIPV 438

                   ....*....
gi 1770726339  820 EVSYTTPAP 828
Cdd:PRK14950   439 DEKPKYTPP 447
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
761-848 3.09e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 41.76  E-value: 3.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  761 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPripQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPE 840
Cdd:COG3266    265 SAPATTSLGEQQEVSLPPAVAAQPAAAAAAQPSA---VALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTETAAPAAPAPE 341

                   ....*...
gi 1770726339  841 VSQSEPAP 848
Cdd:COG3266    342 AAAAAAAP 349
PRK10263 PRK10263
DNA translocase FtsK; Provisional
509-743 3.20e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 3.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  509 TITPKIskSPEPTWTTPAPGKTQfislkpkiplsPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRR 588
Cdd:PRK10263   368 TGEPVI--APAPEGYPQQSQYAQ-----------PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  589 PRPRPRPKTTPSPEVPKSKPALEP-ATIQPEPLV--PTTASKPSERPKTTHRPDAPQIQPG------SKPP--------- 650
Cdd:PRK10263   435 APAPEQPVAGNAWQAEEQQSTFAPqSTYQTEQTYqqPAAQEPLYQQPQPVEQQPVVEPEPVveetkpARPPlyyfeevee 514
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  651 ------KQLLPKPQTTAEPDMPPtksvsEPVPFETEAPSMTIVPTTDIEPVTVRTEATV--TTLAPKTSQRTRTRRPRPK 722
Cdd:PRK10263   515 krarerEQLAAWYQPIPEPVKEP-----EPIKSSLKAPSVAAVPPVEAAAAVSPLASGVkkATLATGAAATVAAPVFSLA 589
                          250       260
                   ....*....|....*....|.
gi 1770726339  723 HKTTPRPetlQTKLDFGPITP 743
Cdd:PRK10263   590 NSGGPRP---QVKEGIGPQLP 607
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
759-829 3.48e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 3.48e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1770726339  759 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 829
Cdd:NF033838   418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
743-848 4.22e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 4.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  743 PGTSSAPTTTTKRTRRPHPKPKTT----PHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVtakpktsPS 818
Cdd:PRK14951   375 PAEKKTPARPEAAAPAAAPVAQAAaapaPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV-------AL 447
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1770726339  819 PEVSYTTPAPKDVLLPHK--PYPEVSQSEPAP 848
Cdd:PRK14951   448 APAPPAQAAPETVAIPVRvaPEPAVASAAPAP 479
PHA03247 PHA03247
large tegument protein UL36; Provisional
257-556 4.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  257 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 336
Cdd:PHA03247  2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  337 RSTKPTTSSALDVSEttlvLSKRTPETLQTILIPQFELPLSTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTS 416
Cdd:PHA03247  2776 AAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  417 DEPEISDSyTATSDRILDSIPPKTSRTLEQPRATLAPSetPFVPQKLEIFTSPEMQPT---TPAPQQTTSIPSTPKRRPR 493
Cdd:PHA03247  2852 LGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRLAR--PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPQ 2928
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339  494 PKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIP--LSPEVTHTKPAPEPQT 556
Cdd:PHA03247  2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfRVPQPAPSREAPASST 2993
PRK10905 PRK10905
cell division protein DamX; Validated
611-712 4.63e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 40.69  E-value: 4.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  611 EPATIQP---EPLVPTTASKPSERPKTTHRPDAPQIQPGSKppkqllpKPQTTAE-PDMPPTKSVSEPVPFETEAPSMTI 686
Cdd:PRK10905   126 EPATVAPvrnGNASRQTAKTQTAERPATTRPARKQAVIEPK-------KPQATAKtEPKPVAQTPKRTEPAAPVASTKAP 198
                           90       100
                   ....*....|....*....|....*.
gi 1770726339  687 VPTTDIEPVTVRTEATVTTLAPKTSQ 712
Cdd:PRK10905   199 AATSTPAPKETATTAPVQTASPAQTT 224
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
600-794 5.91e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 5.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  600 SPEVPKSKPALEPATIQPEPLVPTTASKPsERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVS-EPVPFE 678
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDgWPAKAG 674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  679 TEAPSMTIVPTTDIEPVTVRTEATvttlapktsqrtrtRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRR 758
Cdd:PRK07764   675 GAAPAAPPPAPAPAAPAAPAGAAP--------------AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1770726339  759 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRP 794
Cdd:PRK07764   741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
Orthopox_A5L pfam06193
Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The ...
773-893 8.83e-03

Orthopoxvirus A5L protein-like; This family includes several Orthopoxvirus A5L proteins. The vaccinia virus WR A5L open reading frame (corresponding to open reading frame A4L in vaccinia virus Copenhagen) encodes an immunodominant late protein found in the core of the vaccinia virion. The A5 protein appears to be required for the immature virion to form the brick-shaped intracellular mature virion.


Pssm-ID: 283778 [Multi-domain]  Cd Length: 216  Bit Score: 39.19  E-value: 8.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  773 QTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPAPLET 851
Cdd:pfam06193   61 DNMLAASRQPIQPLQPTIHITPiEIPTPAPTPKPRQQELGTPSTSCTQNSDASIACSTDIVTPPQPPIVATVCTPTPTDG 140
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1770726339  852 RgIPFIPMISPSP---SQEELQTTLEETDQSTQEPFTTKIPRTTE 893
Cdd:pfam06193  141 R-ICTTADQNPNPgatIQKELDNMALKDLMSSVEKDMCQLQAESE 184
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
740-1111 9.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  740 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAP----KQTPRAPPKPKTSPRPRI-PQTQPVPkvPQRVTAKPK 814
Cdd:pfam03154  189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLqPMTQPPP--PSQVSPQPL 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  815 TSPSPEVSYT-TPAPKDVLLPHKPYPEVSQSEPAPL---ETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPfttkiPR 890
Cdd:pfam03154  267 PQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-----PR 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  891 TTELAKttqAPHRFYTTVRPRTSDKPHI-RPVLNRTTTRPTRPKPSGMPSgngvgtgvkQAPRPSGADRNVSVDSTHPTK 969
Cdd:pfam03154  342 EQPLPP---APLSMPHIKPPPTTPIPQLpNPQSHKHPPHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPPS 409
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  970 KPGTRRPPLPPRPTHPRRKPLPPNnVTGKPGSAGIISSGPittPPLRSTPRPTGTPLErietdiKQPTVPasgeelenit 1049
Cdd:pfam03154  410 AHPPPLQLMPQSQQLPPPPAQPPV-LTQSQSLPPPAASHP---PTSGLHQVPSQSPFP------QHPFVP---------- 469
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1770726339 1050 dfsSSPTRETDPLGKPRFKGPHVRYIQKPDNSPCSITDSV-------------KRFPKEEATEGNATSPPQNPPT 1111
Cdd:pfam03154  470 ---GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVpaavscplppvqiKEEALDEAEEPESPPPPPRSPS 541
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
413-801 9.30e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.28  E-value: 9.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  413 PTTSDEPEISDSYTATSDriLDSIPPKTSRTLEQPratLAPSETPF---VPQKLEIFTSPEMQPTTPAPQQTTSIPSTPK 489
Cdd:pfam05109  455 PTNLTAPASTGPTVSTAD--VTSPTPAGTTSGASP---VTPSPSPRdngTESKAPDMTSPTSAVTTPTPNATSPTPAVTT 529
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  490 RRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTqFISLKPKIPLSPEVTHTKPAPEPQTLLPS-QSTIGPET 568
Cdd:pfam05109  530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPTSAVTTPTPNATSPTVGETSpQANTTNHT 608
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  569 PGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKP---SERPktTHRPDAPQIQP 645
Cdd:pfam05109  609 LGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHP--TGGENITQVTP 686
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  646 GSKPPKQL-----LPKPQTTAEPDMPPTKSVS-EPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRP 719
Cdd:pfam05109  687 ASTSTHHVstsspAPRPGTTSQASGPGNSSTStKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKH 766
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1770726339  720 RPKH--KTTPRPETlqtklDFGpitpGTSSAPTTTTKRtrrphpkpkTTPHPEVPQTKLAPKQTPRAPP-KPKTSPRPRI 796
Cdd:pfam05109  767 TTGHgaRTSTEPTT-----DYG----GDSTTPRTRYNA---------TTYLPPSTSSKLRPRWTFTSPPvTTAQATVPVP 828

                   ....*
gi 1770726339  797 PQTQP 801
Cdd:pfam05109  829 PTSQP 833
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
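
The E-value threshold of 0.01 explains why hits with E-values of 9.05e-03 and 9.30e-03 still appear in the list. As a minimal sketch, assuming the per-hit summary lines keep the "Pssm-ID ... Cd Length ... Bit Score ... E-value" layout shown on this page, the Python below parses those lines and keeps hits at or below a threshold. The function names, the regular expression, and the use of "<=" rather than "<" are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Summary lines copied verbatim from two hits on this page.
SUMMARY_LINES = [
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.05e-03",
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.28  E-value: 9.30e-03",
]

# Hypothetical pattern for the summary line; the bracketed tag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed <=)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]


if __name__ == "__main__":
    for hit in hits_below_threshold(SUMMARY_LINES):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Both hits sit just under the cutoff, so tightening the threshold slightly would drop them. This is a reminder that these matches to low-complexity, proline-rich families are marginal.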

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.