|
Name |
Accession |
Description |
Interval |
E-value |
| COG5139 |
COG5139 |
Uncharacterized conserved protein [Function unknown]; |
533-724 |
2.02e-27 |
|
Uncharacterized conserved protein [Function unknown];
Pssm-ID: 227468 Cd Length: 397 Bit Score: 115.18 E-value: 2.02e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 533 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 612
Cdd:COG5139 126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 613 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 692
Cdd:COG5139 206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
|
170 180 190
....*....|....*....|....*....|..
gi 2462575019 693 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMN 724
Cdd:COG5139 285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMD 315
|
|
| Med26 |
pfam08711 |
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ... |
641-694 |
2.35e-12 |
|
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species [1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includes the C-terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be a Pol II transcription elongation factor and interacts with Spt6 and Spt5.
Pssm-ID: 462573 [Multi-domain] Cd Length: 52 Bit Score: 62.15 E-value: 2.35e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 641 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 694
Cdd:pfam08711 1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
11-371 |
1.18e-08 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 58.77 E-value: 1.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 11 QDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSverhseNETSDREdglpkghhvTDSENDEPlnlNASDSESE 90
Cdd:NF033609 546 EQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGS------DSTSDSG---------SDSASDSD---SASDSDSA 607
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 91 ElhrqkDSDSESEERAeppASDSENEDVNQHGSDSESEETrklpgSDSENEELLNGHASDSENEDVGKHPASDSEIEELQ 170
Cdd:NF033609 608 S-----DSDSASDSDS---ASDSDSASDSDSASDSDSASD-----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 674
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 171 KSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKP 250
Cdd:NF033609 675 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSD 747
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 251 RISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKgpaSDSETEDASRHKQ 330
Cdd:NF033609 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 824
|
330 340 350 360
....*....|....*....|....*....|....*....|.
gi 2462575019 331 KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSE 371
Cdd:NF033609 825 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSE 865
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
16-334 |
3.52e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.55 E-value: 3.52e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 16 EDDGGATPVqderdSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNA--SDSESEELH 93
Cdd:PHA03307 28 PGDAADDLL-----SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSlsTLAPASPAR 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 94 RQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI----EEL 169
Cdd:PHA03307 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlsspEET 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSeneePPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElPK 249
Cdd:PHA03307 183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PL 257
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 250 PRIS--------------DSESEDPPRHQASDSENEELPKP--------------RISDSESEDPPRNQASDSENEELPK 301
Cdd:PHA03307 258 PRPApitlptriweasgwNGPSSRPGPASSSSSPRERSPSPspsspgsgpapsspRASSSSSSSRESSSSSTSSSSESSR 337
|
330 340 350
....*....|....*....|....*....|....
gi 2462575019 302 PR-VSDSESEGPQKGPASDSETEDASRHKQKPES 334
Cdd:PHA03307 338 GAaVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
9-300 |
8.09e-06 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.52 E-value: 8.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 9 SDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNASDS 87
Cdd:NF033609 608 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSD 687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 88 ESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI 166
Cdd:NF033609 688 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 167 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEE 246
Cdd:NF033609 767 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSD 845
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 247 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 300
Cdd:NF033609 846 SDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
206-533 |
1.32e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 206 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEELPKpriSDSESED 285
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSASDSD---SASDSDS 612
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 286 PPRNQASDSENEELPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 365
Cdd:NF033609 613 ASDSDSASDSDSASDSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 366 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 445
Cdd:NF033609 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 446 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 525
Cdd:NF033609 772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851
|
....*...
gi 2462575019 526 KHMDFLSD 533
Cdd:NF033609 852 SDSDSESD 859
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
156-476 |
2.62e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 47.60 E-value: 2.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 156 VGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESeelpkpqVSDSESEEPP 235
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDS-------ASDSDSASDS 616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 236 RHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKG 315
Cdd:NF033609 617 DSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 316 PA-SDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKv 394
Cdd:NF033609 697 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 395 AKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKEL-SDKKNEEKDLFGSDSESGNE 473
Cdd:NF033609 776 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSD 855
|
...
gi 2462575019 474 EEN 476
Cdd:NF033609 856 SES 858
|
|
| Ebola_NP |
pfam05505 |
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ... |
47-298 |
6.74e-05 |
|
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.
Pssm-ID: 398905 Cd Length: 717 Bit Score: 46.27 E-value: 6.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 47 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 126
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 127 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 204
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 205 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 271
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
|
250 260
....*....|....*....|....*..
gi 2462575019 272 ELPKPRISDSESEDPPRNQASDSENEE 298
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
9-321 |
7.42e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.44 E-value: 7.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 9 SDQDPPEEDDGGAtpvQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSE 88
Cdd:NF033609 602 SDSDSASDSDSAS---DSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 89 SEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:NF033609 679 DSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 755
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENeel 247
Cdd:NF033609 756 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSD--- 825
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 248 pkpriSDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 321
Cdd:NF033609 826 -----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
16-208 |
1.55e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 45.37 E-value: 1.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAG 786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 96 KDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDSEIEELQKSP 173
Cdd:TIGR00927 787 EDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EKGVDGGGGSD 859
|
170 180 190
....*....|....*....|....*....|....*
gi 2462575019 174 ASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 208
Cdd:TIGR00927 860 GGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
272-520 |
6.40e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.45 E-value: 6.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 272 ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDSETEDASRHKQ-----KPESDDDSDRENK 343
Cdd:TIGR00927 656 EGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEhegetEAEGTEDEGEIET 734
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 344 GEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekvakrkaavlSDSEDEEKASAKKSRVVSDAD 423
Cdd:TIGR00927 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE-----------------DEDEGEIQAGEDGEMKGDEGA 797
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 424 DSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADifGESGDEEEEEFTGFNQEDLEE 503
Cdd:TIGR00927 798 EGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG--GGGSDGGDSEEEEEEEEEEEE 875
|
250
....*....|....*..
gi 2462575019 504 EKGETQVKEAEDSDSDD 520
Cdd:TIGR00927 876 EEEEEEEEEEEEEENEE 892
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| COG5139 |
COG5139 |
Uncharacterized conserved protein [Function unknown]; |
533-724 |
2.02e-27 |
|
Uncharacterized conserved protein [Function unknown];
Pssm-ID: 227468 Cd Length: 397 Bit Score: 115.18 E-value: 2.02e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 533 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 612
Cdd:COG5139 126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 613 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 692
Cdd:COG5139 206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
|
170 180 190
....*....|....*....|....*....|..
gi 2462575019 693 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMN 724
Cdd:COG5139 285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMD 315
|
|
| Med26 |
pfam08711 |
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ... |
641-694 |
2.35e-12 |
|
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species [1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includes the C-terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be a Pol II transcription elongation factor and interacts with Spt6 and Spt5.
Pssm-ID: 462573 [Multi-domain] Cd Length: 52 Bit Score: 62.15 E-value: 2.35e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 641 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 694
Cdd:pfam08711 1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
11-371 |
1.18e-08 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 58.77 E-value: 1.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 11 QDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSverhseNETSDREdglpkghhvTDSENDEPlnlNASDSESE 90
Cdd:NF033609 546 EQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGS------DSTSDSG---------SDSASDSD---SASDSDSA 607
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 91 ElhrqkDSDSESEERAeppASDSENEDVNQHGSDSESEETrklpgSDSENEELLNGHASDSENEDVGKHPASDSEIEELQ 170
Cdd:NF033609 608 S-----DSDSASDSDS---ASDSDSASDSDSASDSDSASD-----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 674
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 171 KSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKP 250
Cdd:NF033609 675 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSD 747
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 251 RISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKgpaSDSETEDASRHKQ 330
Cdd:NF033609 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 824
|
330 340 350 360
....*....|....*....|....*....|....*....|.
gi 2462575019 331 KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSE 371
Cdd:NF033609 825 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSE 865
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
16-334 |
3.52e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.55 E-value: 3.52e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 16 EDDGGATPVqderdSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNA--SDSESEELH 93
Cdd:PHA03307 28 PGDAADDLL-----SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSlsTLAPASPAR 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 94 RQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI----EEL 169
Cdd:PHA03307 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlsspEET 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSeneePPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElPK 249
Cdd:PHA03307 183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PL 257
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 250 PRIS--------------DSESEDPPRHQASDSENEELPKP--------------RISDSESEDPPRNQASDSENEELPK 301
Cdd:PHA03307 258 PRPApitlptriweasgwNGPSSRPGPASSSSSPRERSPSPspsspgsgpapsspRASSSSSSSRESSSSSTSSSSESSR 337
|
330 340 350
....*....|....*....|....*....|....
gi 2462575019 302 PR-VSDSESEGPQKGPASDSETEDASRHKQKPES 334
Cdd:PHA03307 338 GAaVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
9-300 |
8.09e-06 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.52 E-value: 8.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 9 SDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNASDS 87
Cdd:NF033609 608 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSD 687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 88 ESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI 166
Cdd:NF033609 688 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 167 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEE 246
Cdd:NF033609 767 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSD 845
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 247 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 300
Cdd:NF033609 846 SDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
84-357 |
1.30e-05 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 48.63 E-value: 1.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 84 ASDSESEELHRQKDSDSESEEraeppasDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSEN--EDVGKHPA 161
Cdd:PRK08581 28 DDPQKDSTAKTTSHDSKKSND-------DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNiiDFIYKNLP 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 162 SDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpQVSDSESEEPPRHQASD 241
Cdd:PRK08581 101 QTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIK-----NDTDTQSSKQDKADNQK 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 242 SENEELPKPRISDSESEDPPRHQASDSEneelpkpriSDSESEDPPRNQASDSENEEL---PKPRVSDSESEGPQKGPAS 318
Cdd:PRK08581 176 APSSNNTKPSTSNKQPNSPKPTQPNQSN---------SQPASDDTANQKSSSKDNQSMsdsALDSILDQYSEDAKKTQKD 246
|
250 260 270
....*....|....*....|....*....|....*....
gi 2462575019 319 DSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSD 357
Cdd:PRK08581 247 YASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFEND 285
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
206-533 |
1.32e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 206 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEELPKpriSDSESED 285
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSASDSD---SASDSDS 612
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 286 PPRNQASDSENEELPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 365
Cdd:NF033609 613 ASDSDSASDSDSASDSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 366 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 445
Cdd:NF033609 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 446 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 525
Cdd:NF033609 772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851
|
....*...
gi 2462575019 526 KHMDFLSD 533
Cdd:NF033609 852 SDSDSESD 859
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
96-338 |
1.39e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 1.39e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 96 KDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEellNGHASDSENEDvGKHPASDSEieelqkSPAS 175
Cdd:PHA03307 59 AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG---SPTPPGPSSPD-PPPPTPPPA------SPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 176 DSETeDALKPQISDSESEEPPRHQASDSENEEPPKPR-------------MSDSESEELPKPQVSDSESEEPPRHQASDS 242
Cdd:PHA03307 129 SPAP-DLSEMLRPVGSPGPPPAASPPAAGASPAAVASdaassrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRPP 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 243 EneelPKPRISDSESED---PPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS----ESEGPQKG 315
Cdd:PHA03307 208 R----RSSPISASASSPapaPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEasgwNGPSSRPG 283
|
250 260
....*....|....*....|...
gi 2462575019 316 PASDSETEDASRHKQKPESDDDS 338
Cdd:PHA03307 284 PASSSSSPRERSPSPSPSSPGSG 306
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
55-333 |
2.56e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 47.76 E-value: 2.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 55 ENETSDREDGLPKGHHVTDSENDEPlnlnaSDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLP 134
Cdd:PTZ00449 500 EEEDSDKHDEPPEGPEASGLPPKAP-----GDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPT 574
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 135 GSDSENEELLNGHASDSENEDVGKHPASDS--------EIEELQKSPASDSETEDALKPQisdseSEEPPRHQASDSENE 206
Cdd:PTZ00449 575 LSKKPEFPKDPKHPKDPEEPKKPKRPRSAQrptrpkspKLPELLDIPKSPKRPESPKSPK-----RPPPPQRPSSPERPE 649
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 207 EPPKPRMSDS-ESEELP-----KPQVSDSESEEPPRHQASDSeNEELPKPRISDSESEDPPRHQASDSENEELPKPRISD 280
Cdd:PTZ00449 650 GPKIIKSPKPpKSPKPPfdpkfKEKFYDDYLDAAAKSKETKT-TVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRD 728
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575019 281 SES-----EDPPRNQASDSENEELP---KPRVSDSESEGPQKG-PASDSETEDASRHKQKPE 333
Cdd:PTZ00449 729 EEFpfepiGDPDAEQPDDIEFFTPPeeeRTFFHETPADTPLPDiLAEEFKEEDIHAETGEPD 790
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
156-476 |
2.62e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 47.60 E-value: 2.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 156 VGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESeelpkpqVSDSESEEPP 235
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDS-------ASDSDSASDS 616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 236 RHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKG 315
Cdd:NF033609 617 DSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 316 PA-SDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKv 394
Cdd:NF033609 697 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 395 AKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKEL-SDKKNEEKDLFGSDSESGNE 473
Cdd:NF033609 776 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSD 855
|
...
gi 2462575019 474 EEN 476
Cdd:NF033609 856 SES 858
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
82-341 |
2.64e-05 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 47.73 E-value: 2.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 82 LNASDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPA 161
Cdd:PTZ00108 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 162 SDSEIEELQKSPASDSETEDALKpqiSDSESEEPPRHQASDSENEEPPKPRMSDSESEELPK--PQVSDSESEEPPrhqa 239
Cdd:PTZ00108 1214 DNKKSNSSGSDQEDDEEQKTKPK---KSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKnaPKRVSAVQYSPP---- 1286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 240 sdSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASD 319
Cdd:PTZ00108 1287 --PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364
|
250 260
....*....|....*....|..
gi 2462575019 320 SETEDASRHKQKPESDDDSDRE 341
Cdd:PTZ00108 1365 SEDDDDSEVDDSEDEDDEDDED 1386
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
192-348 |
3.80e-05 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 47.26 E-value: 3.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 192 SEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRhqaSDSENE 271
Cdd:PHA03321 427 SRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPAAAPS---PATYYT 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 272 EL--PKPRIsdsesedPPRNQASDSENEELPKPRVSDSESEGP-------QKGPASDSETEDASRHKQK-PESDDDSDRE 341
Cdd:PHA03321 504 RMggGPPRL-------PPRNRATETLRPDWGPPAAAPPEQMEDpylepddDRFDRRDGAAAAATSHPREaPAPDDDPIYE 576
|
....*..
gi 2462575019 342 NKGEDTE 348
Cdd:PHA03321 577 GVSDSEE 583
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
88-461 |
4.56e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.06 E-value: 4.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 88 ESEELHRQKDSDSESEERAEPPASDSENEDvnqhGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:PTZ00121 1392 KADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSEN--- 244
Cdd:PTZ00121 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEkkk 1547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 245 -EELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSET 322
Cdd:PTZ00121 1548 aDELKKAEeLKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 323 EDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVL 402
Cdd:PTZ00121 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462575019 403 SDSEDEEKASAKKSRVVSDADDSDSDAVSDKS--GKREKTIASDSEEEAGKELSDKKNEEK 461
Cdd:PTZ00121 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAeeDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
|
|
| Ebola_NP |
pfam05505 |
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ... |
47-298 |
6.74e-05 |
|
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.
Pssm-ID: 398905 Cd Length: 717 Bit Score: 46.27 E-value: 6.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 47 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 126
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 127 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 204
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 205 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 271
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
|
250 260
....*....|....*....|....*..
gi 2462575019 272 ELPKPRISDSESEDPPRNQASDSENEE 298
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
9-321 |
7.42e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.44 E-value: 7.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 9 SDQDPPEEDDGGAtpvQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSE 88
Cdd:NF033609 602 SDSDSASDSDSAS---DSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 89 SEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:NF033609 679 DSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 755
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENeel 247
Cdd:NF033609 756 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSD--- 825
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 248 pkpriSDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 321
Cdd:NF033609 826 -----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
16-208 |
1.55e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 45.37 E-value: 1.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAG 786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 96 KDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDSEIEELQKSP 173
Cdd:TIGR00927 787 EDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EKGVDGGGGSD 859
|
170 180 190
....*....|....*....|....*....|....*
gi 2462575019 174 ASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 208
Cdd:TIGR00927 860 GGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
120-379 |
3.91e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.83 E-value: 3.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 120 QHGSDSESEETRKLPGSDSENEELLNGHAS-DSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRH 198
Cdd:TIGR00927 639 EHTGERTGEEGERPTEAEGENGEESGGEAEqEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 199 QASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAsdSENEELpkpri 278
Cdd:TIGR00927 719 GETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA--GEDGEM----- 791
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 279 sdsESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSETEDAsrhKQKPESDDDSDRENKGEDTEMQNDSFHSDS 358
Cdd:TIGR00927 792 ---KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQ---ELNAENQGEAKQDEKGVDGGGGSDGGDSEE 865
|
250 260
....*....|....*....|.
gi 2462575019 359 HMDRKKFHSSDSEEEEHKKQK 379
Cdd:TIGR00927 866 EEEEEEEEEEEEEEEEEEEEE 886
|
|
| ECM1 |
pfam05782 |
Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic ... |
208-317 |
4.65e-04 |
|
Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalized thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.
Pssm-ID: 461739 Cd Length: 518 Bit Score: 43.29 E-value: 4.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 208 PPKPR---MSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE 284
Cdd:pfam05782 9 PPQTRglpVDHPDTSQHDPPFEGQSEVQPPPSQEAIPVQEEELPPPQLPVEKKVDPPLPQEAIPLQEELPPPQLPIEQKE 88
|
90 100 110
....*....|....*....|....*....|....*...
gi 2462575019 285 -DPPRNQASD----SENEELPKPRVSDSESEGPQKGPA 317
Cdd:pfam05782 89 iDPPFPQQEEitpsKQREEKPAPLVGQGHPEPESWNPA 126
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
139-346 |
5.02e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 43.54 E-value: 5.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 139 ENEELLNGHASDSENEDVGKHPASDSEIEELQKSPASDSEtedALKPqiSDSESEEPPRHQAsdsENEEPPKPRMSDSES 218
Cdd:PRK08691 373 ENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASA---AAMP--SEGKTAGPVSNQE---NNDVPPWEDAPDEAQ 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 219 EELPKPQVSD------SESEEPPRHQ-----ASDSENE----ELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSE 282
Cdd:PRK08691 445 TAAGTAQTSAksiqtaSEAETPPENQvsknkAADNETDaplsEVPSENpIQATPNDEAVETETFAHEAPAEPFYGYGFPD 524
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575019 283 SEDPPRnqasdsENEELPKPrvsDSESEGPQKGPASDSETEDASRHKQKPESDDDSDRENKGED 346
Cdd:PRK08691 525 NDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFSTEN 579
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
16-272 |
5.40e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.45 E-value: 5.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 96 KDSDSESEERAEPPASDSENEDVNQHG-SDSESEETRKLPGSDSENEELLNGHASDSENEDVGK-HPASDSEIEELQKSP 173
Cdd:TIGR00927 719 GETEAEGTEDEGEIETGEEGEEVEDEGeGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEiQAGEDGEMKGDEGAE 798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 174 ASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEpprhQASDSENEELPKPRIS 253
Cdd:TIGR00927 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS----DGGDSEEEEEEEEEEE 874
|
250
....*....|....*....
gi 2462575019 254 DSESEDPPRHQaSDSENEE 272
Cdd:TIGR00927 875 EEEEEEEEEEE-EEEENEE 892
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
272-520 |
6.40e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.45 E-value: 6.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 272 ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDSETEDASRHKQ-----KPESDDDSDRENK 343
Cdd:TIGR00927 656 EGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEhegetEAEGTEDEGEIET 734
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 344 GEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekvakrkaavlSDSEDEEKASAKKSRVVSDAD 423
Cdd:TIGR00927 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE-----------------DEDEGEIQAGEDGEMKGDEGA 797
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 424 DSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADifGESGDEEEEEFTGFNQEDLEE 503
Cdd:TIGR00927 798 EGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG--GGGSDGGDSEEEEEEEEEEEE 875
|
250
....*....|....*..
gi 2462575019 504 EKGETQVKEAEDSDSDD 520
Cdd:TIGR00927 876 EEEEEEEEEEEEEENEE 892
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
103-268 |
7.81e-04 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 42.66 E-value: 7.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 103 EERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIEElqKSPASDSETE-D 181
Cdd:PRK13108 293 DEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGE--STPAVEETSEaD 370
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 182 ALKPQISDSESEEPPRHQASDS-ENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElpkPRISDSESEDP 260
Cdd:PRK13108 371 IEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAG---PGDDPAEPDGI 447
|
....*...
gi 2462575019 261 PRHQASDS 268
Cdd:PRK13108 448 RRQDDFSS 455
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3-325 |
1.06e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 1.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 3 TLLPRGSDQDPPEEDDGGATPVQDER--DSGSDGEDDVNEqHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPL 80
Cdd:PHA03307 94 TLAPASPAREGSPTPPGPSSPDPPPPtpPPASPPPSPAPD-LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 81 NLNASDSESEElhRQKDSDSE----SEERAEPPASDSENEDVNQHGSDS------ESEETRKLPGSDSENEELLNGHASD 150
Cdd:PHA03307 173 ALPLSSPEETA--RAPSSPPAepppSTPPAAASPRPPRRSSPISASASSpapapgRSAADDAGASSSDSSSSESSGCGWG 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 151 SENEDVGKHPASDSEIEELQKSPASDSETEDAL--KPQISDSESEEPPRHQASDSEnEEPPKPRMSDSESEELPKPQVSD 228
Cdd:PHA03307 251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGpaSSSSSPRERSPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSST 329
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 229 SESEEPPRHQASDS--ENEELPKPRiSDSESEDPPRHQASDSENEELPKPRISDSESEdpPRNQASDSENEELPKPRVSD 306
Cdd:PHA03307 330 SSSSESSRGAAVSPgpSPSRSPSPS-RPPPPADPSSPRKRPRPSRAPSSPAASAGRPT--RRRARAAVAGRARRRDATGR 406
|
330
....*....|....*....
gi 2462575019 307 SESEGPQKGPASDSETEDA 325
Cdd:PHA03307 407 FPAGRPRPSPLDAGAASGA 425
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
170-350 |
1.12e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSE---NEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQA--SDSEN 244
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFalpPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPL 2943
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 245 EELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDP-PRNQASDSENEELPKPRVSDSES-----EGPQKGPAS 318
Cdd:PHA03247 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAPASSTPPLTGHSLSRVSSWASslalhEETDPPPVS 3023
|
170 180 190
....*....|....*....|....*....|..
gi 2462575019 319 DSETEDASRHKQkpESDDDSDRENKGEDTEMQ 350
Cdd:PHA03247 3024 LKQTLWPPDDTE--DSDADSLFDSDSERSDLE 3053
|
|
| PHA03169 |
PHA03169 |
hypothetical protein; Provisional |
153-339 |
1.70e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 41.49 E-value: 1.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 153 NEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESE 232
Cdd:PHA03169 49 PAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 233 EpprhqaSDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGP 312
Cdd:PHA03169 129 E------SPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSP 202
|
170 180
....*....|....*....|....*..
gi 2462575019 313 QKGPASDSETEDASRHKQKPESDDDSD 339
Cdd:PHA03169 203 PPQSPPDEPGEPQSPTPQQAPSPNTQQ 229
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
13-231 |
2.06e-03 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 41.31 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 13 PPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKghhvtDSENDEPLNLNASDSESeel 92
Cdd:PRK08581 110 KNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQ-----SSKQDKADNQKAPSSNN--- 181
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 93 hrQKDSDSESEERAEPPASDSENedvnqhGSDSESEETRKLPGSDSENEEllnghASDSENEDVGKHPASDSEIEE---L 169
Cdd:PRK08581 182 --TKPSTSNKQPNSPKPTQPNQS------NSQPASDDTANQKSSSKDNQS-----MSDSALDSILDQYSEDAKKTQkdyA 248
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575019 170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELpkPQVSDSES 231
Cdd:PRK08581 249 SQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETG--PSLSNNDD 308
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
6-251 |
2.16e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 41.52 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 6 PRGSDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDrEDGLPKGHHVTDSENDEPLNLNAS 85
Cdd:TIGR00927 669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEA-EGTEDEGEIETGEEGEEVEDEGEG 747
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 86 DSESEELHRQKDSDSESEERAEPPASDSENEDVN--QHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASD 163
Cdd:TIGR00927 748 EAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGeiQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTE 827
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 164 SEIEELQKSPASDSETEDALKPQISDSESEEpprhQASDSENEEPPKPRMSDSESEElpkpqvsdsESEEPPRHQASDSE 243
Cdd:TIGR00927 828 VKDETGEQELNAENQGEAKQDEKGVDGGGGS----DGGDSEEEEEEEEEEEEEEEEE---------EEEEEEEEENEEPL 894
|
....*...
gi 2462575019 244 NEELPKPR 251
Cdd:TIGR00927 895 SLEWPETR 902
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
72-293 |
2.26e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 41.57 E-value: 2.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 72 TDSENDEPLNLNASDSESEELHRQKDSDSESeerAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNghASDS 151
Cdd:PTZ00108 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNK---KSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSED--NDEF 1260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 152 ENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEepPKPRMSDSESEELPKPQVSDSES 231
Cdd:PTZ00108 1261 SSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLE--GSLAALKKKKKSEKKTARKKKSK 1338
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575019 232 EEPPRHQASDSENEELPKPRISDSESEDpprhqasDSENEELpkpriSDSESEDPPRNQASD 293
Cdd:PTZ00108 1339 TRVKQASASQSSRLLRRPRKKKSDSSSE-------DDDDSEV-----DDSEDEDDEDDEDDD 1388
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
193-348 |
2.71e-03 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 40.73 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 193 EEPPRHQASDSENEEPPKPrmsdsESEELPKPQVSDSESEEPPRHQASDSENE---ELPKPRISDSESEDPPRHQASDSE 269
Cdd:PRK13108 280 EAPGALRGSEYVVDEALER-----EPAELAAAAVASAASAVGPVGPGEPNQPDdvaEAVKAEVAEVTDEVAAESVVQVAD 354
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 270 NEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS-ESEGPQKGPASDSETEDASR----HKQKPESDDDSDRENKG 344
Cdd:PRK13108 355 RDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEpevpEKAAPIPDPAKPDELAV 434
|
....
gi 2462575019 345 EDTE 348
Cdd:PRK13108 435 AGPG 438
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
79-293 |
4.55e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 40.46 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 79 PLNLNASDS----ESEELHRQKDSDSESEERAEPP----ASDSENEDVNQHGSDSESEETRKL-PGSDSENEELLNGHAS 149
Cdd:PRK08691 360 PLAAASCDAnaviENTELQSPSAQTAEKETAAKKPqprpEAETAQTPVQTASAAAMPSEGKTAgPVSNQENNDVPPWEDA 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 150 DSENEDV-GKHPASDSEIE---ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQ 225
Cdd:PRK08691 440 PDEAQTAaGTAQTSAKSIQtasEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYG 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462575019 226 VSDSESEEPPRhqasdsENEELPKPrisDSESEDPPRHQASDSENEELPKpRISDSESEDPPRNQASD 293
Cdd:PRK08691 520 YGFPDNDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAG-GIGGNNTPSAPPPEFST 577
|
|
| PTZ00482 |
PTZ00482 |
membrane-attack complex/perforin (MACPF) Superfamily; Provisional |
10-181 |
7.75e-03 |
|
membrane-attack complex/perforin (MACPF) Superfamily; Provisional
Pssm-ID: 240433 [Multi-domain] Cd Length: 844 Bit Score: 39.85 E-value: 7.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 10 DQDPpeeDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREdglpkghhvtDSENDEPLNlNASDSES 89
Cdd:PTZ00482 87 DDDD---DDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN----------DQTNDFDQD-DSSNSQT 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 90 EELHRQKDSDSESEERAEPPASDSENE-DVNQHGSDSESEETRKLPGSDSENEELLNghaSDSENEDVGkhpASDSEIEE 168
Cdd:PTZ00482 153 DQGLKQSVNLSSAEKLIEEKKGQTENTfKFYNFGNDGEEAAAKDGGKSKSSDPGPLN---DSDGQGDDG---DPESAEED 226
|
170
....*....|...
gi 2462575019 169 LQKSPASDSETED 181
Cdd:PTZ00482 227 KAASNTRAAYTKA 239
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
168-382 |
7.87e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 39.64 E-value: 7.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 168 ELQKSPASDSETED--ALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElpkpqVSDSESEEPPRHQASDSENE 245
Cdd:PTZ00108 1168 KLRKPKLKKKEKKKkkSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG-----SDQEDDEEQKTKPKKSSVKR 1242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575019 246 ELPKPRISDSESEDPPRHQASDSENEELPKPRISDSesedPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSETEDA 325
Cdd:PTZ00108 1243 LKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRV----SAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSL 1318
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462575019 326 SRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDS 382
Cdd:PTZ00108 1319 AALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDD 1375
|
|
|