|
Name |
Accession |
Description |
Interval |
E-value |
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
859-2081 |
6.57e-33 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 140.67 E-value: 6.57e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210 294 DTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLT 373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210 374 TAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSG 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210 454 TTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTG 533
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210 534 GDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGT 613
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210 614 ITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNA 693
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLST-SISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFG-----GTLSTSVSFGGSSGANAGFGGT 1332
Cdd:COG3210 694 ATGGTLNNAGNTLTISTgSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGvtitsGNAGTLSIGLTANTTASGTTLT 773
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1333 LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLN 1412
Cdd:COG3210 774 LANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1413 GSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210 854 SDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLN 1572
Cdd:COG3210 934 GGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSG 1013
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1573 NSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAIS 1652
Cdd:COG3210 1014 AIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHT 1093
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1653 NSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNST 1732
Cdd:COG3210 1094 LGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAAT 1173
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1733 NCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFS 1812
Cdd:COG3210 1174 TTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGT 1253
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1813 GGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFS 1892
Cdd:COG3210 1254 GDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNS 1333
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1893 VSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGP 1972
Cdd:COG3210 1334 GGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGN 1413
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:COG3210 1414 NGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAG 1493
|
1210 1220
....*....|....*....|....*....
gi 50593518 2053 GGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1494 VAGATASNGGTSTGAGGTAGGTTAEVAKA 1522
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
596-756 |
2.13e-24 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 103.12 E-value: 2.13e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 50593518 720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1434-1791 |
3.72e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 98.15 E-value: 3.72e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849 220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849 290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849 441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 50593518 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849 504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1494-1832 |
4.41e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 98.15 E-value: 4.41e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849 298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849 376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849 432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
|
330 340 350
....*....|....*....|....*....|....*
gi 50593518 1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849 509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1454-1821 |
9.74e-18 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 90.45 E-value: 9.74e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849 290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849 364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849 440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 50593518 1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849 513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1592-1958 |
7.29e-17 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 87.37 E-value: 7.29e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849 296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849 434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849 500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1574-1873 |
1.01e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 86.98 E-value: 1.01e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849 298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518 1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1222-1547 |
1.41e-15 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 83.13 E-value: 1.41e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849 252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849 318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849 389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849 455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
|
330
....*....|..
gi 50593518 1536 GFGGAISTSASF 1547
Cdd:NF033849 535 GLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1339-1647 |
3.50e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.96 E-value: 3.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849 231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849 301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849 381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849 459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538
|
....*...
gi 50593518 1640 AISTNASF 1647
Cdd:NF033849 539 SISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1356-1702 |
7.74e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.19 E-value: 7.74e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849 220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849 298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849 374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849 443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
|
330 340 350
....*....|....*....|....*....|....*.
gi 50593518 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849 515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1736-2064 |
2.07e-10 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 66.18 E-value: 2.07e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849 216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849 294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849 444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
|
330
....*....|..
gi 50593518 2053 GGLNTSAGFSGG 2064
Cdd:NF033849 524 GGRTSGAGGSMG 535
|
|
| MscS_porin |
pfam12795 |
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ... |
268-445 |
2.01e-09 |
|
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
Pssm-ID: 432790 [Multi-domain] Cd Length: 238 Bit Score: 60.01 E-value: 2.01e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795 10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795 85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
|
170
....*....|....*...
gi 50593518 428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795 161 LKAQIDMLEQELLSNNNR 178
|
|
| auto_AIDA-I |
NF033176 |
autotransporter adhesin AIDA-I; |
1318-1785 |
1.30e-08 |
|
autotransporter adhesin AIDA-I;
Pssm-ID: 380183 [Multi-domain] Cd Length: 1287 Bit Score: 60.44 E-value: 1.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176 72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176 149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176 225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176 300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176 374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518 1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176 451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
262-468 |
5.43e-08 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 57.22 E-value: 5.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372 26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372 106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 50593518 422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372 177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
97-524 |
1.17e-07 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 57.53 E-value: 1.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483 293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483 366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483 443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483 520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483 582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483 659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
250-524 |
7.08e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 54.76 E-value: 7.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 50593518 472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1028-1427 |
1.66e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 50.39 E-value: 1.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849 217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849 284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849 310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849 386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849 463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542
|
....
gi 50593518 1424 NATF 1427
Cdd:NF033849 543 GKSY 546
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
206-525 |
7.28e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.13 E-value: 7.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168 648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168 708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168 855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924
|
....*
gi 50593518 521 SALET 525
Cdd:TIGR02168 925 AQLEL 929
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
869-1089 |
2.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849 310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849 382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849 446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1239-1459 |
2.86e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 45.81 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518 1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967 160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
|
|
| PHA02515 |
PHA02515 |
hypothetical protein; Provisional |
1294-1505 |
3.67e-04 |
|
hypothetical protein; Provisional
Pssm-ID: 107197 [Multi-domain] Cd Length: 508 Bit Score: 45.54 E-value: 3.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515 175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515 252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518 1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515 330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
272-490 |
3.75e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.67 E-value: 3.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609 33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609 113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609 193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
20-243 |
5.51e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 5.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109 442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109 522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518 176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109 599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1632-1860 |
6.99e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 41.14 E-value: 6.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118 133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118 207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118 287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
859-2081 |
6.57e-33 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 140.67 E-value: 6.57e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210 294 DTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLT 373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210 374 TAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSG 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210 454 TTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTG 533
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210 534 GDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGT 613
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210 614 ITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNA 693
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLST-SISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFG-----GTLSTSVSFGGSSGANAGFGGT 1332
Cdd:COG3210 694 ATGGTLNNAGNTLTISTgSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGvtitsGNAGTLSIGLTANTTASGTTLT 773
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1333 LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLN 1412
Cdd:COG3210 774 LANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1413 GSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210 854 SDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLN 1572
Cdd:COG3210 934 GGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSG 1013
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1573 NSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAIS 1652
Cdd:COG3210 1014 AIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHT 1093
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1653 NSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNST 1732
Cdd:COG3210 1094 LGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAAT 1173
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1733 NCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFS 1812
Cdd:COG3210 1174 TTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGT 1253
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1813 GGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFS 1892
Cdd:COG3210 1254 GDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNS 1333
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1893 VSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGP 1972
Cdd:COG3210 1334 GGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGN 1413
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:COG3210 1414 NGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAG 1493
|
1210 1220
....*....|....*....|....*....
gi 50593518 2053 GGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1494 VAGATASNGGTSTGAGGTAGGTTAEVAKA 1522
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
863-2078 |
1.01e-31 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 136.43 E-value: 1.01e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210 289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210 369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210 449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG3210 529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG3210 609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1263 PYSGAGFGGTLSTS---ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDfggtlSTSVSFGGSSGANAGFGGTLNSSTSF 1339
Cdd:COG3210 689 TTLNAATGGTLNNAgntLTISTGSITVTGQIGALANANGDTVTFGNLGT-----GATLTLNAGVTITSGNAGTLSIGLTA 763
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1340 GGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGG 1419
Cdd:COG3210 764 NTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGS 843
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1420 ALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTS--TNFGGALNNSAGFGGAMNTSASFGGALNNSAGF 1497
Cdd:COG3210 844 NTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTgtANAGTLTNLGTTTNAASGNGAVLATVTATGTGG 923
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1498 GGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGF 1577
Cdd:COG3210 924 GGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSG 1003
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1578 GGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDF 1657
Cdd:COG3210 1004 TTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTA 1083
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1658 GGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSG 1737
Cdd:COG3210 1084 QASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASA 1163
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1738 ATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNH 1817
Cdd:COG3210 1164 GDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGS 1243
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1818 NADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNN 1897
Cdd:COG3210 1244 FVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTA 1323
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1898 GNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPG 1977
Cdd:COG3210 1324 TGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGG 1403
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1978 FGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNT 2057
Cdd:COG3210 1404 VTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIG 1483
|
1210 1220
....*....|....*....|.
gi 50593518 2058 SAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210 1484 GTTTGGNGAGVAGATASNGGT 1504
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
859-2081 |
5.80e-31 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 134.12 E-value: 5.80e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210 129 TGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGV 208
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210 209 LANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTA 288
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210 289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210 369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210 449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTS 1338
Cdd:COG3210 529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1339 FGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:COG3210 609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1419 GALNTNATF-----GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN-FGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210 689 TTLNAATGGtlnnaGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLtLNAGVTITSGNAGTLSIGLTANTTAS 768
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNS-AGFGGAISTNATFGGALNNS---AGFGGAISTSASFGGTLNNSASFGGAINTSASFG 1568
Cdd:COG3210 769 GTTLTLANANGNTSAGATLDNAgAEISIDITADGTITAAGTTAinvTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDT 848
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1569 GVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFG 1648
Cdd:COG3210 849 TTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTG 928
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1649 GAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGG 1728
Cdd:COG3210 929 GNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTAST 1008
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1729 SNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTS 1808
Cdd:COG3210 1009 TGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGA 1088
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1809 TDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTG 1888
Cdd:COG3210 1089 GTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTA 1168
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1889 AGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGF 1968
Cdd:COG3210 1169 VAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAG 1248
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1969 GGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTS 2048
Cdd:COG3210 1249 SASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAV 1328
|
1210 1220 1230
....*....|....*....|....*....|...
gi 50593518 2049 TGFGGGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1329 AAVNSGGVNAGGGTINTTAANTGLNGGNGATDS 1361
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
863-2077 |
1.96e-30 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 132.20 E-value: 1.96e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASI--SFGGAPSINSSSGGSSVSFGGAPTTS 1180
Cdd:COG3210 717 GQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASgtTLTLANANGNTSAGATLDNAGAEISI 796
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1181 TSFSGGPCISFGgapcttASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFG 1260
Cdd:COG3210 797 DITADGTITAAG------TTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1261 SSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFG 1340
Cdd:COG3210 871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1341 GAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGA 1420
Cdd:COG3210 951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1421 LNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGA 1500
Cdd:COG3210 1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1501 ISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGA 1580
Cdd:COG3210 1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1581 INTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGA 1660
Cdd:COG3210 1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1661 FSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATS 1740
Cdd:COG3210 1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1741 ANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNAD 1820
Cdd:COG3210 1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1821 FNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNG 1900
Cdd:COG3210 1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGG 1510
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1901 FGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGG 1980
Cdd:COG3210 1511 TAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPT 1590
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1981 PSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAG 2060
Cdd:COG3210 1591 AGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGWAV 1670
|
1210
....*....|....*..
gi 50593518 2061 FSGGPPSTGTGFGGGAS 2077
Cdd:COG3210 1671 DLTDATLAGLGGATTAA 1687
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
889-2078 |
1.92e-29 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 129.12 E-value: 1.92e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 889 ITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG3210 1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 969 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAP 1048
Cdd:COG3210 81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGN 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1049 SISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPST 1128
Cdd:COG3210 161 NTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGV 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1129 STSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSg 1208
Cdd:COG3210 241 ISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAG- 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1209 fgstlcSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTG 1288
Cdd:COG3210 320 ------ITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGN 393
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1289 FGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFS 1368
Cdd:COG3210 394 ASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVT 473
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1369 GVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFG 1448
Cdd:COG3210 474 NSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGAS 553
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1449 GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFG 1528
Cdd:COG3210 554 GTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAG 633
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1529 GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFG-----GAINTSANFGGALTNSAGFGGAIST 1603
Cdd:COG3210 634 LTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATggttgTTLNAATGGTLNNAGNTLTISTGSI 713
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1604 SASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNS 1683
Cdd:COG3210 714 TVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAE 793
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1684 ISFGSAPTTSVSFGGSHSTNLcFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG3210 794 ISIDITADGTITAAGTTAINV-TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAA 872
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1764 GLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGG 1843
Cdd:COG3210 873 TAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNA 952
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1844 ELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGS 1923
Cdd:COG3210 953 GLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASAT 1032
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1924 NTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGG 2003
Cdd:COG3210 1033 GTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTS 1112
|
1130 1140 1150 1160 1170 1180 1190
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518 2004 GFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210 1113 TGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADS 1187
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
859-1958 |
2.76e-29 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 128.73 E-value: 2.76e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210 513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210 593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSAN---------FSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:COG3210 673 GTTGTVTSGATGGTTGTTLNAATGGtlnnagntlTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSG 752
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1090 GAPSTSTSFSTASISFGGApsTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGS 1169
Cdd:COG3210 753 NAGTLSIGLTANTTASGTT--LTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGL 830
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1170 SVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGF 1249
Cdd:COG3210 831 TGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNG 910
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1250 GGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGF 1329
Cdd:COG3210 911 AVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGV 990
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1330 GGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGG 1409
Cdd:COG3210 991 IAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTT 1070
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1410 VLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGamNTSASFGG 1489
Cdd:COG3210 1071 GGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAA--GAGTLTGL 1148
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1490 ALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGG 1569
Cdd:COG3210 1149 VAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTG 1228
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1570 VLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGG 1649
Cdd:COG3210 1229 NTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAG 1308
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1650 AISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGS 1729
Cdd:COG3210 1309 ANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAG 1388
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1730 NSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTST 1809
Cdd:COG3210 1389 NNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGA 1468
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1810 DFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGA 1889
Cdd:COG3210 1469 GGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGA 1548
|
1050 1060 1070 1080 1090 1100
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1890 GFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210 1549 VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSLAEGTNAEYGGTTNVTS 1617
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
957-2081 |
2.72e-27 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 122.18 E-value: 2.72e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG3210 1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSApfcNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLS 1116
Cdd:COG3210 81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAAS---ATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1117 TASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPC 1196
Cdd:COG3210 158 AGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTG 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1197 TTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTS 1276
Cdd:COG3210 238 AGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTA 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1277 ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSS------GANAGFGGTLNSSTSFGGAISTSTGFG 1350
Cdd:COG3210 318 AGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTgtgnggGLTTAGAGTVASTVGTATASTGNASST 397
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGV 1430
Cdd:COG3210 398 TVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAG 477
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1431 LNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGA 1510
Cdd:COG3210 478 NTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTA 557
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1511 LNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGA 1590
Cdd:COG3210 558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1591 LTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFGGT 1670
Cdd:COG3210 638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNN-AGNTLTISTGSITVT 716
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1671 LNTTDFGSTHSNSISFGSAPT-TSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSI 1749
Cdd:COG3210 717 GQIGALANANGDTVTFGNLGTgATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISI 796
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1750 SFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSA 1829
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1830 GFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASF 1909
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1910 NRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGG 1989
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1990 PSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTG 2069
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1130
....*....|..
gi 50593518 2070 TGFGGGASSHGG 2081
Cdd:COG3210 1117 TASKVGGTTTVG 1128
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
859-1958 |
4.47e-27 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 121.41 E-value: 4.47e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210 558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210 638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTG 717
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSG--TPSTSAPFCNTASISFGGAPSTST 1096
Cdd:COG3210 718 QIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTltLANANGNTSAGATLDNAGAEISID 797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1097 SFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGA 1176
Cdd:COG3210 798 ITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASI 877
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1177 PTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTS 1256
Cdd:COG3210 878 TVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAA 957
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1257 VCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSS 1336
Cdd:COG3210 958 SASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTA 1037
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1337 TSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAG 1416
Cdd:COG3210 1038 ATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVT 1117
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1417 FGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAG 1496
Cdd:COG3210 1118 ASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTD 1197
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1497 FGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAG 1576
Cdd:COG3210 1198 LKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNA 1277
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1577 FGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPD 1656
Cdd:COG3210 1278 GATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNG 1357
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1657 FGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFS 1736
Cdd:COG3210 1358 ATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGT 1437
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1737 GATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLN 1816
Cdd:COG3210 1438 GNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTA 1517
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1817 HNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLN 1896
Cdd:COG3210 1518 EVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATL 1597
|
1050 1060 1070 1080 1090 1100
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1897 NGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210 1598 TLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSG 1659
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
596-756 |
2.13e-24 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 103.12 E-value: 2.13e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 50593518 720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1434-1791 |
3.72e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 98.15 E-value: 3.72e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849 220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849 290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849 441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 50593518 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849 504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1494-1832 |
4.41e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 98.15 E-value: 4.41e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849 298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849 376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849 432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
|
330 340 350
....*....|....*....|....*....|....*
gi 50593518 1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849 509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1454-1821 |
9.74e-18 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 90.45 E-value: 9.74e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849 290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849 364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849 440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 50593518 1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849 513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1592-1958 |
7.29e-17 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 87.37 E-value: 7.29e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849 296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849 434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849 500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1574-1873 |
1.01e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 86.98 E-value: 1.01e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849 220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849 298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518 1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1222-1547 |
1.41e-15 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 83.13 E-value: 1.41e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849 252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849 318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849 389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849 455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
|
330
....*....|..
gi 50593518 1536 GFGGAISTSASF 1547
Cdd:NF033849 535 GLGPSISLGKSY 546
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
1221-1763 |
4.05e-14 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 78.27 E-value: 4.05e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFG 1300
Cdd:COG5295 64 AAATAGAGSGGTSATAASSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAA 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1301 ASSSTSSDFGGTLSTSVSFGGSSGANAGFGGT-LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGG 1379
Cdd:COG5295 144 STGGSSAAGGSNTATATGSSTANAATAAAGATsTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGV 223
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1380 AINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGG 1459
Cdd:COG5295 224 NAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGA 303
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGG 1539
Cdd:COG5295 304 ANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATA 383
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1540 AISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGG 1619
Cdd:COG5295 384 AGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAA 463
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1620 AISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGS 1699
Cdd:COG5295 464 NVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAG 543
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 50593518 1700 HSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG5295 544 GGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGG 607
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
1468-1879 |
4.41e-14 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 78.06 E-value: 4.41e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASF 1547
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1548 GGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSagfgGAISTSASF 1627
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGT----GVGGTGAAA 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1628 GGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFG 1707
Cdd:COG3468 157 AGGGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1708 GAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGP 1787
Cdd:COG3468 237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGG 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1788 SASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVT 1867
Cdd:COG3468 317 GGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLT 396
|
410
....*....|..
gi 50593518 1868 SDGFAGNLGTNT 1879
Cdd:COG3468 397 TGGTGNNGGGGV 408
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
1221-1799 |
1.45e-13 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 76.35 E-value: 1.45e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGA--GFGGTLSTSISFGGSPSTNTGFGGTLSTSVS 1298
Cdd:COG5295 19 SGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAatAGAGSGGTSATAASSVASGGASAATAASTGT 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1299 FGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFG 1378
Cdd:COG5295 99 GNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANAATAAAGATSTS 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1379 GAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFG 1458
Cdd:COG5295 179 ASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSG 258
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1459 GAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFG 1538
Cdd:COG5295 259 SAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGA 338
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1539 GAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFG 1618
Cdd:COG5295 339 SAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAA 418
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1619 GAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGG 1698
Cdd:COG5295 419 AGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAA 498
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1699 SHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS 1778
Cdd:COG5295 499 AGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVAS 578
|
570 580
....*....|....*....|.
gi 50593518 1779 TGFGGSLGPSASFNGGLGTST 1799
Cdd:COG5295 579 GANSVSVGAAGAENVAAGATD 599
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
1215-1723 |
7.67e-13 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 74.04 E-value: 7.67e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5295 88 ASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANA 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1295 TSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5295 168 ATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGN 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1375 ASFGGAINTSAGFGSTLNSS----ASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGA 1450
Cdd:COG5295 248 ATTASASSVSGSAVAAGTAStattASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAG 327
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1451 LNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGA 1530
Cdd:COG5295 328 GSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASA 407
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1531 LNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGA 1610
Cdd:COG5295 408 GGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAI 487
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1611 LNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGS--------THSN 1682
Cdd:COG5295 488 AGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSvavgnntaTGAN 567
|
490 500 510 520
....*....|....*....|....*....|....*....|....*
gi 50593518 1683 SISFGSAPTT----SVSFGGSHSTNLCFGGAPSTSLCFGSASNTN 1723
Cdd:COG5295 568 SVALGAGSVAsganSVSVGAAGAENVAAGATDTDAVNGGGAVATG 612
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
1351-1942 |
1.43e-12 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 73.27 E-value: 1.43e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGST-----LNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG5295 1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGsaatsSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG5295 81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG5295 161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1586 NfgGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSV 1665
Cdd:COG5295 241 A--GAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGG 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1666 GFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNE 1745
Cdd:COG5295 319 GAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGS 398
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1746 GHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLnhnadfNGGL 1825
Cdd:COG5295 399 GGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAA------TTAA 472
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1826 GNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGP 1905
Cdd:COG5295 473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552
|
570 580 590
....*....|....*....|....*....|....*..
gi 50593518 1906 NASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNG 1942
Cdd:COG5295 553 TNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAG 589
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1445-1952 |
1.66e-12 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 72.89 E-value: 1.66e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1445 ATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTN 1524
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1525 ATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTS 1604
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1605 ASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSI 1684
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1685 SFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANfneghsISFGNGLSTSAGFGNG 1764
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGG------GGGGGGGGGGGGGGGG 314
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1765 LGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGE 1844
Cdd:COG4625 315 GGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGG 394
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1845 LGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSN 1924
Cdd:COG4625 395 GAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAG 474
|
490 500
....*....|....*....|....*...
gi 50593518 1925 TSNGFTGEPNTGSSFSNGPSSIVGFSGG 1952
Cdd:COG4625 475 TLTLTGNNTYTGTTTVNGGGNYTQSAGS 502
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1106-1596 |
3.44e-12 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 72.12 E-value: 3.44e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1106 GGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1186 GPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1266 GAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAIST 1345
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1346 STGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG4625 331 GGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGG 410
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG4625 411 GGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTV 490
|
490
....*....|.
gi 50593518 1586 NFGGALTNSAG 1596
Cdd:COG4625 491 NGGGNYTQSAG 501
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1339-1647 |
3.50e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.96 E-value: 3.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849 231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849 301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849 381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849 459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538
|
....*...
gi 50593518 1640 AISTNASF 1647
Cdd:NF033849 539 SISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1356-1702 |
7.74e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.19 E-value: 7.74e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849 220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849 298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849 374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849 443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
|
330 340 350
....*....|....*....|....*....|....*.
gi 50593518 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849 515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1736-2064 |
2.07e-10 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 66.18 E-value: 2.07e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849 216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849 294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849 370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849 444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
|
330
....*....|..
gi 50593518 2053 GGLNTSAGFSGG 2064
Cdd:NF033849 524 GGRTSGAGGSMG 535
|
|
| MscS_porin |
pfam12795 |
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ... |
268-445 |
2.01e-09 |
|
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
Pssm-ID: 432790 [Multi-domain] Cd Length: 238 Bit Score: 60.01 E-value: 2.01e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795 10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795 85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
|
170
....*....|....*...
gi 50593518 428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795 161 LKAQIDMLEQELLSNNNR 178
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
863-1511 |
5.69e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 61.33 E-value: 5.69e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG4625 28 AGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGG 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG4625 108 GGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG4625 188 GGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGG 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG4625 268 GGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGG 347
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG4625 348 GGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGT 427
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1263 PYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGA 1342
Cdd:COG4625 428 GAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGSTLAVE 507
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1343 ISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALN 1422
Cdd:COG4625 508 VDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTTYTILAVAAALDALAGNGDLSALYNALAALDAAAARAALDQLSGEIH 587
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1423 TNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALnnsAGFGGAIS 1502
Cdd:COG4625 588 ASAAAALLQASRALRDALSNRLRALRGAGAAGDAAAEGWGVWAQGFGSWGDQDGDGGAAGYDSSTGGLL---VGADYRLG 664
|
....*....
gi 50593518 1503 TNATFGGAL 1511
Cdd:COG4625 665 DNWRLGVAL 673
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
917-1432 |
7.53e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 60.95 E-value: 7.53e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 917 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 996
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGS-TLNSSA 1395
Cdd:COG4625 401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAgTLTLTG 480
|
490 500 510
....*....|....*....|....*....|....*..
gi 50593518 1396 SFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:COG4625 481 NNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
997-1512 |
9.96e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 60.56 E-value: 9.96e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625 2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625 82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625 162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625 242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG4625 322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG4625 402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGN 481
|
490 500 510
....*....|....*....|....*....|....*.
gi 50593518 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALN 1512
Cdd:COG4625 482 NTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
|
|
| auto_AIDA-I |
NF033176 |
autotransporter adhesin AIDA-I; |
1318-1785 |
1.30e-08 |
|
autotransporter adhesin AIDA-I;
Pssm-ID: 380183 [Multi-domain] Cd Length: 1287 Bit Score: 60.44 E-value: 1.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176 72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176 149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176 225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176 300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176 374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518 1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176 451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1559-2060 |
1.77e-08 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 59.79 E-value: 1.77e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1559 GAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFG 1638
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1639 GAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGS 1718
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1719 ASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTS 1798
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1799 TGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTN 1878
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1879 TGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1959 FCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTA 2038
Cdd:COG4625 401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
|
490 500
....*....|....*....|..
gi 50593518 2039 AGFGSGLSTSTGfGGGLNTSAG 2060
Cdd:COG4625 481 NNTYTGTTTVNG-GGNYTQSAG 501
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
999-1646 |
2.68e-08 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 59.40 E-value: 2.68e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 999 ISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSA 1078
Cdd:COG5295 1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1079 PFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTAsisfGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGG 1158
Cdd:COG5295 81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNA----GASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNT 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1159 APSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTV 1238
Cdd:COG5295 157 ATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1239 FSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSsdfGGTLSTSVS 1318
Cdd:COG5295 237 GSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAA---NATAGGGNA 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1319 FGGSSGANAG--FGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG5295 314 GSGGGGAAALgsAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGA 393
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG5295 394 GSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAAS 473
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5295 474 AAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGT 553
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAG 1636
Cdd:COG5295 554 NSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALG 633
|
650
....*....|
gi 50593518 1637 FGGAISTNAS 1646
Cdd:COG5295 634 AGATATANNS 643
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1566-2071 |
3.73e-08 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 59.02 E-value: 3.73e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1566 SFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNA 1645
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1646 SFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLC 1725
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1726 FGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGL 1805
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1806 GTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTL 1885
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1886 GTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGfcsgpST 1965
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGG-----GG 395
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1966 GGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGL 2045
Cdd:COG4625 396 AGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGT 475
|
490 500
....*....|....*....|....*.
gi 50593518 2046 STSTGFGGGLNTSAGFSGGPPSTGTG 2071
Cdd:COG4625 476 LTLTGNNTYTGTTTVNGGGNYTQSAG 501
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
262-468 |
5.43e-08 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 57.22 E-value: 5.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372 26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372 106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 50593518 422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372 177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
268-522 |
9.60e-08 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.76 E-value: 9.60e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTE 347
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 348 VSSRQI-------EASN-----RQIGASNRQTEASNRQIGA-SNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG3883 97 RSGGSVsyldvllGSESfsdflDRLSALSKIADADADLLEElKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQ 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARK 494
Cdd:COG3883 177 QAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAAG 256
|
250 260
....*....|....*....|....*...
gi 50593518 495 AANKNRATESQAQIAEQGAQASEASISA 522
Cdd:COG3883 257 AAAGSAGAAGAAAGAAGAGAAAASAAGG 284
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
97-524 |
1.17e-07 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 57.53 E-value: 1.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483 293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483 366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483 443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483 520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483 582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483 659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
255-525 |
1.52e-07 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 56.06 E-value: 1.52e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372 40 LDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 335 SNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNRQIGASNRQTDASNRQTDASNRQTEassR 414
Cdd:COG4372 120 LQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQL--ESLQEELAALEQELQALSEAEAEQALDELLKEAN---R 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVT-NHALSVTVRIRRGSRAR 493
Cdd:COG4372 195 NAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVIlKEIEELELAILVEKDTE 274
|
250 260 270
....*....|....*....|....*....|..
gi 50593518 494 KAANKNRATESQAQIAEQGAQASEASISALET 525
Cdd:COG4372 275 EEELEIAALELEALEEAALELKLLALLLNLAA 306
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
264-442 |
2.40e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 55.16 E-value: 2.40e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASN 343
Cdd:COG4942 17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 344 RQTEvssRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDA-SNRQTEASSRQTEASSRQ 422
Cdd:COG4942 97 AELE---AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEElRADLAELAALRAELEAER 173
|
170 180
....*....|....*....|
gi 50593518 423 TEASSRQTEASSRQIEASAA 442
Cdd:COG4942 174 AELEALLAELEEERAALEAL 193
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
251-459 |
2.89e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 55.16 E-value: 2.89e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 251 ASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNR 330
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 331 QIEASNRQIGASNRQTEVSSRQ-------------------------IEASNRQIGASNRQTEASNRQIGASNRQTEASN 385
Cdd:COG4942 98 ELEAQKEELAELLRALYRLGRQpplalllspedfldavrrlqylkylAPARREQAEELRADLAELAALRAELEAERAELE 177
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518 386 RQIgASNRQTDASNRQTDASNRQTEASSRQTEASSRQT----EASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGS 459
Cdd:COG4942 178 ALL-AELEEERAALEALKAERQKLLARLEKELAELAAElaelQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
|
|
| Tar |
COG0840 |
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms]; |
250-442 |
3.71e-07 |
|
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
Pssm-ID: 440602 [Multi-domain] Cd Length: 533 Bit Score: 55.03 E-value: 3.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIgasnRQIMASNRQIG--- 326
Cdd:COG0840 292 ETAAAMEELSATVQEVAENAQQAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAETI----EELGESSQEIGeiv 367
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 327 -------------ASNRQIEAS------------------------------NRQIGASNRQTEVSSRQIEASNRQIGAS 363
Cdd:COG0840 368 dviddiaeqtnllALNAAIEAArageagrgfavvadevrklaersaeatkeiEELIEEIQSETEEAVEAMEEGSEEVEEG 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 364 NRQTEASN---RQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEAS 440
Cdd:COG0840 448 VELVEEAGealEEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527
|
..
gi 50593518 441 AA 442
Cdd:COG0840 528 VS 529
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
254-524 |
6.90e-07 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 53.75 E-value: 6.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 254 AIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIE 333
Cdd:COG4372 25 LIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELE 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 334 ASNRQigASNRQTEVSsrQIEASNRQIGASNRQTEASNRQIGAS--NRQTEASNRQIGASNRQTDASNRQTDASNRQTEA 411
Cdd:COG4372 105 SLQEE--AEELQEELE--ELQKERQDLEQQRKQLEAQIAELQSEiaEREEELKELEEQLESLQEELAALEQELQALSEAE 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 412 SSRQTEASSRQTEASSRQTEASSRQIEA-------SAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTV 484
Cdd:COG4372 181 AEQALDELLKEANRNAEKEEELAEAEKLieslpreLAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 50593518 485 RIRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:COG4372 261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLAL 300
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
250-524 |
7.08e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 54.76 E-value: 7.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 50593518 472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
1492-2053 |
1.03e-06 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 54.01 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1492 NNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVL 1571
Cdd:COG5295 2 ASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAAS 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1572 NNSAGFGGAINTSANFGGA-LTNSAGFGGAISTSASfggalnNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGA 1650
Cdd:COG5295 82 SVASGGASAATAASTGTGNtAGTAATVAGAASSGSA------TNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSN 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1651 ISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfGGSN 1730
Cdd:COG5295 156 TATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAA---TGSA 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1731 STNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTD 1810
Cdd:COG5295 233 ASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGN 312
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1811 FSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAG 1890
Cdd:COG5295 313 AGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAG 392
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1891 FSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGG 1970
Cdd:COG5295 393 AGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAA 472
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1971 GPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTG 2050
Cdd:COG5295 473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552
|
...
gi 50593518 2051 FGG 2053
Cdd:COG5295 553 TNS 555
|
|
| YhjY |
COG5571 |
Uncharacterized conserved protein YhjY, contains autotransporter beta-barrel domain [General ... |
1227-1654 |
1.41e-06 |
|
Uncharacterized conserved protein YhjY, contains autotransporter beta-barrel domain [General function prediction only];
Pssm-ID: 444313 [Multi-domain] Cd Length: 648 Bit Score: 53.34 E-value: 1.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1227 TSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTS 1306
Cdd:COG5571 5 SAAGSLGYLASASSNAATAPGLAAATASAAGAAGLGAASTASSLSGASLALLAAQALGAGLSGTNGFSGGAGSSSGTGPT 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1307 SDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAG 1386
Cdd:COG5571 85 ANGGLAGAGGVDLAGAGGGGGASGLAGGAGGAGGTAAAGGAAAAGGGAAGNAATAAAAAAAGTALQLSGLTTAGAVGGVA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1387 FGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN 1466
Cdd:COG5571 165 GTAALNGATANTGLGAAAALAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAAAVLASPAPAAGGAAAAAAGAAAAAAS 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1467 FGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSAS 1546
Cdd:COG5571 245 AAANAATQANLLLLALALGSNGNAVGLNAVGLANEAAAPGAVGGDAGSTGATPSTLSSASCVASSLTAANANTLYAAADT 324
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1547 FGGTLNNSASFGGAINTSASFGGVlnnsAGFGGAINTSANFGGALTNSAGFGGAiSTSASFGGALNNSAGFGGAISTSAS 1626
Cdd:COG5571 325 AGPAGATAALAAAAAAVLASAAAV----AQAALALAAAGGQARSLAVAAGQGRG-ARGGQTRGGGGAGGTTGGGVGAGGG 399
|
410 420
....*....|....*....|....*...
gi 50593518 1627 FGGALNNSAGFGGAISTNASFGGAISNS 1654
Cdd:COG5571 400 DGDGPNLTLGVDYRLSDNLLLGAALSYG 427
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
909-1546 |
1.60e-06 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 53.24 E-value: 1.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 909 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 988
Cdd:COG5295 1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 989 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANS 1068
Cdd:COG5295 81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1069 SFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTS 1148
Cdd:COG5295 161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1149 LSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTS 1228
Cdd:COG5295 241 AGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGA 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1229 FGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSIS-FGGSPSTNTGFGGTLSTSVSFGASSSTSS 1307
Cdd:COG5295 321 AALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGaAATSSSGGSATAAGNAAGAAGAGSAGSGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1308 DFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGF 1387
Cdd:COG5295 401 SSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAA 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1388 GSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNF 1467
Cdd:COG5295 481 ATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGN 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFG----GAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAIST 1543
Cdd:COG5295 561 NTATGANSVALGAGSVASGANSVSVGAAGAEnvaaGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALGAGATATA 640
|
...
gi 50593518 1544 SAS 1546
Cdd:COG5295 641 NNS 643
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1399-1630 |
1.95e-06 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 52.59 E-value: 1.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1399 SALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFG--GAMNTNATFGGALNSNAGFGGAISTSTNFGGAlnnsag 1476
Cdd:COG5651 159 AAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAnlGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFA------ 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1477 fGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5651 233 -GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLG 311
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 50593518 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGA 1630
Cdd:COG5651 312 AGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| MscS_porin |
pfam12795 |
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ... |
250-436 |
1.99e-06 |
|
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
Pssm-ID: 432790 [Multi-domain] Cd Length: 238 Bit Score: 51.15 E-value: 1.99e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:pfam12795 48 DAPAELRELRQELAALQAKAEAAPKEILASLSLEELEQRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQ 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNR-QIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQ 408
Cdd:pfam12795 128 RLQQIRNRLNGPAPPGEPLSEAQRWALQAELAALKAQIDMLEQeLLSNNNRQDLLKARRDLLTLRIQRLEQQLQALQELL 207
|
170 180
....*....|....*....|....*...
gi 50593518 409 TEasSRQTEAssRQTEASSRQTEASSRQ 436
Cdd:pfam12795 208 NE--KRLQEA--EQAVAQTEQLAEEAAG 231
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
268-524 |
3.37e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 52.63 E-value: 3.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEAsnrqigASNRQTE 347
Cdd:COG1196 219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE------AQAEEYE 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 348 VSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASS 427
Cdd:COG1196 293 LLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEA 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 428 RQTEASSRQIEASAAAVRPKKprgkkgnnkgsnsasepsEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQ 507
Cdd:COG1196 373 ELAEAEEELEELAEELLEALR------------------AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE 434
|
250
....*....|....*..
gi 50593518 508 IAEQGAQASEASISALE 524
Cdd:COG1196 435 EEEEEEEEALEEAAEEE 451
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
250-455 |
1.14e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 49.90 E-value: 1.14e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:COG4372 56 QAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLE 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQ--IGASNRQTEASNRQIGASNRQTDASNRQTDASNR 407
Cdd:COG4372 136 AQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAeaEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 50593518 408 QTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGN 455
Cdd:COG4372 216 LAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEEL 263
|
|
| COG4935 |
COG4935 |
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases ... |
1216-1664 |
1.31e-05 |
|
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443962 [Multi-domain] Cd Length: 641 Bit Score: 50.20 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1216 TNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLST 1295
Cdd:COG4935 96 GVVAVAGAGLAATASGAAAGAVAAAANGNTGAGPGSGGTGGGSGGAGAAAAAAALSAAGAAVGVAAVAGAAGGGGGVGVA 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1296 SVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSA 1375
Cdd:COG4935 176 AAVGVVLGAGLVADGGNGGGGAVAGGAAGGGGGGGGGGGLGGAAGGGGAGLAAAGGGGGGAAAAAAAGVGGLGAAATAAA 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1376 SFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNA 1455
Cdd:COG4935 256 ADGGGGGGAGAAGAGGSAGAAAGGAGAGVVGAAAGGGDAALGGAVGAAGTGNAAAAAAASAGSGGGGGSAAAAGAAAAAA 335
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1456 GFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA 1535
Cdd:COG4935 336 AAAAGAAAGVSGAASVVAGASGGGAGTAAAAGGGAAAAAAGGAAAAGAAAGAAAGAAAGAAAAGGVASAAGAVGAGTAAG 415
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1536 GFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSA 1615
Cdd:COG4935 416 ASATAAVSTGAASGSSTTSSTGTTATATGLGGGADAGSTSTGTGSAAGAAGGTTTATSGLASSTTAAAAAAAAGLATTAA 495
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1616 GFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNS---PDFGGAFSTS 1664
Cdd:COG4935 496 VAAGAAGAAAAAATAASVGGATGAAGTTNSTATFSNTTDvaiPDNGPAGVTS 547
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1028-1427 |
1.66e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 50.39 E-value: 1.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849 217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849 284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849 310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849 386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849 463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542
|
....
gi 50593518 1424 NATF 1427
Cdd:NF033849 543 GKSY 546
|
|
| Tar |
COG0840 |
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms]; |
162-380 |
3.49e-05 |
|
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
Pssm-ID: 440602 [Multi-domain] Cd Length: 533 Bit Score: 48.86 E-value: 3.49e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 162 NESANSLASTAVNKSKKASTANNAANKTVPSAAEISlasAATHTVTTQGQAAKETgSIQTIAATARSKKN-SKGKRTPAK 240
Cdd:COG0840 266 ASASEELAASAEELAAGAEEQAASLEETAAAMEELS---ATVQEVAENAQQAAEL-AEEASELAEEGGEVvEEAVEGIEE 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 241 TTNTDNEYVEASNAIEASSRQIG-------------------AS---------GR--------------QTEASNRQIEA 278
Cdd:COG0840 342 IRESVEETAETIEELGESSQEIGeivdviddiaeqtnllalnAAieaarageaGRgfavvadevrklaeRSAEATKEIEE 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 279 ssrQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQImasnRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNR 358
Cdd:COG0840 422 ---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAL----EEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIE 494
|
250 260
....*....|....*....|..
gi 50593518 359 QIGASNRQTEASNRQIGASNRQ 380
Cdd:COG0840 495 QIAAAAQENAASVEEVAAAAEE 516
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1270-1470 |
4.05e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 48.35 E-value: 4.05e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1270 GGTLSTSISFGGSPSTNTGFGGTLSTS---VSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTS 1346
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFANLGLTGlnqVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASA 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1347 TGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAT 1426
Cdd:COG5651 258 ALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAA 337
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 50593518 1427 FGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA 1470
Cdd:COG5651 338 GAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGG 381
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
206-525 |
7.28e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.13 E-value: 7.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168 648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168 708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168 855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924
|
....*
gi 50593518 521 SALET 525
Cdd:TIGR02168 925 AQLEL 929
|
|
| Tar |
COG0840 |
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms]; |
241-445 |
1.89e-04 |
|
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
Pssm-ID: 440602 [Multi-domain] Cd Length: 533 Bit Score: 46.55 E-value: 1.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 241 TTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMA 320
Cdd:COG0840 230 DVDSKDEIGQLADAFNRMIENLRELVGQVRESAEQVASASEELAASAEELAAGAEEQAASLEETAAAMEELSATVQEVAE 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 321 S------------------NRQIGASNRQIEASNRQIGASNR------------------------QT------------ 346
Cdd:COG0840 310 NaqqaaelaeeaselaeegGEVVEEAVEGIEEIRESVEETAEtieelgessqeigeivdviddiaeQTnllalnaaieaa 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 347 --------------EV---------SSRQIEAsnrQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQ----TDASN 399
Cdd:COG0840 390 rageagrgfavvadEVrklaersaeATKEIEE---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAleeiVEAVE 466
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 50593518 400 RQTDASNRQTEASSRQTEASS------RQTEASSRQTEASSRQIEASAAAVR 445
Cdd:COG0840 467 EVSDLIQEIAAASEEQSAGTEevnqaiEQIAAAAQENAASVEEVAAAAEELA 518
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
245-519 |
2.58e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.21 E-value: 2.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 245 DNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTE--TSNRQIgasnrqimASN 322
Cdd:TIGR02169 222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlGEEEQL--------RVK 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 323 RQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEA-SNRQIGASNRQTDASNR- 400
Cdd:TIGR02169 294 EKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKlTEEYAELKEELEDLRAEl 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 401 -QTDASNRQT--EASSRQTEASSRQTEASSRQTEAS-----SRQIEASAAAVRPKKPRGKKGNNKgsnSASEPSEAPPAI 472
Cdd:TIGR02169 374 eEVDKEFAETrdELKDYREKLEKLKREINELKRELDrlqeeLQRLSEELADLNAAIAGIEAKINE---LEEEKEDKALEI 450
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 50593518 473 QTVTNHaLSVTVRIRRGSRARKAANKN-------RATESQAQIAEQGAQASEAS 519
Cdd:TIGR02169 451 KKQEWK-LEQLAADLSKYEQELYDLKEeydrvekELSKLQRELAEAEAQARASE 503
|
|
| YjbI |
COG1357 |
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown]; |
1453-1627 |
2.65e-04 |
|
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
Pssm-ID: 440968 [Multi-domain] Cd Length: 178 Bit Score: 43.78 E-value: 2.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1453 SNAGFGGAISTSTNFGGAlNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAln 1532
Cdd:COG1357 8 SGADLSGADLSGADLSGA-NLSGALSGANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADLSGA-- 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1533 NSAGFGGAISTSASFGGtlnnsASFGGAINTSASFGGvlnnsAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALN 1612
Cdd:COG1357 85 NLANLSGANLSGANLSG-----ANLRGANLSGANLSG-----ADLSGADLSGANLSGADLSGANLSGANLSGADLSGADL 154
|
170
....*....|....*
gi 50593518 1613 NSAGFGGAISTSASF 1627
Cdd:COG1357 155 SGANLSGANLSGANL 169
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
319-404 |
2.66e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.53 E-value: 2.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 319 MASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDAS 398
Cdd:COG4942 16 AAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95
|
....*.
gi 50593518 399 NRQTDA 404
Cdd:COG4942 96 RAELEA 101
|
|
| COG5412 |
COG5412 |
Phage-related protein [Mobilome: prophages, transposons]; |
1215-1643 |
2.79e-04 |
|
Phage-related protein [Mobilome: prophages, transposons];
Pssm-ID: 444167 [Multi-domain] Cd Length: 704 Bit Score: 46.23 E-value: 2.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGfGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5412 7 SAKEAASAALLLAQAKAADSELTAASGGVVSAAA-KAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1295 TSVSFGASSSTSSDFGGTLstsvsfGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5412 86 LLAAGGARAKGSAAAAAAL------GAVAAAAKVLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGL 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1375 ASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAtfggvlngsAGFGGAMNTNATFGGALNSN 1454
Cdd:COG5412 160 AAAGAAAAASALAAAGAIAKAILSASKLSGQALAGQSAAAGGALEAAAAAA---------AGAAAAGAAAAAATAASALL 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1455 AGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALnnsagfgGAISTNATFGGALNNS 1534
Cdd:COG5412 231 ALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGLVAGAEASGGTAGGAV-------AGLAAGLAAAAGASAN 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1535 AGFGGAISTSASFGGTLNNSAsfGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNS 1614
Cdd:COG5412 304 LGAAAAASFGASLAASAGVDT--AAAALAAAEAIADGSLVAGLGSAGTVLSTLSGAVGGLEGAIGQLGAAGGLGSALGGL 381
|
410 420 430
....*....|....*....|....*....|....
gi 50593518 1615 AGFGGAISTS-----ASFGGALNNSAGFGGAIST 1643
Cdd:COG5412 382 TGPIGIVIAAiaaliAAFVALWKNSETFRNLVQG 415
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
869-1089 |
2.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849 310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849 382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849 446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1239-1459 |
2.86e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 45.81 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518 1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967 160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
|
|
| COG5412 |
COG5412 |
Phage-related protein [Mobilome: prophages, transposons]; |
1318-1666 |
2.91e-04 |
|
Phage-related protein [Mobilome: prophages, transposons];
Pssm-ID: 444167 [Multi-domain] Cd Length: 704 Bit Score: 45.84 E-value: 2.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1318 SFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSgvLNSSASFGGAINTSAGFGSTLNSSASF 1397
Cdd:COG5412 35 VVSAAAKAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL--LLAAGGARAKGSAAAAAALGAVAAAAK 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1398 GSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGF 1477
Cdd:COG5412 113 VLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGLAAAGAAAAASALAAAGAIAKAILSASKLSGQAL 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1478 GGAMNTSASFGGALN---NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNS 1554
Cdd:COG5412 193 AGQSAAAGGALEAAAaaaAGAAAAGAAAAAATAASALLALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGL 272
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1555 AsFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNS 1634
Cdd:COG5412 273 V-AGAEASGGTAGGAVAGLAAGLAAAAGASANLGAAAAASFGASLAASAGVDTAAAALAAAEAIADGSLVAGLGSAGTVL 351
|
330 340 350
....*....|....*....|....*....|..
gi 50593518 1635 AGFGGAISTNASFGGAISNSPDFGGAFSTSVG 1666
Cdd:COG5412 352 STLSGAVGGLEGAIGQLGAAGGLGSALGGLTG 383
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
259-380 |
3.34e-04 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 45.34 E-value: 3.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 259 SRQIgaSGRQTE--ASNRQIEASSRQ---TEASNRQTEASSRQTEASSRQTETSNRQIGASN----RQIMASNRQIGASN 329
Cdd:PRK09039 45 SREI--SGKDSAldRLNSQIAELADLlslERQGNQDLQDSVANLRASLSAAEAERSRLQALLaelaGAGAAAEGRAGELA 122
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 50593518 330 RQIeASNRQIGA-SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQ 380
Cdd:PRK09039 123 QEL-DSEKQVSArALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAK 173
|
|
| PHA02515 |
PHA02515 |
hypothetical protein; Provisional |
1294-1505 |
3.67e-04 |
|
hypothetical protein; Provisional
Pssm-ID: 107197 [Multi-domain] Cd Length: 508 Bit Score: 45.54 E-value: 3.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515 175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515 252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518 1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515 330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
272-490 |
3.75e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.67 E-value: 3.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609 33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609 113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609 193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
889-1092 |
4.31e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 44.88 E-value: 4.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 889 ITNPSGGFGGiSNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG5651 174 ITNPGGLLGA-QNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 969 ISNPSGGFGGISNP--SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSS 1046
Cdd:COG5651 253 AGASAALASLAATLlnASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 50593518 1047 APSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAP 1092
Cdd:COG5651 333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAA 378
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1373-1606 |
7.56e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 44.66 E-value: 7.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1373 SSASFGGAINTSAGFGSTLnssaSFGSALSTSA-SFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGAL 1451
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGF----SFGAAAASNPgSTGGFSFGTLGAAPAATATTTTATLGLGGGLFGQKPATGFTFGTPA 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1452 NSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSAsfGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAL 1531
Cdd:pfam15967 78 SSTAATGPTGLTLGTPAATTAASTGFSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTP 155
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518 1532 NNSAgfggAISTSASFGGTLNnsaSFGGAINTSASFGGVLNNSAGFGGAINTSANFgGALTNSAGFGGAISTSAS 1606
Cdd:pfam15967 156 ATTT----AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGGLDFSTSS 222
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
250-360 |
7.84e-04 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.80 E-value: 7.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 250 EASNAIEASSRQIGASGRQTEASNRQIEASsrQTEASNRQTEASSRQTE-ASSRQTEtsnRQIGASnrqimaSNRQIGAS 328
Cdd:PRK09039 74 QGNQDLQDSVANLRASLSAAEAERSRLQAL--LAELAGAGAAAEGRAGElAQELDSE---KQVSAR------ALAQVELL 142
|
90 100 110
....*....|....*....|....*....|..
gi 50593518 329 NRQIEASNRQIGASNRQTEVSSRQIEASNRQI 360
Cdd:PRK09039 143 NQQIAALRRQLAALEAALDASEKRDRESQAKI 174
|
|
| Keratin_2_head |
pfam16208 |
Keratin type II head; |
877-1015 |
8.50e-04 |
|
Keratin type II head;
Pssm-ID: 465068 [Multi-domain] Cd Length: 156 Bit Score: 41.95 E-value: 8.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 877 SFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPS-GGFGGISNPSGGFGGISNPSGGFGGISNPSGG 955
Cdd:pfam16208 1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNlGGSKSISISVAGGGSRPGSGFGFGGGGGGGFG 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 956 FGGISNPSGGFGGISNPSGGFGGISNPSGGFGGisnpsGGFGGisnpSGGFGGISNPSGG 1015
Cdd:pfam16208 81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGGFGG-----GGFGG----RGGFGGPPCPPGG 131
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
881-1079 |
1.09e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 43.73 E-value: 1.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 881 GAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSG----GFGGiSNPSGGFGGISNPSGGFGGISNPSGGF 956
Cdd:COG5651 182 GAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGpgntGFAG-TGAAAGAAAAAAAAAAAAGAGASAALA 260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG5651 261 SLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 50593518 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAP 1079
Cdd:COG5651 341 AGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
176-452 |
1.52e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 43.35 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 176 SKKASTANNAANKTvpsAAEISLASAATHT-VTTQGQAAKETGSIQTIAATARSKKNskgkRTPAKTTNTDNEYVEASNA 254
Cdd:COG4372 9 GKARLSLFGLRPKT---GILIAALSEQLRKaLFELDKLQEELEQLREELEQAREELE----QLEEELEQARSELEQLEEE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372 82 LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLES 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 335 SNRQIgaSNRQTEVSSRQIEASNRQIgasNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG4372 162 LQEEL--AALEQELQALSEAEAEQAL---DELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
|
250 260 270
....*....|....*....|....*....|....*...
gi 50593518 415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGK 452
Cdd:COG4372 237 ALLDALELEEDKEELLEEVILKEIEELELAILVEKDTE 274
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
260-435 |
1.58e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.75 E-value: 1.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 260 RQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQtetsnRQIGASNRQImasnRQIGASNRQIEASNRQI 339
Cdd:COG4913 624 EELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-----REIAELEAEL----ERLDASSDDLAALEEQL 694
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 340 GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQtEAS 419
Cdd:COG4913 695 EELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE-DLARLELRALLEERFAAALGDAVERELRE-NLE 772
|
170
....*....|....*.
gi 50593518 420 SRQTEASSRQTEASSR 435
Cdd:COG4913 773 ERIDALRARLNRAEEE 788
|
|
| Tar |
COG0840 |
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms]; |
162-426 |
1.68e-03 |
|
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
Pssm-ID: 440602 [Multi-domain] Cd Length: 533 Bit Score: 43.47 E-value: 1.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 162 NESANSLASTAVNKSKKASTANNAANKTvpsAAEISLASAATHTVTTqgqaaketgSIQTIAATARskknskgkrtpakt 241
Cdd:COG0840 259 RESAEQVASASEELAASAEELAAGAEEQ---AASLEETAAAMEELSA---------TVQEVAENAQ-------------- 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 242 tntdneyvEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASN---RQTEASSR--------------QT------- 297
Cdd:COG0840 313 --------QAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAetiEELGESSQeigeivdviddiaeQTnllalna 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 298 --EA-----------------------SSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQI---GASNRQTEVS 349
Cdd:COG0840 385 aiEAarageagrgfavvadevrklaerSAEATKEIEELIEEIQSETEEAVEAMEEGSEEVEEGVELVeeaGEALEEIVEA 464
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518 350 SRQIEASNRQIGAsnrqteasnrqigASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEAS 426
Cdd:COG0840 465 VEEVSDLIQEIAA-------------ASEEQSAGTE-EVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
310-543 |
2.91e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 42.12 E-value: 2.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 310 QIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNR--- 386
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEI--EERREELGERara 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 387 -------------------------QIGASNRQTDASNR--------QTDASNRQTEASSRQTEASSRQTEASSRQTEAS 433
Cdd:COG3883 95 lyrsggsvsyldvllgsesfsdfldRLSALSKIADADADlleelkadKAELEAKKAELEAKLAELEALKAELEAAKAELE 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 434 SRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGA 513
Cdd:COG3883 175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
|
250 260 270
....*....|....*....|....*....|
gi 50593518 514 QASEASISALETQVAAAVQALADDYLAQLS 543
Cdd:COG3883 255 AGAAAGSAGAAGAAAGAAGAGAAAASAAGG 284
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
903-1109 |
2.99e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 42.19 E-value: 2.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 903 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 982
Cdd:COG5651 177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGAS 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 983 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSF 1062
Cdd:COG5651 257 AALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAA 336
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 50593518 1063 SGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAP 1109
Cdd:COG5651 337 AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
867-1049 |
3.14e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 42.19 E-value: 3.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 867 LFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 946
Cdd:COG5651 202 LTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASS 281
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 947 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISN-PSGGFGGISNPSGGFGGISNPSGG 1025
Cdd:COG5651 282 AATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAaGAAAGAGAAAAAAAGGAGGGGGGA 361
|
170 180
....*....|....*....|....
gi 50593518 1026 FGGRNSITFGSVPNTSANFSSAPS 1049
Cdd:COG5651 362 LGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| PHA00430 |
PHA00430 |
tail fiber protein |
325-467 |
3.26e-03 |
|
tail fiber protein
Pssm-ID: 222790 [Multi-domain] Cd Length: 568 Bit Score: 42.57 E-value: 3.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 325 IGASNR-QIEASNRQI----GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASN 399
Cdd:PHA00430 121 IGVNNDgHLDARGRRIvnlaDAVDDGDAVPLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWR 200
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518 400 RQTDASNRQTE-----ASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSE 467
Cdd:PHA00430 201 SEADGSNSEANrfkgyADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSA 273
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1522-1713 |
3.30e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.43 E-value: 3.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1522 STNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAI 1601
Cdd:COG3469 13 GGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATST 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1602 STSASFGGALNNSAGfggaiSTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHS 1681
Cdd:COG3469 93 SATLVATSTASGANT-----GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
|
170 180 190
....*....|....*....|....*....|..
gi 50593518 1682 NSISFGSAPTTSvSFGGSHSTNLCFGGAPSTS 1713
Cdd:COG3469 168 TTTTTTSASTTP-SATTTATATTASGATTPSA 198
|
|
| 34 |
PHA02584 |
long tail fiber, proximal subunit; Provisional |
1412-1595 |
5.23e-03 |
|
long tail fiber, proximal subunit; Provisional
Pssm-ID: 222890 [Multi-domain] Cd Length: 1229 Bit Score: 42.05 E-value: 5.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1412 NGSAGFGGALNTNATFggVLNGSAGFGGAMNTNATF------GGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSA 1485
Cdd:PHA02584 908 NGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPINVN 985
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1486 SFGGALNNSAG-FGGAISTNatfGGALNNSAGFGGAISTNATFGGALNNSagfggAISTSASFGGTLNNSASFGGAINTS 1564
Cdd:PHA02584 986 ANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTT-----VFLLTAAGDQTGGFNGLKSLIINNA 1057
|
170 180 190
....*....|....*....|....*....|.
gi 50593518 1565 ASFGGVLNNSAGFGGAINTSanfgGALTNSA 1595
Cdd:PHA02584 1058 NGQVTINDNYIINAGGTIMS----GGLTVNS 1084
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
20-243 |
5.51e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 5.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109 442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109 522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518 176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109 599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
|
|
| YjbI |
COG1357 |
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown]; |
1503-1660 |
5.96e-03 |
|
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
Pssm-ID: 440968 [Multi-domain] Cd Length: 178 Bit Score: 39.92 E-value: 5.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1503 TNATFGGALNNSAGFGGAISTNATFGGALNNsAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAIN 1582
Cdd:COG1357 3 SGADLSGADLSGADLSGADLSGANLSGALSG-ANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1583 TSAN---FGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGG 1659
Cdd:COG1357 82 SGANlanLSGANLSGANLSGANLRGANLSGANLSGADLSGADLSGANLSGADLSGANLSGANLSGADLSGADLSGANLSG 161
|
.
gi 50593518 1660 A 1660
Cdd:COG1357 162 A 162
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1632-1860 |
6.99e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 41.14 E-value: 6.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118 133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118 207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118 287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
|
|
| 34 |
PHA02584 |
long tail fiber, proximal subunit; Provisional |
1390-1588 |
7.27e-03 |
|
long tail fiber, proximal subunit; Provisional
Pssm-ID: 222890 [Multi-domain] Cd Length: 1229 Bit Score: 41.66 E-value: 7.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1390 TLNSSASFGSALSTSASFggVLNGSAGFGGALNTNATF------GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAIST 1463
Cdd:PHA02584 906 TVNGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPIN 983
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1464 STNFGGALNNSAG-FGGAMNTSasfGGALNNSAGFGGAISTNATFGGALNNSAG-----FGGAISTNATFGGALNNSAGF 1537
Cdd:PHA02584 984 VNANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTTVFlltaaGDQTGGFNGLKSLIINNANGQ 1060
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1538 -----------GGAISTsasfGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFG 1588
Cdd:PHA02584 1061 vtindnyiinaGGTIMS----GGLTVNSRIRSQGTKASYTRAPTADTVGFWSVDINDSATYN 1118
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
223-516 |
8.05e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.97 E-value: 8.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 223 AATARSKKNSKGKRTPAKTTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSR 302
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 303 QTETSNRQIGASNRQIMASN-----RQIGASNRQIEASNRQIgasnrqTEVSSRQIEASNRQIGASNRQTEASNRQIGAS 377
Cdd:COG3883 94 ALYRSGGSVSYLDVLLGSESfsdflDRLSALSKIADADADLL------EELKADKAELEAKKAELEAKLAELEALKAELE 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 378 NRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNK 457
Cdd:COG3883 168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAA 247
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 458 GSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGAQAS 516
Cdd:COG3883 248 GAGAAGAAGAAAGSAGAAGAAAGAAGAGAAAASAAGGGAGGAGGGGGGGGAASGGSGGG 306
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
29-297 |
8.74e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.10 E-value: 8.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 29 SPDVQSETTekdppiaSRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPmftQISQASATTEAPNI 108
Cdd:pfam17823 83 STEVTAEHT-------PHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALP---SEAFSAPRAAACRA 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 109 QASVTSQTQKAKTMRVTPKVSLTGSEDATTQLKPPLQALNLP-----VTTPTIQTPVanesaNSLASTAVNKSKKASTAN 183
Cdd:pfam17823 153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSApttaaSSAPATLTPA-----RGISTAATATGHPAAGTA 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 184 NAANKTVPSAAEISLASAATHTVTTQGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTNTDNeyveasnaieASSRQIG 263
Cdd:pfam17823 228 LAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDT----------MARNPAA 297
|
250 260 270
....*....|....*....|....*....|....
gi 50593518 264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQT 297
Cdd:pfam17823 298 PMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTT 331
|
|
|