Conserved domains on  [gi|50593518|ref|NP_001002272|]

trophinin isoform 1 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-2081 6.57e-33

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];



Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 140.67  E-value: 6.57e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210  294 DTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLT 373
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210  374 TAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSG 453
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210  454 TTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTG 533
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210  534 GDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGT 613
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210  614 ITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNA 693
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLST-SISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFG-----GTLSTSVSFGGSSGANAGFGGT 1332
Cdd:COG3210  694 ATGGTLNNAGNTLTISTgSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGvtitsGNAGTLSIGLTANTTASGTTLT 773
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1333 LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLN 1412
Cdd:COG3210  774 LANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1413 GSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210  854 SDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLN 1572
Cdd:COG3210  934 GGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSG 1013
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1573 NSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAIS 1652
Cdd:COG3210 1014 AIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHT 1093
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1653 NSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNST 1732
Cdd:COG3210 1094 LGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAAT 1173
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1733 NCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFS 1812
Cdd:COG3210 1174 TTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGT 1253
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1813 GGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFS 1892
Cdd:COG3210 1254 GDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNS 1333
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1893 VSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGP 1972
Cdd:COG3210 1334 GGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGN 1413
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:COG3210 1414 NGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAG 1493
                       1210      1220
                 ....*....|....*....|....*....
gi 50593518 2053 GGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1494 VAGATASNGGTSTGAGGTAGGTTAEVAKA 1522
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
596-756 2.13e-24

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.



Pssm-ID: 426270  Cd Length: 205  Bit Score: 103.12  E-value: 2.13e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 50593518    720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
MscS_porin super family cl25507
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies towards the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


The actual alignment was detected with superfamily member pfam12795:

Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 50593518    428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
growth_prot_Scy super family cl49463
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


The actual alignment was detected with superfamily member NF041483:

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
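
Each hit above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) followed by an alignment whose Cdd rows give the first and last aligned model positions. The following is a minimal sketch, not an NCBI tool, of how those summary lines could be read and how the covered fraction of a domain model could be estimated. The names `HitSummary`, `parse_hit_summary`, and `model_coverage` are hypothetical, and the regular expression assumes the exact field layout shown on this page.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the layout seen on this page:
# "Pssm-ID: N [Multi-domain]  Cd Length: N  Bit Score: X  E-value: Y"
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line has a different shape."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


def model_coverage(first: int, last: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned model positions."""
    return (last - first + 1) / cd_length


if __name__ == "__main__":
    line = ("Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  "
            "Bit Score: 140.67  E-value: 6.57e-33")
    hit = parse_hit_summary(line)
    print(hit)
    # The COG3210 rows above start at model position 294 and end at 1522.
    print(model_coverage(294, 1522, hit.cd_length))
```

The span measure counts gaps inside the alignment as covered, so it is an upper bound on the number of model positions actually matched.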
 
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1434-1791 3.72e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 3.72e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849  290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849  441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 50593518  1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849  504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1494-1832 4.41e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 4.41e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849  376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849  432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 50593518  1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849  509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1454-1821 9.74e-18

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.45  E-value: 9.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849  290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849  364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849  440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 50593518  1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849  513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1592-1958 7.29e-17

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 87.37  E-value: 7.29e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849  296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849  361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849  434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 50593518  1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849  500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1574-1873 1.01e-16

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 86.98  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849  298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518  1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849  450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1222-1547 1.41e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 83.13  E-value: 1.41e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849  252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849  318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849  389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849  455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
                         330
                  ....*....|..
gi 50593518  1536 GFGGAISTSASF 1547
Cdd:NF033849  535 GLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1339-1647 3.50e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.96  E-value: 3.50e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849  231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849  301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849  381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849  459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*...
gi 50593518  1640 AISTNASF 1647
Cdd:NF033849  539 SISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1356-1702 7.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.19  E-value: 7.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849  220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849  374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849  443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 50593518  1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849  515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1736-2064 2.07e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849  216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849  294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849  444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
                         330
                  ....*....|..
gi 50593518  2053 GGLNTSAGFSGG 2064
Cdd:NF033849  524 GGRTSGAGGSMG 535
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies towards the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 50593518    428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1318-1785 1.30e-08

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 60.44  E-value: 1.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176   72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176  149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176  225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176  300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176  374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518  1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176  451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
262-468 5.43e-08

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 57.22  E-value: 5.43e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372   26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372  106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 50593518  422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372  177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
growth_prot_Scy NF041483
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
PTZ00121 PTZ00121
MAEBL; Provisional
250-524 7.08e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 7.08e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 50593518   472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1028-1427 1.66e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.39  E-value: 1.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849  217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849  284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849  310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849  386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849  463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542

                  ....
gi 50593518  1424 NATF 1427
Cdd:NF033849  543 GKSY 546
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
206-525 7.28e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.13  E-value: 7.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168  648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168  708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168  855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924

                   ....*
gi 50593518    521 SALET 525
Cdd:TIGR02168  925 AQLEL 929
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
869-1089 2.79e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849  382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518  1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849  446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1239-1459 2.86e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518   1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967  160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
PHA02515 PHA02515
hypothetical protein; Provisional
1294-1505 3.67e-04

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 45.54  E-value: 3.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515  175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515  252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518  1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515  330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
272-490 3.75e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 3.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609   33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609  113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518   429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609  193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
Herpes_BLLF1 pfam05109
20-243 5.51e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518     20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109  442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518     97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109  522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518    176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109  599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
dermokine cd21118
1632-1860 6.99e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 6.99e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118  133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118  207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118  287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
FhaB COG3210
859-2081 6.57e-33

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 140.67  E-value: 6.57e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210  294 DTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLT 373
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210  374 TAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSG 453
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210  454 TTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTG 533
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210  534 GDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGT 613
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210  614 ITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNA 693
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLST-SISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFG-----GTLSTSVSFGGSSGANAGFGGT 1332
Cdd:COG3210  694 ATGGTLNNAGNTLTISTgSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGvtitsGNAGTLSIGLTANTTASGTTLT 773
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1333 LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLN 1412
Cdd:COG3210  774 LANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1413 GSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210  854 SDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLN 1572
Cdd:COG3210  934 GGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSG 1013
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1573 NSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAIS 1652
Cdd:COG3210 1014 AIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHT 1093
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1653 NSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNST 1732
Cdd:COG3210 1094 LGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAAT 1173
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1733 NCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFS 1812
Cdd:COG3210 1174 TTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGT 1253
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1813 GGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFS 1892
Cdd:COG3210 1254 GDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNS 1333
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1893 VSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGP 1972
Cdd:COG3210 1334 GGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGN 1413
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:COG3210 1414 NGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAG 1493
                       1210      1220
                 ....*....|....*....|....*....
gi 50593518 2053 GGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1494 VAGATASNGGTSTGAGGTAGGTTAEVAKA 1522
FhaB COG3210
863-2078 1.01e-31

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 136.43  E-value: 1.01e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210  289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210  369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210  449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG3210  529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG3210  609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1263 PYSGAGFGGTLSTS---ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDfggtlSTSVSFGGSSGANAGFGGTLNSSTSF 1339
Cdd:COG3210  689 TTLNAATGGTLNNAgntLTISTGSITVTGQIGALANANGDTVTFGNLGT-----GATLTLNAGVTITSGNAGTLSIGLTA 763
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1340 GGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGG 1419
Cdd:COG3210  764 NTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGS 843
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1420 ALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTS--TNFGGALNNSAGFGGAMNTSASFGGALNNSAGF 1497
Cdd:COG3210  844 NTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTgtANAGTLTNLGTTTNAASGNGAVLATVTATGTGG 923
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1498 GGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGF 1577
Cdd:COG3210  924 GGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSG 1003
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1578 GGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDF 1657
Cdd:COG3210 1004 TTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTA 1083
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1658 GGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSG 1737
Cdd:COG3210 1084 QASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASA 1163
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1738 ATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNH 1817
Cdd:COG3210 1164 GDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGS 1243
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1818 NADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNN 1897
Cdd:COG3210 1244 FVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTA 1323
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1898 GNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPG 1977
Cdd:COG3210 1324 TGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGG 1403
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1978 FGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNT 2057
Cdd:COG3210 1404 VTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIG 1483
                       1210      1220
                 ....*....|....*....|.
gi 50593518 2058 SAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210 1484 GTTTGGNGAGVAGATASNGGT 1504
FhaB COG3210
859-2081 5.80e-31

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 134.12  E-value: 5.80e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210  129 TGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGV 208
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210  209 LANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTA 288
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210  289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210  369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210  449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1259 FGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTS 1338
Cdd:COG3210  529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1339 FGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:COG3210  609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1419 GALNTNATF-----GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN-FGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210  689 TTLNAATGGtlnnaGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLtLNAGVTITSGNAGTLSIGLTANTTAS 768
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1493 NSAGFGGAISTNATFGGALNNS-AGFGGAISTNATFGGALNNS---AGFGGAISTSASFGGTLNNSASFGGAINTSASFG 1568
Cdd:COG3210  769 GTTLTLANANGNTSAGATLDNAgAEISIDITADGTITAAGTTAinvTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDT 848
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1569 GVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFG 1648
Cdd:COG3210  849 TTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTG 928
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1649 GAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGG 1728
Cdd:COG3210  929 GNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTAST 1008
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1729 SNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTS 1808
Cdd:COG3210 1009 TGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGA 1088
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1809 TDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTG 1888
Cdd:COG3210 1089 GTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTA 1168
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1889 AGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGF 1968
Cdd:COG3210 1169 VAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAG 1248
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1969 GGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTS 2048
Cdd:COG3210 1249 SASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAV 1328
                       1210      1220      1230
                 ....*....|....*....|....*....|...
gi 50593518 2049 TGFGGGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210 1329 AAVNSGGVNAGGGTINTTAANTGLNGGNGATDS 1361
FhaB COG3210
863-2077 1.96e-30

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 132.20  E-value: 1.96e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210  477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210  557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210  637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASI--SFGGAPSINSSSGGSSVSFGGAPTTS 1180
Cdd:COG3210  717 GQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASgtTLTLANANGNTSAGATLDNAGAEISI 796
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1181 TSFSGGPCISFGgapcttASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFG 1260
Cdd:COG3210  797 DITADGTITAAG------TTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1261 SSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFG 1340
Cdd:COG3210  871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1341 GAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGA 1420
Cdd:COG3210  951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1421 LNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGA 1500
Cdd:COG3210 1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1501 ISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGA 1580
Cdd:COG3210 1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1581 INTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGA 1660
Cdd:COG3210 1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1661 FSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATS 1740
Cdd:COG3210 1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1741 ANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNAD 1820
Cdd:COG3210 1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1821 FNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNG 1900
Cdd:COG3210 1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGG 1510
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1901 FGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGG 1980
Cdd:COG3210 1511 TAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPT 1590
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1981 PSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAG 2060
Cdd:COG3210 1591 AGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGWAV 1670
                       1210
                 ....*....|....*..
gi 50593518 2061 FSGGPPSTGTGFGGGAS 2077
Cdd:COG3210 1671 DLTDATLAGLGGATTAA 1687
FhaB COG3210
889-2078 1.92e-29

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 129.12  E-value: 1.92e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  889 ITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG3210    1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  969 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAP 1048
Cdd:COG3210   81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGN 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1049 SISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPST 1128
Cdd:COG3210  161 NTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGV 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1129 STSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSg 1208
Cdd:COG3210  241 ISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAG- 319
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1209 fgstlcSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTG 1288
Cdd:COG3210  320 ------ITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGN 393
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1289 FGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFS 1368
Cdd:COG3210  394 ASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVT 473
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1369 GVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFG 1448
Cdd:COG3210  474 NSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGAS 553
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1449 GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFG 1528
Cdd:COG3210  554 GTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAG 633
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1529 GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFG-----GAINTSANFGGALTNSAGFGGAIST 1603
Cdd:COG3210  634 LTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATggttgTTLNAATGGTLNNAGNTLTISTGSI 713
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1604 SASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNS 1683
Cdd:COG3210  714 TVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAE 793
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1684 ISFGSAPTTSVSFGGSHSTNLcFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG3210  794 ISIDITADGTITAAGTTAINV-TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAA 872
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1764 GLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGG 1843
Cdd:COG3210  873 TAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNA 952
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1844 ELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGS 1923
Cdd:COG3210  953 GLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASAT 1032
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1924 NTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGG 2003
Cdd:COG3210 1033 GTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTS 1112
                       1130      1140      1150      1160      1170      1180      1190
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518 2004 GFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210 1113 TGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADS 1187
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-1958 2.76e-29

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 128.73  E-value: 2.76e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210  513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210  593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSAN---------FSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:COG3210  673 GTTGTVTSGATGGTTGTTLNAATGGtlnnagntlTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSG 752
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1090 GAPSTSTSFSTASISFGGApsTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGS 1169
Cdd:COG3210  753 NAGTLSIGLTANTTASGTT--LTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGL 830
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1170 SVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGF 1249
Cdd:COG3210  831 TGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNG 910
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1250 GGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGF 1329
Cdd:COG3210  911 AVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGV 990
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1330 GGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGG 1409
Cdd:COG3210  991 IAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTT 1070
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1410 VLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGamNTSASFGG 1489
Cdd:COG3210 1071 GGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAA--GAGTLTGL 1148
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1490 ALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGG 1569
Cdd:COG3210 1149 VAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTG 1228
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1570 VLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGG 1649
Cdd:COG3210 1229 NTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAG 1308
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1650 AISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGS 1729
Cdd:COG3210 1309 ANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAG 1388
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1730 NSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTST 1809
Cdd:COG3210 1389 NNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGA 1468
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1810 DFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGA 1889
Cdd:COG3210 1469 GGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGA 1548
                       1050      1060      1070      1080      1090      1100
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1890 GFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210 1549 VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSLAEGTNAEYGGTTNVTS 1617
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
957-2081 2.72e-27

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 122.18  E-value: 2.72e-27
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG3210    1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSApfcNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLS 1116
Cdd:COG3210   81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAAS---ATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSG 157
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1117 TASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPC 1196
Cdd:COG3210  158 AGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTG 237
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1197 TTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTS 1276
Cdd:COG3210  238 AGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTA 317
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1277 ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSS------GANAGFGGTLNSSTSFGGAISTSTGFG 1350
Cdd:COG3210  318 AGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTgtgnggGLTTAGAGTVASTVGTATASTGNASST 397
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGV 1430
Cdd:COG3210  398 TVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAG 477
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1431 LNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGA 1510
Cdd:COG3210  478 NTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTA 557
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1511 LNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGA 1590
Cdd:COG3210  558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1591 LTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFGGT 1670
Cdd:COG3210  638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNN-AGNTLTISTGSITVT 716
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1671 LNTTDFGSTHSNSISFGSAPT-TSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSI 1749
Cdd:COG3210  717 GQIGALANANGDTVTFGNLGTgATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISI 796
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1750 SFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSA 1829
Cdd:COG3210  797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1830 GFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASF 1909
Cdd:COG3210  877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1910 NRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGG 1989
Cdd:COG3210  957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1990 PSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTG 2069
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                       1130
                 ....*....|..
gi 50593518 2070 TGFGGGASSHGG 2081
Cdd:COG3210 1117 TASKVGGTTTVG 1128
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-1958 4.47e-27

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 121.41  E-value: 4.47e-27
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210  558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210  638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTG 717
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSG--TPSTSAPFCNTASISFGGAPSTST 1096
Cdd:COG3210  718 QIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTltLANANGNTSAGATLDNAGAEISID 797
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1097 SFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGA 1176
Cdd:COG3210  798 ITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASI 877
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1177 PTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTS 1256
Cdd:COG3210  878 TVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAA 957
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1257 VCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSS 1336
Cdd:COG3210  958 SASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTA 1037
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1337 TSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAG 1416
Cdd:COG3210 1038 ATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVT 1117
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1417 FGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAG 1496
Cdd:COG3210 1118 ASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTD 1197
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1497 FGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAG 1576
Cdd:COG3210 1198 LKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNA 1277
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1577 FGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPD 1656
Cdd:COG3210 1278 GATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNG 1357
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1657 FGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFS 1736
Cdd:COG3210 1358 ATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGT 1437
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1737 GATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLN 1816
Cdd:COG3210 1438 GNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTA 1517
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1817 HNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLN 1896
Cdd:COG3210 1518 EVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATL 1597
                       1050      1060      1070      1080      1090      1100
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1897 NGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210 1598 TLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSG 1659
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
596-756 2.13e-24

MAGE homology domain; Members of the MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of male germ cells, placenta, and possibly cells of the developing embryo. The cellular function of this family is unknown. The family also contains the yeast protein Nse3, which is part of the Smc5-6 complex and has been shown to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 103.12  E-value: 2.13e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 50593518    720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1434-1791 3.72e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 3.72e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849  290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849  441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 50593518  1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849  504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1494-1832 4.41e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 4.41e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849  376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849  432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 50593518  1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849  509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1454-1821 9.74e-18

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.45  E-value: 9.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849  290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849  364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849  440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 50593518  1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849  513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1592-1958 7.29e-17

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 87.37  E-value: 7.29e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849  296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849  361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849  434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 50593518  1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849  500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
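Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how those fields could be pulled out of such a line. It assumes the exact layout seen in this listing; the names `SUMMARY_RE` and `parse_summary` are hypothetical and not part of any NCBI tool.

```python
import re

# Hypothetical pattern: the field order and labels are assumed from the
# summary lines in this listing, not taken from an NCBI specification.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),          # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 87.37  E-value: 7.29e-17"
    print(parse_summary(line))
```

Collecting these dictionaries for every hit would make it straightforward to sort or filter the list by E-value or bit score.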
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1574-1873 1.01e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 86.98  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849  298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518  1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849  450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1222-1547 1.41e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 83.13  E-value: 1.41e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849  252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849  318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849  389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849  455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
                         330
                  ....*....|..
gi 50593518  1536 GFGGAISTSASF 1547
Cdd:NF033849  535 GLGPSISLGKSY 546
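In the alignment rows above, each line gives a start coordinate, the aligned text, and an end coordinate, with `-` marking gap columns. A small sketch of a consistency check follows: the end coordinate should equal the start plus the number of non-gap residues minus one. The function names are hypothetical, and the row layout (whitespace-separated label, start, aligned text, end) is assumed from this listing rather than from a format specification.

```python
def parse_alignment_row(line):
    """Split a row into (label, start, aligned_text, end).

    Assumes the last three whitespace-separated fields are the start
    coordinate, the aligned text, and the end coordinate.
    """
    parts = line.split()
    start, aligned, end = int(parts[-3]), parts[-2], int(parts[-1])
    label = " ".join(parts[:-3])
    return label, start, aligned, end


def residue_count(aligned):
    """Count residues, ignoring '-' gap columns (case is irrelevant here)."""
    return sum(1 for ch in aligned if ch != "-")


def row_is_consistent(line):
    """True if end == start + residues - 1 for this row."""
    _, start, aligned, end = parse_alignment_row(line)
    return start + residue_count(aligned) - 1 == end


if __name__ == "__main__":
    row = "gi 50593518  1536 GFGGAISTSASF 1547"
    print(parse_alignment_row(row)[0], row_is_consistent(row))
```

Running such a check over every row is one way to catch lines that were truncated or shifted when the page was copied as text.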
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1221-1763 4.05e-14

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 78.27  E-value: 4.05e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFG 1300
Cdd:COG5295   64 AAATAGAGSGGTSATAASSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAA 143
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1301 ASSSTSSDFGGTLSTSVSFGGSSGANAGFGGT-LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGG 1379
Cdd:COG5295  144 STGGSSAAGGSNTATATGSSTANAATAAAGATsTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGV 223
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1380 AINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGG 1459
Cdd:COG5295  224 NAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGA 303
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGG 1539
Cdd:COG5295  304 ANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATA 383
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1540 AISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGG 1619
Cdd:COG5295  384 AGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAA 463
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1620 AISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGS 1699
Cdd:COG5295  464 NVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAG 543
                        490       500       510       520       530       540
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 50593518 1700 HSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG5295  544 GGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGG 607
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
1468-1879 4.41e-14

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 78.06  E-value: 4.41e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASF 1547
Cdd:COG3468    1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1548 GGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSagfgGAISTSASF 1627
Cdd:COG3468   81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGT----GVGGTGAAA 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1628 GGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFG 1707
Cdd:COG3468  157 AGGGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1708 GAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGP 1787
Cdd:COG3468  237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGG 316
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1788 SASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVT 1867
Cdd:COG3468  317 GGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLT 396
                        410
                 ....*....|..
gi 50593518 1868 SDGFAGNLGTNT 1879
Cdd:COG3468  397 TGGTGNNGGGGV 408
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1221-1799 1.45e-13

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 76.35  E-value: 1.45e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGA--GFGGTLSTSISFGGSPSTNTGFGGTLSTSVS 1298
Cdd:COG5295   19 SGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAatAGAGSGGTSATAASSVASGGASAATAASTGT 98
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1299 FGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFG 1378
Cdd:COG5295   99 GNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANAATAAAGATSTS 178
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1379 GAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFG 1458
Cdd:COG5295  179 ASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSG 258
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1459 GAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFG 1538
Cdd:COG5295  259 SAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGA 338
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1539 GAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFG 1618
Cdd:COG5295  339 SAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAA 418
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1619 GAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGG 1698
Cdd:COG5295  419 AGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAA 498
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1699 SHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS 1778
Cdd:COG5295  499 AGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVAS 578
                        570       580
                 ....*....|....*....|.
gi 50593518 1779 TGFGGSLGPSASFNGGLGTST 1799
Cdd:COG5295  579 GANSVSVGAAGAENVAAGATD 599
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1215-1723 7.67e-13

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 74.04  E-value: 7.67e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5295   88 ASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANA 167
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1295 TSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5295  168 ATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGN 247
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1375 ASFGGAINTSAGFGSTLNSS----ASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGA 1450
Cdd:COG5295  248 ATTASASSVSGSAVAAGTAStattASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAG 327
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1451 LNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGA 1530
Cdd:COG5295  328 GSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASA 407
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1531 LNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGA 1610
Cdd:COG5295  408 GGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAI 487
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1611 LNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGS--------THSN 1682
Cdd:COG5295  488 AGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSvavgnntaTGAN 567
                        490       500       510       520
                 ....*....|....*....|....*....|....*....|....*
gi 50593518 1683 SISFGSAPTT----SVSFGGSHSTNLCFGGAPSTSLCFGSASNTN 1723
Cdd:COG5295  568 SVALGAGSVAsganSVSVGAAGAENVAAGATDTDAVNGGGAVATG 612
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1351-1942 1.43e-12

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 73.27  E-value: 1.43e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGST-----LNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG5295    1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGsaatsSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG5295   81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG5295  161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1586 NfgGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSV 1665
Cdd:COG5295  241 A--GAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGG 318
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1666 GFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNE 1745
Cdd:COG5295  319 GAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGS 398
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1746 GHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLnhnadfNGGL 1825
Cdd:COG5295  399 GGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAA------TTAA 472
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1826 GNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGP 1905
Cdd:COG5295  473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552
                        570       580       590
                 ....*....|....*....|....*....|....*..
gi 50593518 1906 NASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNG 1942
Cdd:COG5295  553 TNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAG 589
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1445-1952 1.66e-12

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 72.89  E-value: 1.66e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1445 ATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTN 1524
Cdd:COG4625    1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1525 ATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTS 1604
Cdd:COG4625   81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1605 ASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSI 1684
Cdd:COG4625  161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1685 SFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANfneghsISFGNGLSTSAGFGNG 1764
Cdd:COG4625  241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGG------GGGGGGGGGGGGGGGG 314
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1765 LGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGE 1844
Cdd:COG4625  315 GGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGG 394
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1845 LGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSN 1924
Cdd:COG4625  395 GAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAG 474
                        490       500
                 ....*....|....*....|....*...
gi 50593518 1925 TSNGFTGEPNTGSSFSNGPSSIVGFSGG 1952
Cdd:COG4625  475 TLTLTGNNTYTGTTTVNGGGNYTQSAGS 502
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1106-1596 3.44e-12

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 72.12  E-value: 3.44e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1106 GGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:COG4625   11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1186 GPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:COG4625   91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1266 GAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAIST 1345
Cdd:COG4625  171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1346 STGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG4625  251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG4625  331 GGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGG 410
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG4625  411 GGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTV 490
                        490
                 ....*....|.
gi 50593518 1586 NFGGALTNSAG 1596
Cdd:COG4625  491 NGGGNYTQSAG 501
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1339-1647 3.50e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.96  E-value: 3.50e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849  231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849  301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849  381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849  459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*...
gi 50593518  1640 AISTNASF 1647
Cdd:NF033849  539 SISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1356-1702 7.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.19  E-value: 7.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849  220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849  374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849  443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 50593518  1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849  515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1736-2064 2.07e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849  216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849  294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849  444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
                         330
                  ....*....|..
gi 50593518  2053 GGLNTSAGFSGG 2064
Cdd:NF033849  524 GGRTSGAGGSMG 535
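
Each hit in this listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The sketch below shows one way such a line could be split into typed fields for downstream filtering or sorting. It is a minimal illustration that assumes the plain-text layout seen on this page; the function name `parse_hit_summary` and the dictionary keys are hypothetical and are not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
# The bracketed hit type (e.g. "[Multi-domain]") is treated as optional,
# which is an assumption about other pages, not something shown here.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    example = "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10"
    print(parse_hit_summary(example))
```

Parsing the E-value as a float lets hits be ranked numerically instead of by string comparison.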
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. MscS proteins are heptamers of subunits with three transmembrane helices each, with seven converging M3 domains, and this MscS_porin domain lies toward the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 50593518    428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
863-1511 5.69e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 61.33  E-value: 5.69e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG4625   28 AGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGG 107
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG4625  108 GGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 187
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG4625  188 GGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGG 267
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG4625  268 GGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGG 347
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG4625  348 GGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGT 427
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1263 PYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGA 1342
Cdd:COG4625  428 GAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGSTLAVE 507
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1343 ISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALN 1422
Cdd:COG4625  508 VDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTTYTILAVAAALDALAGNGDLSALYNALAALDAAAARAALDQLSGEIH 587
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1423 TNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALnnsAGFGGAIS 1502
Cdd:COG4625  588 ASAAAALLQASRALRDALSNRLRALRGAGAAGDAAAEGWGVWAQGFGSWGDQDGDGGAAGYDSSTGGLL---VGADYRLG 664

                 ....*....
gi 50593518 1503 TNATFGGAL 1511
Cdd:COG4625  665 DNWRLGVAL 673
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
917-1432 7.53e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 60.95  E-value: 7.53e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  917 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 996
Cdd:COG4625    1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625   81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625  161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625  241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625  321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGS-TLNSSA 1395
Cdd:COG4625  401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAgTLTLTG 480
                        490       500       510
                 ....*....|....*....|....*....|....*..
gi 50593518 1396 SFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:COG4625  481 NNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
997-1512 9.96e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 60.56  E-value: 9.96e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625    2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625   82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625  162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625  242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG4625  322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG4625  402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGN 481
                        490       500       510
                 ....*....|....*....|....*....|....*.
gi 50593518 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALN 1512
Cdd:COG4625  482 NTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1318-1785 1.30e-08

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 60.44  E-value: 1.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176   72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176  149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176  225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176  300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176  374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518  1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176  451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1559-2060 1.77e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.79  E-value: 1.77e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1559 GAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFG 1638
Cdd:COG4625    1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1639 GAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGS 1718
Cdd:COG4625   81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1719 ASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTS 1798
Cdd:COG4625  161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1799 TGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTN 1878
Cdd:COG4625  241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1879 TGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG4625  321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1959 FCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTA 2038
Cdd:COG4625  401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                        490       500
                 ....*....|....*....|..
gi 50593518 2039 AGFGSGLSTSTGfGGGLNTSAG 2060
Cdd:COG4625  481 NNTYTGTTTVNG-GGNYTQSAG 501
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
999-1646 2.68e-08

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 59.40  E-value: 2.68e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  999 ISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSA 1078
Cdd:COG5295    1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1079 PFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTAsisfGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGG 1158
Cdd:COG5295   81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNA----GASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNT 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1159 APSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTV 1238
Cdd:COG5295  157 ATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAG 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1239 FSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSsdfGGTLSTSVS 1318
Cdd:COG5295  237 GSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAA---NATAGGGNA 313
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1319 FGGSSGANAG--FGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG5295  314 GSGGGGAAALgsAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGA 393
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG5295  394 GSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAAS 473
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5295  474 AAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGT 553
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAG 1636
Cdd:COG5295  554 NSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALG 633
                        650
                 ....*....|
gi 50593518 1637 FGGAISTNAS 1646
Cdd:COG5295  634 AGATATANNS 643
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1566-2071 3.73e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.02  E-value: 3.73e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1566 SFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNA 1645
Cdd:COG4625    1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1646 SFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLC 1725
Cdd:COG4625   81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1726 FGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGL 1805
Cdd:COG4625  161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1806 GTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTL 1885
Cdd:COG4625  241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1886 GTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGfcsgpST 1965
Cdd:COG4625  321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGG-----GG 395
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1966 GGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGL 2045
Cdd:COG4625  396 AGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGT 475
                        490       500
                 ....*....|....*....|....*.
gi 50593518 2046 STSTGFGGGLNTSAGFSGGPPSTGTG 2071
Cdd:COG4625  476 LTLTGNNTYTGTTTVNGGGNYTQSAG 501
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
262-468 5.43e-08

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 57.22  E-value: 5.43e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372   26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372  106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 50593518  422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372  177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
268-522 9.60e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 56.76  E-value: 9.60e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTE 347
Cdd:COG3883   17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  348 VSSRQI-------EASN-----RQIGASNRQTEASNRQIGA-SNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG3883   97 RSGGSVsyldvllGSESfsdflDRLSALSKIADADADLLEElKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQ 176
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARK 494
Cdd:COG3883  177 QAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAAG 256
                        250       260
                 ....*....|....*....|....*...
gi 50593518  495 AANKNRATESQAQIAEQGAQASEASISA 522
Cdd:COG3883  257 AAAGSAGAAGAAAGAAGAGAAAASAAGG 284
growth_prot_Scy NF041483
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
255-525 1.52e-07

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 56.06  E-value: 1.52e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372   40 LDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE 119
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  335 SNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNRQIGASNRQTDASNRQTDASNRQTEassR 414
Cdd:COG4372  120 LQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQL--ESLQEELAALEQELQALSEAEAEQALDELLKEAN---R 194
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVT-NHALSVTVRIRRGSRAR 493
Cdd:COG4372  195 NAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVIlKEIEELELAILVEKDTE 274
                        250       260       270
                 ....*....|....*....|....*....|..
gi 50593518  494 KAANKNRATESQAQIAEQGAQASEASISALET 525
Cdd:COG4372  275 EEELEIAALELEALEEAALELKLLALLLNLAA 306
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
264-442 2.40e-07

Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 2.40e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASN 343
Cdd:COG4942   17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  344 RQTEvssRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDA-SNRQTEASSRQTEASSRQ 422
Cdd:COG4942   97 AELE---AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEElRADLAELAALRAELEAER 173
                        170       180
                 ....*....|....*....|
gi 50593518  423 TEASSRQTEASSRQIEASAA 442
Cdd:COG4942  174 AELEALLAELEEERAALEAL 193
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
251-459 2.89e-07

Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 2.89e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  251 ASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNR 330
Cdd:COG4942   18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  331 QIEASNRQIGASNRQTEVSSRQ-------------------------IEASNRQIGASNRQTEASNRQIGASNRQTEASN 385
Cdd:COG4942   98 ELEAQKEELAELLRALYRLGRQpplalllspedfldavrrlqylkylAPARREQAEELRADLAELAALRAELEAERAELE 177
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518  386 RQIgASNRQTDASNRQTDASNRQTEASSRQTEASSRQT----EASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGS 459
Cdd:COG4942  178 ALL-AELEEERAALEALKAERQKLLARLEKELAELAAElaelQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
250-442 3.71e-07

Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 55.03  E-value: 3.71e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIgasnRQIMASNRQIG--- 326
Cdd:COG0840  292 ETAAAMEELSATVQEVAENAQQAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAETI----EELGESSQEIGeiv 367
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  327 -------------ASNRQIEAS------------------------------NRQIGASNRQTEVSSRQIEASNRQIGAS 363
Cdd:COG0840  368 dviddiaeqtnllALNAAIEAArageagrgfavvadevrklaersaeatkeiEELIEEIQSETEEAVEAMEEGSEEVEEG 447
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  364 NRQTEASN---RQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEAS 440
Cdd:COG0840  448 VELVEEAGealEEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527

                 ..
gi 50593518  441 AA 442
Cdd:COG0840  528 VS 529
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
254-524 6.90e-07

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 53.75  E-value: 6.90e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  254 AIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIE 333
Cdd:COG4372   25 LIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELE 104
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  334 ASNRQigASNRQTEVSsrQIEASNRQIGASNRQTEASNRQIGAS--NRQTEASNRQIGASNRQTDASNRQTDASNRQTEA 411
Cdd:COG4372  105 SLQEE--AEELQEELE--ELQKERQDLEQQRKQLEAQIAELQSEiaEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  412 SSRQTEASSRQTEASSRQTEASSRQIEA-------SAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTV 484
Cdd:COG4372  181 AEQALDELLKEANRNAEKEEELAEAEKLieslpreLAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|
gi 50593518  485 RIRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:COG4372  261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLAL 300
PTZ00121 PTZ00121
MAEBL; Provisional
250-524 7.08e-07

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 7.08e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 50593518   472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
1492-2053 1.03e-06

Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 1.03e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1492 NNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVL 1571
Cdd:COG5295    2 ASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAAS 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1572 NNSAGFGGAINTSANFGGA-LTNSAGFGGAISTSASfggalnNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGA 1650
Cdd:COG5295   82 SVASGGASAATAASTGTGNtAGTAATVAGAASSGSA------TNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSN 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1651 ISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfGGSN 1730
Cdd:COG5295  156 TATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAA---TGSA 232
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1731 STNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTD 1810
Cdd:COG5295  233 ASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGN 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1811 FSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAG 1890
Cdd:COG5295  313 AGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAG 392
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1891 FSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGG 1970
Cdd:COG5295  393 AGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAA 472
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1971 GPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTG 2050
Cdd:COG5295  473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552

                 ...
gi 50593518 2051 FGG 2053
Cdd:COG5295  553 TNS 555
YhjY COG5571
Uncharacterized conserved protein YhjY, contains autotransporter beta-barrel domain [General function prediction only];
1227-1654 1.41e-06

Pssm-ID: 444313 [Multi-domain]  Cd Length: 648  Bit Score: 53.34  E-value: 1.41e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1227 TSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTS 1306
Cdd:COG5571    5 SAAGSLGYLASASSNAATAPGLAAATASAAGAAGLGAASTASSLSGASLALLAAQALGAGLSGTNGFSGGAGSSSGTGPT 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1307 SDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAG 1386
Cdd:COG5571   85 ANGGLAGAGGVDLAGAGGGGGASGLAGGAGGAGGTAAAGGAAAAGGGAAGNAATAAAAAAAGTALQLSGLTTAGAVGGVA 164
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1387 FGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN 1466
Cdd:COG5571  165 GTAALNGATANTGLGAAAALAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAAAVLASPAPAAGGAAAAAAGAAAAAAS 244
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1467 FGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSAS 1546
Cdd:COG5571  245 AAANAATQANLLLLALALGSNGNAVGLNAVGLANEAAAPGAVGGDAGSTGATPSTLSSASCVASSLTAANANTLYAAADT 324
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1547 FGGTLNNSASFGGAINTSASFGGVlnnsAGFGGAINTSANFGGALTNSAGFGGAiSTSASFGGALNNSAGFGGAISTSAS 1626
Cdd:COG5571  325 AGPAGATAALAAAAAAVLASAAAV----AQAALALAAAGGQARSLAVAAGQGRG-ARGGQTRGGGGAGGTTGGGVGAGGG 399
                        410       420
                 ....*....|....*....|....*...
gi 50593518 1627 FGGALNNSAGFGGAISTNASFGGAISNS 1654
Cdd:COG5571  400 DGDGPNLTLGVDYRLSDNLLLGAALSYG 427
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
909-1546 1.60e-06

Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 53.24  E-value: 1.60e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  909 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 988
Cdd:COG5295    1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  989 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANS 1068
Cdd:COG5295   81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1069 SFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTS 1148
Cdd:COG5295  161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1149 LSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTS 1228
Cdd:COG5295  241 AGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGA 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1229 FGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSIS-FGGSPSTNTGFGGTLSTSVSFGASSSTSS 1307
Cdd:COG5295  321 AALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGaAATSSSGGSATAAGNAAGAAGAGSAGSGG 400
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1308 DFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGF 1387
Cdd:COG5295  401 SSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAA 480
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1388 GSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNF 1467
Cdd:COG5295  481 ATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGN 560
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFG----GAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAIST 1543
Cdd:COG5295  561 NTATGANSVALGAGSVASGANSVSVGAAGAEnvaaGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALGAGATATA 640

                 ...
gi 50593518 1544 SAS 1546
Cdd:COG5295  641 NNS 643
PPE COG5651
PPE-repeat protein [Function unknown];
1399-1630 1.95e-06

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 52.59  E-value: 1.95e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1399 SALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFG--GAMNTNATFGGALNSNAGFGGAISTSTNFGGAlnnsag 1476
Cdd:COG5651  159 AAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAnlGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFA------ 232
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1477 fGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5651  233 -GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLG 311
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 50593518 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGA 1630
Cdd:COG5651  312 AGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. MscS proteins are heptamers of three-transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies toward the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
250-436 1.99e-06

Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 51.15  E-value: 1.99e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:pfam12795   48 DAPAELRELRQELAALQAKAEAAPKEILASLSLEELEQRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQ 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNR-QIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQ 408
Cdd:pfam12795  128 RLQQIRNRLNGPAPPGEPLSEAQRWALQAELAALKAQIDMLEQeLLSNNNRQDLLKARRDLLTLRIQRLEQQLQALQELL 207
                          170       180
                   ....*....|....*....|....*...
gi 50593518    409 TEasSRQTEAssRQTEASSRQTEASSRQ 436
Cdd:pfam12795  208 NE--KRLQEA--EQAVAQTEQLAEEAAG 231
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
268-524 3.37e-06

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 52.63  E-value: 3.37e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEAsnrqigASNRQTE 347
Cdd:COG1196  219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE------AQAEEYE 292
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  348 VSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASS 427
Cdd:COG1196  293 LLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEA 372
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  428 RQTEASSRQIEASAAAVRPKKprgkkgnnkgsnsasepsEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQ 507
Cdd:COG1196  373 ELAEAEEELEELAEELLEALR------------------AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE 434
                        250
                 ....*....|....*..
gi 50593518  508 IAEQGAQASEASISALE 524
Cdd:COG1196  435 EEEEEEEEALEEAAEEE 451
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
250-455 1.14e-05

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 49.90  E-value: 1.14e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:COG4372   56 QAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLE 135
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQ--IGASNRQTEASNRQIGASNRQTDASNRQTDASNR 407
Cdd:COG4372  136 AQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAeaEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 50593518  408 QTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGN 455
Cdd:COG4372  216 LAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEEL 263
COG4935 COG4935
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones];
1216-1664 1.31e-05

Pssm-ID: 443962 [Multi-domain]  Cd Length: 641  Bit Score: 50.20  E-value: 1.31e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1216 TNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLST 1295
Cdd:COG4935   96 GVVAVAGAGLAATASGAAAGAVAAAANGNTGAGPGSGGTGGGSGGAGAAAAAAALSAAGAAVGVAAVAGAAGGGGGVGVA 175
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1296 SVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSA 1375
Cdd:COG4935  176 AAVGVVLGAGLVADGGNGGGGAVAGGAAGGGGGGGGGGGLGGAAGGGGAGLAAAGGGGGGAAAAAAAGVGGLGAAATAAA 255
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1376 SFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNA 1455
Cdd:COG4935  256 ADGGGGGGAGAAGAGGSAGAAAGGAGAGVVGAAAGGGDAALGGAVGAAGTGNAAAAAAASAGSGGGGGSAAAAGAAAAAA 335
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1456 GFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA 1535
Cdd:COG4935  336 AAAAGAAAGVSGAASVVAGASGGGAGTAAAAGGGAAAAAAGGAAAAGAAAGAAAGAAAGAAAAGGVASAAGAVGAGTAAG 415
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1536 GFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSA 1615
Cdd:COG4935  416 ASATAAVSTGAASGSSTTSSTGTTATATGLGGGADAGSTSTGTGSAAGAAGGTTTATSGLASSTTAAAAAAAAGLATTAA 495
                        410       420       430       440       450
                 ....*....|....*....|....*....|....*....|....*....|..
gi 50593518 1616 GFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNS---PDFGGAFSTS 1664
Cdd:COG4935  496 VAAGAAGAAAAAATAASVGGATGAAGTTNSTATFSNTTDvaiPDNGPAGVTS 547
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
1028-1427 1.66e-05

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.39  E-value: 1.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849  217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849  284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849  310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849  386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849  463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542

                  ....
gi 50593518  1424 NATF 1427
Cdd:NF033849  543 GKSY 546
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
162-380 3.49e-05

Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 48.86  E-value: 3.49e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  162 NESANSLASTAVNKSKKASTANNAANKTVPSAAEISlasAATHTVTTQGQAAKETgSIQTIAATARSKKN-SKGKRTPAK 240
Cdd:COG0840  266 ASASEELAASAEELAAGAEEQAASLEETAAAMEELS---ATVQEVAENAQQAAEL-AEEASELAEEGGEVvEEAVEGIEE 341
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  241 TTNTDNEYVEASNAIEASSRQIG-------------------AS---------GR--------------QTEASNRQIEA 278
Cdd:COG0840  342 IRESVEETAETIEELGESSQEIGeivdviddiaeqtnllalnAAieaarageaGRgfavvadevrklaeRSAEATKEIEE 421
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  279 ssrQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQImasnRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNR 358
Cdd:COG0840  422 ---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAL----EEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIE 494
                        250       260
                 ....*....|....*....|..
gi 50593518  359 QIGASNRQTEASNRQIGASNRQ 380
Cdd:COG0840  495 QIAAAAQENAASVEEVAAAAEE 516
PPE COG5651
PPE-repeat protein [Function unknown];
1270-1470 4.05e-05

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.35  E-value: 4.05e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1270 GGTLSTSISFGGSPSTNTGFGGTLSTS---VSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTS 1346
Cdd:COG5651  178 GGLLGAQNAGSGNTSSNPGFANLGLTGlnqVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASA 257
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1347 TGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAT 1426
Cdd:COG5651  258 ALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAA 337
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....
gi 50593518 1427 FGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA 1470
Cdd:COG5651  338 GAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGG 381
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
206-525 7.28e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.13  E-value: 7.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168  648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168  708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168  855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924

                   ....*
gi 50593518    521 SALET 525
Cdd:TIGR02168  925 AQLEL 929
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
241-445 1.89e-04

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 46.55  E-value: 1.89e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  241 TTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMA 320
Cdd:COG0840  230 DVDSKDEIGQLADAFNRMIENLRELVGQVRESAEQVASASEELAASAEELAAGAEEQAASLEETAAAMEELSATVQEVAE 309
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  321 S------------------NRQIGASNRQIEASNRQIGASNR------------------------QT------------ 346
Cdd:COG0840  310 NaqqaaelaeeaselaeegGEVVEEAVEGIEEIRESVEETAEtieelgessqeigeivdviddiaeQTnllalnaaieaa 389
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  347 --------------EV---------SSRQIEAsnrQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQ----TDASN 399
Cdd:COG0840  390 rageagrgfavvadEVrklaersaeATKEIEE---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAleeiVEAVE 466
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 50593518  400 RQTDASNRQTEASSRQTEASS------RQTEASSRQTEASSRQIEASAAAVR 445
Cdd:COG0840  467 EVSDLIQEIAAASEEQSAGTEevnqaiEQIAAAAQENAASVEEVAAAAEELA 518
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
245-519 2.58e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.21  E-value: 2.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    245 DNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTE--TSNRQIgasnrqimASN 322
Cdd:TIGR02169  222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlGEEEQL--------RVK 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    323 RQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEA-SNRQIGASNRQTDASNR- 400
Cdd:TIGR02169  294 EKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKlTEEYAELKEELEDLRAEl 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    401 -QTDASNRQT--EASSRQTEASSRQTEASSRQTEAS-----SRQIEASAAAVRPKKPRGKKGNNKgsnSASEPSEAPPAI 472
Cdd:TIGR02169  374 eEVDKEFAETrdELKDYREKLEKLKREINELKRELDrlqeeLQRLSEELADLNAAIAGIEAKINE---LEEEKEDKALEI 450
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 50593518    473 QTVTNHaLSVTVRIRRGSRARKAANKN-------RATESQAQIAEQGAQASEAS 519
Cdd:TIGR02169  451 KKQEWK-LEQLAADLSKYEQELYDLKEeydrvekELSKLQRELAEAEAQARASE 503
YjbI COG1357
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
1453-1627 2.65e-04

Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];


Pssm-ID: 440968 [Multi-domain]  Cd Length: 178  Bit Score: 43.78  E-value: 2.65e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1453 SNAGFGGAISTSTNFGGAlNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAln 1532
Cdd:COG1357    8 SGADLSGADLSGADLSGA-NLSGALSGANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADLSGA-- 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1533 NSAGFGGAISTSASFGGtlnnsASFGGAINTSASFGGvlnnsAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALN 1612
Cdd:COG1357   85 NLANLSGANLSGANLSG-----ANLRGANLSGANLSG-----ADLSGADLSGANLSGADLSGANLSGANLSGADLSGADL 154
                        170
                 ....*....|....*
gi 50593518 1613 NSAGFGGAISTSASF 1627
Cdd:COG1357  155 SGANLSGANLSGANL 169
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
319-404 2.66e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 2.66e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  319 MASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDAS 398
Cdd:COG4942   16 AAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95

                 ....*.
gi 50593518  399 NRQTDA 404
Cdd:COG4942   96 RAELEA 101
COG5412 COG5412
Phage-related protein [Mobilome: prophages, transposons];
1215-1643 2.79e-04

Phage-related protein [Mobilome: prophages, transposons];


Pssm-ID: 444167 [Multi-domain]  Cd Length: 704  Bit Score: 46.23  E-value: 2.79e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGfGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5412    7 SAKEAASAALLLAQAKAADSELTAASGGVVSAAA-KAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1295 TSVSFGASSSTSSDFGGTLstsvsfGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5412   86 LLAAGGARAKGSAAAAAAL------GAVAAAAKVLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGL 159
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1375 ASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAtfggvlngsAGFGGAMNTNATFGGALNSN 1454
Cdd:COG5412  160 AAAGAAAAASALAAAGAIAKAILSASKLSGQALAGQSAAAGGALEAAAAAA---------AGAAAAGAAAAAATAASALL 230
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1455 AGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALnnsagfgGAISTNATFGGALNNS 1534
Cdd:COG5412  231 ALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGLVAGAEASGGTAGGAV-------AGLAAGLAAAAGASAN 303
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1535 AGFGGAISTSASFGGTLNNSAsfGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNS 1614
Cdd:COG5412  304 LGAAAAASFGASLAASAGVDT--AAAALAAAEAIADGSLVAGLGSAGTVLSTLSGAVGGLEGAIGQLGAAGGLGSALGGL 381
                        410       420       430
                 ....*....|....*....|....*....|....
gi 50593518 1615 AGFGGAISTS-----ASFGGALNNSAGFGGAIST 1643
Cdd:COG5412  382 TGPIGIVIAAiaaliAAFVALWKNSETFRNLVQG 415
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
869-1089 2.79e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849  382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518  1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849  446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1239-1459 2.86e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518   1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967  160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
COG5412 COG5412
Phage-related protein [Mobilome: prophages, transposons];
1318-1666 2.91e-04

Phage-related protein [Mobilome: prophages, transposons];


Pssm-ID: 444167 [Multi-domain]  Cd Length: 704  Bit Score: 45.84  E-value: 2.91e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1318 SFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSgvLNSSASFGGAINTSAGFGSTLNSSASF 1397
Cdd:COG5412   35 VVSAAAKAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL--LLAAGGARAKGSAAAAAALGAVAAAAK 112
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1398 GSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGF 1477
Cdd:COG5412  113 VLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGLAAAGAAAAASALAAAGAIAKAILSASKLSGQAL 192
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1478 GGAMNTSASFGGALN---NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNS 1554
Cdd:COG5412  193 AGQSAAAGGALEAAAaaaAGAAAAGAAAAAATAASALLALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGL 272
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1555 AsFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNS 1634
Cdd:COG5412  273 V-AGAEASGGTAGGAVAGLAAGLAAAAGASANLGAAAAASFGASLAASAGVDTAAAALAAAEAIADGSLVAGLGSAGTVL 351
                        330       340       350
                 ....*....|....*....|....*....|..
gi 50593518 1635 AGFGGAISTNASFGGAISNSPDFGGAFSTSVG 1666
Cdd:COG5412  352 STLSGAVGGLEGAIGQLGAAGGLGSALGGLTG 383
PRK09039 PRK09039
peptidoglycan-binding protein;
259-380 3.34e-04

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 45.34  E-value: 3.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   259 SRQIgaSGRQTE--ASNRQIEASSRQ---TEASNRQTEASSRQTEASSRQTETSNRQIGASN----RQIMASNRQIGASN 329
Cdd:PRK09039   45 SREI--SGKDSAldRLNSQIAELADLlslERQGNQDLQDSVANLRASLSAAEAERSRLQALLaelaGAGAAAEGRAGELA 122
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 50593518   330 RQIeASNRQIGA-SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQ 380
Cdd:PRK09039  123 QEL-DSEKQVSArALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAK 173
PHA02515 PHA02515
hypothetical protein; Provisional
1294-1505 3.67e-04

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 45.54  E-value: 3.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515  175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515  252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518  1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515  330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
272-490 3.75e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 3.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609   33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609  113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518   429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609  193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
PPE COG5651
PPE-repeat protein [Function unknown];
889-1092 4.31e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.88  E-value: 4.31e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  889 ITNPSGGFGGiSNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG5651  174 ITNPGGLLGA-QNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  969 ISNPSGGFGGISNP--SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSS 1046
Cdd:COG5651  253 AGASAALASLAATLlnASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*.
gi 50593518 1047 APSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAP 1092
Cdd:COG5651  333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAA 378
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1373-1606 7.56e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 7.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1373 SSASFGGAINTSAGFGSTLnssaSFGSALSTSA-SFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGAL 1451
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGF----SFGAAAASNPgSTGGFSFGTLGAAPAATATTTTATLGLGGGLFGQKPATGFTFGTPA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   1452 NSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSAsfGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAL 1531
Cdd:pfam15967   78 SSTAATGPTGLTLGTPAATTAASTGFSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 50593518   1532 NNSAgfggAISTSASFGGTLNnsaSFGGAINTSASFGGVLNNSAGFGGAINTSANFgGALTNSAGFGGAISTSAS 1606
Cdd:pfam15967  156 ATTT----AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGGLDFSTSS 222
PRK09039 PRK09039
peptidoglycan-binding protein;
250-360 7.84e-04

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 43.80  E-value: 7.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   250 EASNAIEASSRQIGASGRQTEASNRQIEASsrQTEASNRQTEASSRQTE-ASSRQTEtsnRQIGASnrqimaSNRQIGAS 328
Cdd:PRK09039   74 QGNQDLQDSVANLRASLSAAEAERSRLQAL--LAELAGAGAAAEGRAGElAQELDSE---KQVSAR------ALAQVELL 142
                          90       100       110
                  ....*....|....*....|....*....|..
gi 50593518   329 NRQIEASNRQIGASNRQTEVSSRQIEASNRQI 360
Cdd:PRK09039  143 NQQIAALRRQLAALEAALDASEKRDRESQAKI 174
Keratin_2_head pfam16208
Keratin type II head;
877-1015 8.50e-04

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 41.95  E-value: 8.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    877 SFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPS-GGFGGISNPSGGFGGISNPSGGFGGISNPSGG 955
Cdd:pfam16208    1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNlGGSKSISISVAGGGSRPGSGFGFGGGGGGGFG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    956 FGGISNPSGGFGGISNPSGGFGGISNPSGGFGGisnpsGGFGGisnpSGGFGGISNPSGG 1015
Cdd:pfam16208   81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGGFGG-----GGFGG----RGGFGGPPCPPGG 131
PPE COG5651
PPE-repeat protein [Function unknown];
881-1079 1.09e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 43.73  E-value: 1.09e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  881 GAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSG----GFGGiSNPSGGFGGISNPSGGFGGISNPSGGF 956
Cdd:COG5651  182 GAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGpgntGFAG-TGAAAGAAAAAAAAAAAAGAGASAALA 260
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG5651  261 SLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|...
gi 50593518 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAP 1079
Cdd:COG5651  341 AGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
176-452 1.52e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 43.35  E-value: 1.52e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  176 SKKASTANNAANKTvpsAAEISLASAATHT-VTTQGQAAKETGSIQTIAATARSKKNskgkRTPAKTTNTDNEYVEASNA 254
Cdd:COG4372    9 GKARLSLFGLRPKT---GILIAALSEQLRKaLFELDKLQEELEQLREELEQAREELE----QLEEELEQARSELEQLEEE 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372   82 LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLES 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  335 SNRQIgaSNRQTEVSSRQIEASNRQIgasNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG4372  162 LQEEL--AALEQELQALSEAEAEQAL---DELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
                        250       260       270
                 ....*....|....*....|....*....|....*...
gi 50593518  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGK 452
Cdd:COG4372  237 ALLDALELEEDKEELLEEVILKEIEELELAILVEKDTE 274
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
260-435 1.58e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 43.75  E-value: 1.58e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  260 RQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQtetsnRQIGASNRQImasnRQIGASNRQIEASNRQI 339
Cdd:COG4913  624 EELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-----REIAELEAEL----ERLDASSDDLAALEEQL 694
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  340 GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQtEAS 419
Cdd:COG4913  695 EELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE-DLARLELRALLEERFAAALGDAVERELRE-NLE 772
                        170
                 ....*....|....*.
gi 50593518  420 SRQTEASSRQTEASSR 435
Cdd:COG4913  773 ERIDALRARLNRAEEE 788
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
162-426 1.68e-03

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 43.47  E-value: 1.68e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  162 NESANSLASTAVNKSKKASTANNAANKTvpsAAEISLASAATHTVTTqgqaaketgSIQTIAATARskknskgkrtpakt 241
Cdd:COG0840  259 RESAEQVASASEELAASAEELAAGAEEQ---AASLEETAAAMEELSA---------TVQEVAENAQ-------------- 312
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  242 tntdneyvEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASN---RQTEASSR--------------QT------- 297
Cdd:COG0840  313 --------QAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAetiEELGESSQeigeivdviddiaeQTnllalna 384
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  298 --EA-----------------------SSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQI---GASNRQTEVS 349
Cdd:COG0840  385 aiEAarageagrgfavvadevrklaerSAEATKEIEELIEEIQSETEEAVEAMEEGSEEVEEGVELVeeaGEALEEIVEA 464
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 50593518  350 SRQIEASNRQIGAsnrqteasnrqigASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEAS 426
Cdd:COG0840  465 VEEVSDLIQEIAA-------------ASEEQSAGTE-EVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
310-543 2.91e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.12  E-value: 2.91e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  310 QIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNR--- 386
Cdd:COG3883   17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEI--EERREELGERara 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  387 -------------------------QIGASNRQTDASNR--------QTDASNRQTEASSRQTEASSRQTEASSRQTEAS 433
Cdd:COG3883   95 lyrsggsvsyldvllgsesfsdfldRLSALSKIADADADlleelkadKAELEAKKAELEAKLAELEALKAELEAAKAELE 174
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  434 SRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGA 513
Cdd:COG3883  175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
                        250       260       270
                 ....*....|....*....|....*....|
gi 50593518  514 QASEASISALETQVAAAVQALADDYLAQLS 543
Cdd:COG3883  255 AGAAAGSAGAAGAAAGAAGAGAAAASAAGG 284
PPE COG5651
PPE-repeat protein [Function unknown];
903-1109 2.99e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 42.19  E-value: 2.99e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  903 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 982
Cdd:COG5651  177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGAS 256
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  983 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSF 1062
Cdd:COG5651  257 AALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAA 336
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 50593518 1063 SGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAP 1109
Cdd:COG5651  337 AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
PPE COG5651
PPE-repeat protein [Function unknown];
867-1049 3.14e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 42.19  E-value: 3.14e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  867 LFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 946
Cdd:COG5651  202 LTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASS 281
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  947 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISN-PSGGFGGISNPSGGFGGISNPSGG 1025
Cdd:COG5651  282 AATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAaGAAAGAGAAAAAAAGGAGGGGGGA 361
                        170       180
                 ....*....|....*....|....
gi 50593518 1026 FGGRNSITFGSVPNTSANFSSAPS 1049
Cdd:COG5651  362 LGAGGGGGSAGAAAGAASGGGAAA 385
PHA00430 PHA00430
tail fiber protein
325-467 3.26e-03

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 42.57  E-value: 3.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518   325 IGASNR-QIEASNRQI----GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASN 399
Cdd:PHA00430  121 IGVNNDgHLDARGRRIvnlaDAVDDGDAVPLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWR 200
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50593518   400 RQTDASNRQTE-----ASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSE 467
Cdd:PHA00430  201 SEADGSNSEANrfkgyADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSA 273
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1522-1713 3.30e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.43  E-value: 3.30e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1522 STNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAI 1601
Cdd:COG3469   13 GGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATST 92
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1602 STSASFGGALNNSAGfggaiSTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHS 1681
Cdd:COG3469   93 SATLVATSTASGANT-----GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
                        170       180       190
                 ....*....|....*....|....*....|..
gi 50593518 1682 NSISFGSAPTTSvSFGGSHSTNLCFGGAPSTS 1713
Cdd:COG3469  168 TTTTTTSASTTP-SATTTATATTASGATTPSA 198
34 PHA02584
long tail fiber, proximal subunit; Provisional
1412-1595 5.23e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 42.05  E-value: 5.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1412 NGSAGFGGALNTNATFggVLNGSAGFGGAMNTNATF------GGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSA 1485
Cdd:PHA02584  908 NGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPINVN 985
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1486 SFGGALNNSAG-FGGAISTNatfGGALNNSAGFGGAISTNATFGGALNNSagfggAISTSASFGGTLNNSASFGGAINTS 1564
Cdd:PHA02584  986 ANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTT-----VFLLTAAGDQTGGFNGLKSLIINNA 1057
                         170       180       190
                  ....*....|....*....|....*....|.
gi 50593518  1565 ASFGGVLNNSAGFGGAINTSanfgGALTNSA 1595
Cdd:PHA02584 1058 NGQVTINDNYIINAGGTIMS----GGLTVNS 1084
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
20-243 5.51e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518     20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109  442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518     97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109  522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 50593518    176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109  599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
YjbI COG1357
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
1503-1660 5.96e-03

Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];


Pssm-ID: 440968 [Multi-domain]  Cd Length: 178  Bit Score: 39.92  E-value: 5.96e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1503 TNATFGGALNNSAGFGGAISTNATFGGALNNsAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAIN 1582
Cdd:COG1357    3 SGADLSGADLSGADLSGADLSGANLSGALSG-ANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADL 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1583 TSAN---FGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGG 1659
Cdd:COG1357   82 SGANlanLSGANLSGANLSGANLRGANLSGANLSGADLSGADLSGANLSGADLSGANLSGANLSGADLSGADLSGANLSG 161

                 .
gi 50593518 1660 A 1660
Cdd:COG1357  162 A 162
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1632-1860 6.99e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 6.99e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118  133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118  207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118  287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
34 PHA02584
long tail fiber, proximal subunit; Provisional
1390-1588 7.27e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 41.66  E-value: 7.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1390 TLNSSASFGSALSTSASFggVLNGSAGFGGALNTNATF------GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAIST 1463
Cdd:PHA02584  906 TVNGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPIN 983
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  1464 STNFGGALNNSAG-FGGAMNTSasfGGALNNSAGFGGAISTNATFGGALNNSAG-----FGGAISTNATFGGALNNSAGF 1537
Cdd:PHA02584  984 VNANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTTVFlltaaGDQTGGFNGLKSLIINNANGQ 1060
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50593518  1538 -----------GGAISTsasfGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFG 1588
Cdd:PHA02584 1061 vtindnyiinaGGTIMS----GGLTVNSRIRSQGTKASYTRAPTADTVGFWSVDINDSATYN 1118
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
223-516 8.05e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.97  E-value: 8.05e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  223 AATARSKKNSKGKRTPAKTTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSR 302
Cdd:COG3883   14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  303 QTETSNRQIGASNRQIMASN-----RQIGASNRQIEASNRQIgasnrqTEVSSRQIEASNRQIGASNRQTEASNRQIGAS 377
Cdd:COG3883   94 ALYRSGGSVSYLDVLLGSESfsdflDRLSALSKIADADADLL------EELKADKAELEAKKAELEAKLAELEALKAELE 167
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518  378 NRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNK 457
Cdd:COG3883  168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAA 247
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 50593518  458 GSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGAQAS 516
Cdd:COG3883  248 GAGAAGAAGAAAGSAGAAGAAAGAAGAGAAAASAAGGGAGGAGGGGGGGGAASGGSGGG 306
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
29-297 8.74e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 8.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518     29 SPDVQSETTekdppiaSRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPmftQISQASATTEAPNI 108
Cdd:pfam17823   83 STEVTAEHT-------PHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALP---SEAFSAPRAAACRA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    109 QASVTSQTQKAKTMRVTPKVSLTGSEDATTQLKPPLQALNLP-----VTTPTIQTPVanesaNSLASTAVNKSKKASTAN 183
Cdd:pfam17823  153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSApttaaSSAPATLTPA-----RGISTAATATGHPAAGTA 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50593518    184 NAANKTVPSAAEISLASAATHTVTTQGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTNTDNeyveasnaieASSRQIG 263
Cdd:pfam17823  228 LAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDT----------MARNPAA 297
                          250       260       270
                   ....*....|....*....|....*....|....
gi 50593518    264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQT 297
Cdd:pfam17823  298 PMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTT 331
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
