NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|221316699|ref|NP_004659|]
View 

maltase-glucoamylase isoform 2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GH31_MGAM_SI_GAA cd06602
maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup ...
368-731 0e+00

maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII).


:

Pssm-ID: 269888  Cd Length: 367  Bit Score: 662.67  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06602     1 GRPAMPPYWSLGFHLCRWGYKNLDELKEVVERYRAAGIPLDVQWNDIDYMDRYRDFTLDPVNFPGLPAFVDDLHANGQHY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNNSSSSkpYGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDGIW 527
Cdd:cd06602    81 VPILDPGISANESGG--YPPYDRGLEMDVFIKNDDG-SPYVGKVWPGYTVFPDFTNPNTQEWWTEEIKDFHDQVPFDGLW 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  528 IDMNEVSNFVDGSVS------GCSTNNLNNPPFTPRIL-DGYLFCKTLCMDAVQH-WGKQYDIHNLYGYSMAVATAEAAK 599
Cdd:cd06602   158 IDMNEPSNFCTGSCGnspnapGCPDNKLNNPPYVPNNLgGGSLSDKTICMDAVHYdGGLHYDVHNLYGLSEAIATYKALK 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  600 TVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAF 679
Cdd:cd06602   238 EIFPGKRPFIISRSTFPGSGKYAGHWLGDNYSTWEDMRYSIPGMLEFNLFGIPMVGADICGFNGNTTEELCARWMQLGAF 317
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 221316699  680 YPFSRNHNGQGYKDQDPASFGADslLLNSSRHYLNIRYTLLPYLYTLFFRAH 731
Cdd:cd06602   318 YPFSRNHNDIGAIDQEPYVWGPS--VADASRKALLIRYSLLPYLYTLFYRAH 367
Glyco_hydro_31 pfam01055
Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. ...
1215-1717 0e+00

Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.


:

Pssm-ID: 460044 [Multi-domain]  Cd Length: 443  Bit Score: 606.86  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1215 YVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP- 1293
Cdd:pfam01055    1 YFFLGPTPKDVVKQYTELTGRPPLPPYWALGYHQSRWGYKSEEEVLEVVDGFRERDIPLDVIWLDIDYMDGYRDFTWDPe 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1294 KFAGFPALINRMKADGMRVILILDPAISGNEtQPYPAFTRGVEDDVFIKYPnDGDIVWGKvWPDfpdvvvngsldwdsqv 1373
Cdd:pfam01055   81 RFPDPKGMVDELHAKGQKLVVIIDPGIKKVD-PGYPPYDEGLEKGYFVKNP-DGSLYVGG-WPG---------------- 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1374 elyraYVAFPDFFRNSTAKWWKREIEELYNNpqnpersLKFDGMWIDMNEPSSFVNgavSPGCRDASLNHPPYMPhlesr 1453
Cdd:pfam01055  142 -----MSAFPDFTNPEARDWWADQLFKFLLD-------MGVDGIWNDMNEPSVFCG---SGPEDTVAKDNDPGGG----- 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1454 drglssktlcmesqqilpdgslVQHYNVHNLYGWSQTRPTYEAVQEVTG-QRGVVITRSTFPSSGRWAGHWLGDNTAAWD 1532
Cdd:pfam01055  202 ----------------------VEHYDVHNLYGLLMAKATYEGLREKRPnKRPFVLTRSGFAGSQRYAAHWSGDNTSTWE 259
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1533 QLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTR 1612
Cdd:pfam01055  260 HLRFSIPGGLSLGLSGIPFWGADIGGFFNPTTPELYVRWYQLGAFSPFFRNHSSIDTRRREPWLFGEEVEEIIRKAIRLR 339
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1613 YTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPRARWYDYYTGVDIn 1692
Cdd:pfam01055  340 YRLLPYLYTLFYEAHETGLPVMRPLFLEFPDDPNTFDIDDQFMFGPSLLVAPVLEEGATSVDVYLPGGRWYDFWTGERY- 418
                          490       500
                   ....*....|....*....|....*
gi 221316699  1693 ARGEWKTLPAPLDHINLHVRGGYIL 1717
Cdd:pfam01055  419 EGGGTVPVTAPLDRIPLFVRGGSII 443
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
150-260 1.84e-47

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


:

Pssm-ID: 465286  Cd Length: 113  Bit Score: 165.35  E-value: 1.84e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   150 GFTARLKNLPSSP-VFGSNVDNVLLTAEYQTSNRFHFKLTDQTNNRFEVPHEHVQ-SFSGNAAASLTYQVEISRQPFSIK 227
Cdd:pfam16863    1 GLTADLTLAGSPCnLYGNDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPEELLPrPSPSSSASDSLYEFEYTNEPFGFK 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 221316699   228 VTRRSNNRVLFDSSIGPLLFADQFLQLSTRLPS 260
Cdd:pfam16863   81 VTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
1014-1128 8.33e-39

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


:

Pssm-ID: 465286  Cd Length: 113  Bit Score: 140.69  E-value: 8.33e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1014 GATADISLKSSvYANAFPsTPVNPLRLDVTYHKNEMLQFKIYDPNKNRYEVP-VPLNIPSmPSSTPEGQLYDVLIKKNPF 1092
Cdd:pfam16863    1 GLTADLTLAGS-PCNLYG-NDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPeELLPRPS-PSSSASDSLYEFEYTNEPF 77
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 221316699  1093 GIEIRRKSTGTIIWDSQLLGFTFSDMFIRISTRLPS 1128
Cdd:pfam16863   78 GFKVTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
252-368 1.70e-25

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


:

Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 103.03  E-value: 1.70e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  252 LQLSTRLP-STNVYGLGEHVHqqyRHDMNWKTWPIFNRDT-TPNGNGTNLYGAQTFFLCLEdasglSFGVFLMNSNAMEV 329
Cdd:cd14752    10 LRLSFKLPpDEHFYGLGERFG---GLNKRGKRYRLWNTDQgGYRGSTDPLYGSIPFYLSSK-----GYGVFLDNPSRTEF 81
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 221316699  330 VLQPA--PAITYRTIGGILDFYVFLGNTPEQVVQEYLELIG 368
Cdd:cd14752    82 DFGSEdsDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
1113-1234 6.03e-24

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


:

Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 98.80  E-value: 6.03e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1113 FTFSDmfIRISTRLP-SKYLYGFGEtehRSYRRDLEWHTWGMFSRDQPPGY--KKNSYGVHPYYMGLEedgsAHGVLLLN 1189
Cdd:cd14752     5 VRITP--LRLSFKLPpDEHFYGLGE---RFGGLNKRGKRYRLWNTDQGGYRgsTDPLYGSIPFYLSSK----GYGVFLDN 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699 1190 SNAMDVTFQP--LPALTYRTTGGVLDFYVFLGPTPELVTQQYTELIG 1234
Cdd:cd14752    76 PSRTEFDFGSedSDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
Trefoil pfam00088
Trefoil (P-type) domain;
90-133 3.24e-15

Trefoil (P-type) domain;


:

Pssm-ID: 459666  Cd Length: 43  Bit Score: 71.20  E-value: 3.24e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 221316699    90 CPVVNELERINCIPdQPPTKATCDQRGCCWNPQGAVSVPWCYYS 133
Cdd:pfam00088    1 CSSVPPSDRFDCGY-PGITQEECEARGCCWDPSVDPGVPWCFYP 43
PD smart00018
P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.
957-1000 2.12e-11

P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.


:

Pssm-ID: 197472  Cd Length: 46  Bit Score: 60.48  E-value: 2.12e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 221316699    957 KIRDEEKIDCYPDenGASAENCTARGCIWEaSNSSGVPFCYFVN 1000
Cdd:smart00018    4 SVPPSERINCGPP--GITEAECEARGCCFD-SSISGVPWCFYPN 44
 
Name Accession Description Interval E-value
GH31_MGAM_SI_GAA cd06602
maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup ...
368-731 0e+00

maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII).


Pssm-ID: 269888  Cd Length: 367  Bit Score: 662.67  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06602     1 GRPAMPPYWSLGFHLCRWGYKNLDELKEVVERYRAAGIPLDVQWNDIDYMDRYRDFTLDPVNFPGLPAFVDDLHANGQHY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNNSSSSkpYGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDGIW 527
Cdd:cd06602    81 VPILDPGISANESGG--YPPYDRGLEMDVFIKNDDG-SPYVGKVWPGYTVFPDFTNPNTQEWWTEEIKDFHDQVPFDGLW 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  528 IDMNEVSNFVDGSVS------GCSTNNLNNPPFTPRIL-DGYLFCKTLCMDAVQH-WGKQYDIHNLYGYSMAVATAEAAK 599
Cdd:cd06602   158 IDMNEPSNFCTGSCGnspnapGCPDNKLNNPPYVPNNLgGGSLSDKTICMDAVHYdGGLHYDVHNLYGLSEAIATYKALK 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  600 TVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAF 679
Cdd:cd06602   238 EIFPGKRPFIISRSTFPGSGKYAGHWLGDNYSTWEDMRYSIPGMLEFNLFGIPMVGADICGFNGNTTEELCARWMQLGAF 317
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 221316699  680 YPFSRNHNGQGYKDQDPASFGADslLLNSSRHYLNIRYTLLPYLYTLFFRAH 731
Cdd:cd06602   318 YPFSRNHNDIGAIDQEPYVWGPS--VADASRKALLIRYSLLPYLYTLFYRAH 367
Glyco_hydro_31 pfam01055
Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. ...
349-820 0e+00

Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.


Pssm-ID: 460044 [Multi-domain]  Cd Length: 443  Bit Score: 624.20  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   349 YVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSV 428
Cdd:pfam01055    1 YFFLGPTPKDVVKQYTELTGRPPLPPYWALGYHQSRWGYKSEEEVLEVVDGFRERDIPLDVIWLDIDYMDGYRDFTWDPE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   429 DFKGFPEFVNELHNNGQKLVIIVDPAISnnsSSSKPYGPYDRGSDMKIWVNSSDGvtPLIGEVWPGQTVFPDYTNPNCAV 508
Cdd:pfam01055   81 RFPDPKGMVDELHAKGQKLVVIIDPGIK---KVDPGYPPYDEGLEKGYFVKNPDG--SLYVGGWPGMSAFPDFTNPEARD 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   509 WWTKEFELFHNQVEFDGIWIDMNEVSNFVDGSvsGCSTNNLNNPPFTPrildgylfcktlcmdavqhwGKQYDIHNLYGY 588
Cdd:pfam01055  156 WWADQLFKFLLDMGVDGIWNDMNEPSVFCGSG--PEDTVAKDNDPGGG--------------------VEHYDVHNLYGL 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   589 SMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEE 668
Cdd:pfam01055  214 LMAKATYEGLREKRPNKRPFVLTRSGFAGSQRYAAHWSGDNTSTWEHLRFSIPGGLSLGLSGIPFWGADIGGFFNPTTPE 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   669 LCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVARPLLHEFYED 748
Cdd:pfam01055  294 LYVRWYQLGAFSPFFRNHSSIDTRRREPWLFGEEV--EEIIRKAIRLRYRLLPYLYTLFYEAHETGLPVMRPLFLEFPDD 371
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 221316699   749 NSTWDVHQQFLWGPGLLITPVLDEGAEKVMAYVPDAVWYDYETGSQVRwRKQKVEMELPGDKIGLHLRGGYI 820
Cdd:pfam01055  372 PNTFDIDDQFMFGPSLLVAPVLEEGATSVDVYLPGGRWYDFWTGERYE-GGGTVPVTAPLDRIPLFVRGGSI 442
Glyco_hydro_31 pfam01055
Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. ...
1215-1717 0e+00

Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.


Pssm-ID: 460044 [Multi-domain]  Cd Length: 443  Bit Score: 606.86  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1215 YVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP- 1293
Cdd:pfam01055    1 YFFLGPTPKDVVKQYTELTGRPPLPPYWALGYHQSRWGYKSEEEVLEVVDGFRERDIPLDVIWLDIDYMDGYRDFTWDPe 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1294 KFAGFPALINRMKADGMRVILILDPAISGNEtQPYPAFTRGVEDDVFIKYPnDGDIVWGKvWPDfpdvvvngsldwdsqv 1373
Cdd:pfam01055   81 RFPDPKGMVDELHAKGQKLVVIIDPGIKKVD-PGYPPYDEGLEKGYFVKNP-DGSLYVGG-WPG---------------- 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1374 elyraYVAFPDFFRNSTAKWWKREIEELYNNpqnpersLKFDGMWIDMNEPSSFVNgavSPGCRDASLNHPPYMPhlesr 1453
Cdd:pfam01055  142 -----MSAFPDFTNPEARDWWADQLFKFLLD-------MGVDGIWNDMNEPSVFCG---SGPEDTVAKDNDPGGG----- 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1454 drglssktlcmesqqilpdgslVQHYNVHNLYGWSQTRPTYEAVQEVTG-QRGVVITRSTFPSSGRWAGHWLGDNTAAWD 1532
Cdd:pfam01055  202 ----------------------VEHYDVHNLYGLLMAKATYEGLREKRPnKRPFVLTRSGFAGSQRYAAHWSGDNTSTWE 259
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1533 QLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTR 1612
Cdd:pfam01055  260 HLRFSIPGGLSLGLSGIPFWGADIGGFFNPTTPELYVRWYQLGAFSPFFRNHSSIDTRRREPWLFGEEVEEIIRKAIRLR 339
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1613 YTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPRARWYDYYTGVDIn 1692
Cdd:pfam01055  340 YRLLPYLYTLFYEAHETGLPVMRPLFLEFPDDPNTFDIDDQFMFGPSLLVAPVLEEGATSVDVYLPGGRWYDFWTGERY- 418
                          490       500
                   ....*....|....*....|....*
gi 221316699  1693 ARGEWKTLPAPLDHINLHVRGGYIL 1717
Cdd:pfam01055  419 EGGGTVPVTAPLDRIPLFVRGGSII 443
GH31_MGAM_SI_GAA cd06602
maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup ...
1234-1627 0e+00

maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII).


Pssm-ID: 269888  Cd Length: 367  Bit Score: 592.56  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP-KFAGFPALINRMKADGMRV 1312
Cdd:cd06602     1 GRPAMPPYWSLGFHLCRWGYKNLDELKEVVERYRAAGIPLDVQWNDIDYMDRYRDFTLDPvNFPGLPAFVDDLHANGQHY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1313 ILILDPAISGNETQPYPAFTRGVEDDVFIKYpNDGDIVWGKVWPDfpdvvvngsldwdsqvelyraYVAFPDFFRNSTAK 1392
Cdd:cd06602    81 VPILDPGISANESGGYPPYDRGLEMDVFIKN-DDGSPYVGKVWPG---------------------YTVFPDFTNPNTQE 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1393 WWKREIEELYNNpqnpersLKFDGMWIDMNEPSSFVNGAVSP-----GCRDASLNHPPYMPHLeSRDRGLSSKTLCMESQ 1467
Cdd:cd06602   139 WWTEEIKDFHDQ-------VPFDGLWIDMNEPSNFCTGSCGNspnapGCPDNKLNNPPYVPNN-LGGGSLSDKTICMDAV 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1468 QIlpDGSLvqHYNVHNLYGWSQTRPTYEAVQEV-TGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSL 1546
Cdd:cd06602   211 HY--DGGL--HYDVHNLYGLSEAIATYKALKEIfPGKRPFIISRSTFPGSGKYAGHWLGDNYSTWEDMRYSIPGMLEFNL 286
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1547 FGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKA 1626
Cdd:cd06602   287 FGIPMVGADICGFNGNTTEELCARWMQLGAFYPFSRNHNDIGAIDQEPYVWGPSVADASRKALLIRYSLLPYLYTLFYRA 366

                  .
gi 221316699 1627 H 1627
Cdd:cd06602   367 H 367
PLN02763 PLN02763
hydrolase, hydrolyzing O-glycosyl compounds
258-895 2.72e-97

hydrolase, hydrolyzing O-glycosyl compounds


Pssm-ID: 215408 [Multi-domain]  Cd Length: 978  Bit Score: 338.40  E-value: 2.72e-97
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  258 LPS-TNVYGLGEHVHQQYRHDMNWKTWpifNRDTTPNGNGT-NLYGAQTF-FLCLEdaSGLSFGVFLMNSNAMEVVLQ-- 332
Cdd:PLN02763   70 LPSgTSFYGTGEVSGPLERTGKRVYTW---NTDAWGYGQNTtSLYQSHPWvFVVLP--NGEALGVLADTTRRCEIDLRke 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  333 ---------PAPAITyrtiggildFYVFlgNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAA 403
Cdd:PLN02763  145 siiriiapaSYPVIT---------FGPF--PSPEALLTSLSHAIGTVFMPPKWALGYQQCRWSYESAKRVAEIARTFREK 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  404 QLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNssssKPYGPYDRGSDMKIWVNSSDG 483
Cdd:PLN02763  214 KIPCDVVWMDIDYMDGFRCFTFDKERFPDPKGLADDLHSIGFKAIWMLDPGIKAE----EGYFVYDSGCENDVWIQTADG 289
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  484 vTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNqVEFDGIWIDMNEVSNFVDGSVSGCSTN-NLNNPPFTPRILDGY 562
Cdd:PLN02763  290 -KPFVGEVWPGPCVFPDFTNKKTRSWWANLVKDFVS-NGVDGIWNDMNEPAVFKTVTKTMPETNiHRGDEELGGVQNHSH 367
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  563 LfcktlcmdavqhwgkqydiHNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPG 642
Cdd:PLN02763  368 Y-------------------HNVYGMLMARSTYEGMLLANKNKRPFVLTRAGFIGSQRYAATWTGDNLSNWEHLHMSIPM 428
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  643 VLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADslLLNSSRHYLNIRYTLLPY 722
Cdd:PLN02763  429 VLQLGLSGQPLSGPDIGGFAGDATPKLFGRWMGVGAMFPFARGHSEQGTIDHEPWSFGEE--CEEVCRLALKRRYRLLPH 506
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  723 LYTLFFRAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVL-DEGAEKVMAYVPDAVW--YDYETGSQvrwrk 799
Cdd:PLN02763  507 FYTLFYKAHTTGLPVMTPIFFADPKDPSLRKVENSFLLGPLLISASTLpDQGSDNLQHVLPKGIWqrFDFDDSHP----- 581
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  800 qkvemELPgdkiGLHLRGGYIFPTQQP-NTTTLASRKNPLGLIIALDENKEAKGELFWDNGETKDtVANKVYLLCEFSVT 878
Cdd:PLN02763  582 -----DLP----LLYLQGGSIIPLGPPiQHVGEASLSDDLTLLIALDENGKAEGVLYEDDGDGFG-YTKGDYLLTHYEAE 651
                         650       660
                  ....*....|....*....|.
gi 221316699  879 QNRLEVNI----SQSTYKDPN 895
Cdd:PLN02763  652 LVSSEVTVrvasTEGSWKRPK 672
YicI COG1501
Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];
259-861 1.83e-96

Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];


Pssm-ID: 441110 [Multi-domain]  Cd Length: 609  Bit Score: 325.58  E-value: 1.83e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  259 PSTNVYGLGEH---VHQQYRHDMNWktwpifNRDTTPNGNGTNLYGAQTFFLCLEDasglsFGVFLmNSNAM---EVVLQ 332
Cdd:COG1501    60 LGEQIYGLGERfttLHKRGRIVVNW------NLDHGGHKDNGNTYAPIPFYVSSKG-----YGVFV-NSASYvtfDVGSA 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  333 PAPAITYRTIGGILDFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHA 412
Cdd:COG1501   128 YSDLVEFTVPGDSLEFYVIEGPSPEDVLEKYTELTGKPPLPPRWAFGYWQSRKSYYDQDQVLAFADEFRDRGFPLDVIHL 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  413 DIDYMDE--RRDFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSsskpygPYDRGsdMKIWVNSSDGvTPLIGE 490
Cdd:COG1501   208 DIRWMDKyyWGDFEWDPRRFPDPKAMVKELHDRGVKLVLWINPYVAPDSA------IFAEG--MANFVKIASG-TVFVGK 278
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  491 VWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDGIWIDMNEvsnfvdgsvsGCSTNNLNNPPFTPrildgylfcktlcm 570
Cdd:COG1501   279 MWPGTTGLLDFTRPDAREWFWAGLEKELLSIGVDGIKLDMNE----------GWPTDVATFPSNVP-------------- 334
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  571 davqhwgKQYdiHNLYGYSMAVATAEAAKTVFpNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFG 650
Cdd:COG1501   335 -------QQM--RNLYGLLEAKATFEGFRTSR-NNRTFILTRSGFAGGQRYPVIWTGDNTSSWESLEDQLTQGLNLSLSG 404
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  651 IPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHngQGYKDQDPASFG--ADSLLlnssRHYLNIRYTLLPYLYTLFF 728
Cdd:COG1501   405 VPFWTPDIGGFFGSPSRELWIRWFQVGAFSPFARIH--GWASSTEPWFFDeeAKQIV----KEYAQLRYRLLPYIYSLFA 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  729 RAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVLdEGAEKVMAYVPDAVWYDYETGSQVRwRKQKVEMELPG 808
Cdd:COG1501   479 KASTDGTPVIRPLFLEFPDDPTTRFIDDQYMFGEYLLVAPIF-AGTESRLVYLPKGKWYDFWTGELIE-GGQWITVTAPL 556
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|...
gi 221316699  809 DKIGLHLRGGYIFPTQQPNTTTLASRKNPLGLIIALDenKEAKGELFWDNGET 861
Cdd:COG1501   557 DRLPLYVRDGSIIPLGPVSLRPSMQKIDGIELRVYGS--GETAYTLYDDDGET 607
alpha_gluc_MalA NF040948
alpha-glucosidase MalA;
347-824 1.49e-93

alpha-glucosidase MalA;


Pssm-ID: 468879 [Multi-domain]  Cd Length: 626  Bit Score: 317.35  E-value: 1.49e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  347 DFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYD 426
Cdd:NF040948  145 ELYVIEGPTIEEVLETYSELTGKPFLPPKWALGYQISRYSYYPQDAVVEVVDELRKEGFPVSAVYLDIDYMDSYKLFTWD 224
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  427 SVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSsskpYGPYDRGsdMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNC 506
Cdd:NF040948  225 KEKFPDPRKFIEELHSRGVKVITIVDPSVKADQN----YEVFRSG--LGKYCETENG-ELYVGKLWPGNSVFPDFLNEET 297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  507 AVWWTKEFELFHNQVEFDGIWIDMNEVSNFV-DGSVSGCSTNNLNNPPFTPRILDGYLFcKTLCMDAVQHwgkqYDIHNL 585
Cdd:NF040948  298 REWWAELVEEWVKQYGVDGIWLDMNEPTDFTeDIERAALGPHQLREDRLLYTFPPGAVH-RLDDGKKVKH----EKVRNA 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  586 YGYSMAVATAEAAKTVfpNKRS-FILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFA-- 662
Cdd:NF040948  373 YPYFEAMATYEGLKRA--GKDEpFILSRSGYAGIQRYAAIWTGDNTSSWDDLKLQLQLVLGLSISGVPYVGCDIGGFAgr 450
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  663 ---LDTPEELCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVAR 739
Cdd:NF040948  451 sfpIDNSPELLVRYYQAALFFPLFRTHKSKDGNDQEPYFLPSKY--KEKVKRVIKLRYKFLPYLYSLAWEAHETGHPIIR 528
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  740 PLLHEFYEDNSTWDVHQQFLWGPGLLITPVLDEGAEKVMAYVPDAVWYDYETGSQV---RWRKQkvEMELPgdkigLHLR 816
Cdd:NF040948  529 PLFYEFQDDEDAYRIEDEYMVGKYLLYAPQIYPKEESRDVYLPRGKWLDFWTGEEYegpSWIES--EAELP-----IYIR 601

                  ....*...
gi 221316699  817 GGYIFPTQ 824
Cdd:NF040948  602 EGSAVPLD 609
YicI COG1501
Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];
1127-1758 3.23e-93

Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];


Pssm-ID: 441110 [Multi-domain]  Cd Length: 609  Bit Score: 315.95  E-value: 3.23e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1127 PSKYLYGFGE---TEHRSYRRDLEWHTwgmfsrdQPPGYKK--NSYGVHPYYMGLEEDGsahgvLLLNSNAM---DVTFQ 1198
Cdd:COG1501    60 LGEQIYGLGErftTLHKRGRIVVNWNL-------DHGGHKDngNTYAPIPFYVSSKGYG-----VFVNSASYvtfDVGSA 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1199 PLPALTYRTTGGVLDFYVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYS 1278
Cdd:COG1501   128 YSDLVEFTVPGDSLEFYVIEGPSPEDVLEKYTELTGKPPLPPRWAFGYWQSRKSYYDQDQVLAFADEFRDRGFPLDVIHL 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1279 DIDYMERQL--DFTLSP-KFAGFPALINRMKADGMRVILILDPAIsGNETQPYPAFTRGveddvFIKYPNdGDIVWGKVW 1355
Cdd:COG1501   208 DIRWMDKYYwgDFEWDPrRFPDPKAMVKELHDRGVKLVLWINPYV-APDSAIFAEGMAN-----FVKIAS-GTVFVGKMW 280
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1356 PDfpdvvvngsldwdsqvelyraYVAFPDFFRNSTAKWWKREIEELYnnpqnpeRSLKFDGMWIDMNEpssfvngavspG 1435
Cdd:COG1501   281 PG---------------------TTGLLDFTRPDAREWFWAGLEKEL-------LSIGVDGIKLDMNE-----------G 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1436 CRDASLNHPPYMPHlesrdrglssktlcmesqqilpdgslvqhyNVHNLYGWSQTRPTYEAVQEVTGQRGVVITRSTFPS 1515
Cdd:COG1501   322 WPTDVATFPSNVPQ------------------------------QMRNLYGLLEAKATFEGFRTSRNNRTFILTRSGFAG 371
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1516 SGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTigTRRQDPV 1595
Cdd:COG1501   372 GQRYPVIWTGDNTSSWESLEDQLTQGLNLSLSGVPFWTPDIGGFFGSPSRELWIRWFQVGAFSPFARIHGW--ASSTEPW 449
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1596 SWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLErNARNVTA 1675
Cdd:COG1501   450 FFDEEAKQIVKEYAQLRYRLLPYIYSLFAKASTDGTPVIRPLFLEFPDDPTTRFIDDQYMFGEYLLVAPIFA-GTESRLV 528
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1676 YFPRARWYDYYTGVDINArGEWKTLPAPLDHINLHVRGGYILPWQePALNtHLSRQKFMGFKIALDDEGTAGGWLFWDDG 1755
Cdd:COG1501   529 YLPKGKWYDFWTGELIEG-GQWITVTAPLDRLPLYVRDGSIIPLG-PVSL-RPSMQKIDGIELRVYGSGETAYTLYDDDG 605

                  ...
gi 221316699 1756 QSI 1758
Cdd:COG1501   606 ETV 608
PLN02763 PLN02763
hydrolase, hydrolyzing O-glycosyl compounds
1132-1775 3.79e-75

hydrolase, hydrolyzing O-glycosyl compounds


Pssm-ID: 215408 [Multi-domain]  Cd Length: 978  Bit Score: 272.15  E-value: 3.79e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1132 YGFGETE---HRSYRRDLEWHT--WGmfsrdqppgYKKNS---YGVHPYYMGLEEDGSAHGVLLLNSNAMDVTFQ----- 1198
Cdd:PLN02763   77 YGTGEVSgplERTGKRVYTWNTdaWG---------YGQNTtslYQSHPWVFVVLPNGEALGVLADTTRRCEIDLRkesii 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1199 ------PLPALTyrttggvldFYVFlgPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQND---SEIASLYDEmvaA 1269
Cdd:PLN02763  148 riiapaSYPVIT---------FGPF--PSPEALLTSLSHAIGTVFMPPKWALGYQQCRWSYESAkrvAEIARTFRE---K 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1270 QIPYDVQYSDIDYMERQLDFTLSPKFAGFP-ALINRMKADGMRVILILDPAISGNETqpYPAFTRGVEDDVFIKyPNDGD 1348
Cdd:PLN02763  214 KIPCDVVWMDIDYMDGFRCFTFDKERFPDPkGLADDLHSIGFKAIWMLDPGIKAEEG--YFVYDSGCENDVWIQ-TADGK 290
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1349 IVWGKVWPdfpdvvvngsldwdsqvelyrAYVAFPDFFRNSTAKWWKREIEELYNNpqnperslKFDGMWIDMNEPSSFv 1428
Cdd:PLN02763  291 PFVGEVWP---------------------GPCVFPDFTNKKTRSWWANLVKDFVSN--------GVDGIWNDMNEPAVF- 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1429 ngavspgcRDASLNHPPYMPHLESRDRGlssktlcmesqqilpdgSLVQHYNVHNLYGWSQTRPTYEAVQEV-TGQRGVV 1507
Cdd:PLN02763  341 --------KTVTKTMPETNIHRGDEELG-----------------GVQNHSHYHNVYGMLMARSTYEGMLLAnKNKRPFV 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1508 ITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTI 1587
Cdd:PLN02763  396 LTRAGFIGSQRYAATWTGDNLSNWEHLHMSIPMVLQLGLSGQPLSGPDIGGFAGDATPKLFGRWMGVGAMFPFARGHSEQ 475
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1588 GTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFL-VSPVL 1666
Cdd:PLN02763  476 GTIDHEPWSFGEECEEVCRLALKRRYRLLPHFYTLFYKAHTTGLPVMTPIFFADPKDPSLRKVENSFLLGPLLIsASTLP 555
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1667 ERNARNVTAYFPRARWYDYYTGVDinargeWKTLPApldhinLHVRGGYILPWQEPALntHLSRQKF---MGFKIALDDE 1743
Cdd:PLN02763  556 DQGSDNLQHVLPKGIWQRFDFDDS------HPDLPL------LYLQGGSIIPLGPPIQ--HVGEASLsddLTLLIALDEN 621
                         650       660       670
                  ....*....|....*....|....*....|..
gi 221316699 1744 GTAGGWLFWDDGQSIDtYGKGLYYLASFSASQ 1775
Cdd:PLN02763  622 GKAEGVLYEDDGDGFG-YTKGDYLLTHYEAEL 652
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
150-260 1.84e-47

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


Pssm-ID: 465286  Cd Length: 113  Bit Score: 165.35  E-value: 1.84e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   150 GFTARLKNLPSSP-VFGSNVDNVLLTAEYQTSNRFHFKLTDQTNNRFEVPHEHVQ-SFSGNAAASLTYQVEISRQPFSIK 227
Cdd:pfam16863    1 GLTADLTLAGSPCnLYGNDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPEELLPrPSPSSSASDSLYEFEYTNEPFGFK 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 221316699   228 VTRRSNNRVLFDSSIGPLLFADQFLQLSTRLPS 260
Cdd:pfam16863   81 VTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
1014-1128 8.33e-39

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


Pssm-ID: 465286  Cd Length: 113  Bit Score: 140.69  E-value: 8.33e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1014 GATADISLKSSvYANAFPsTPVNPLRLDVTYHKNEMLQFKIYDPNKNRYEVP-VPLNIPSmPSSTPEGQLYDVLIKKNPF 1092
Cdd:pfam16863    1 GLTADLTLAGS-PCNLYG-NDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPeELLPRPS-PSSSASDSLYEFEYTNEPF 77
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 221316699  1093 GIEIRRKSTGTIIWDSQLLGFTFSDMFIRISTRLPS 1128
Cdd:pfam16863   78 GFKVTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
252-368 1.70e-25

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 103.03  E-value: 1.70e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  252 LQLSTRLP-STNVYGLGEHVHqqyRHDMNWKTWPIFNRDT-TPNGNGTNLYGAQTFFLCLEdasglSFGVFLMNSNAMEV 329
Cdd:cd14752    10 LRLSFKLPpDEHFYGLGERFG---GLNKRGKRYRLWNTDQgGYRGSTDPLYGSIPFYLSSK-----GYGVFLDNPSRTEF 81
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 221316699  330 VLQPA--PAITYRTIGGILDFYVFLGNTPEQVVQEYLELIG 368
Cdd:cd14752    82 DFGSEdsDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
1113-1234 6.03e-24

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 98.80  E-value: 6.03e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1113 FTFSDmfIRISTRLP-SKYLYGFGEtehRSYRRDLEWHTWGMFSRDQPPGY--KKNSYGVHPYYMGLEedgsAHGVLLLN 1189
Cdd:cd14752     5 VRITP--LRLSFKLPpDEHFYGLGE---RFGGLNKRGKRYRLWNTDQGGYRgsTDPLYGSIPFYLSSK----GYGVFLDN 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699 1190 SNAMDVTFQP--LPALTYRTTGGVLDFYVFLGPTPELVTQQYTELIG 1234
Cdd:cd14752    76 PSRTEFDFGSedSDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
Trefoil pfam00088
Trefoil (P-type) domain;
90-133 3.24e-15

Trefoil (P-type) domain;


Pssm-ID: 459666  Cd Length: 43  Bit Score: 71.20  E-value: 3.24e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 221316699    90 CPVVNELERINCIPdQPPTKATCDQRGCCWNPQGAVSVPWCYYS 133
Cdd:pfam00088    1 CSSVPPSDRFDCGY-PGITQEECEARGCCWDPSVDPGVPWCFYP 43
PD smart00018
P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.
88-136 2.12e-14

P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.


Pssm-ID: 197472  Cd Length: 46  Bit Score: 68.95  E-value: 2.12e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 221316699     88 AECpVVNELERINCIPDqPPTKATCDQRGCCWNPQGaVSVPWCYYSKNH 136
Cdd:smart00018    1 AQC-SVPPSERINCGPP-GITEAECEARGCCFDSSI-SGVPWCFYPNTV 46
Trefoil cd00111
P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated ...
88-134 1.29e-13

P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated molecules, largely found in epithelia of gastrointestinal tissues. Function is not known but it was originally identified from mucosal tissues, where it may have a regulatory or structural role and has also been implicated as a growth fractor in other tissues.The domain is found in 1 to 6 copies where it occurs.


Pssm-ID: 238059  Cd Length: 44  Bit Score: 66.60  E-value: 1.29e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699   88 AECPVvNELERINCIPdQPPTKATCDQRGCCWNPQGaVSVPWCYYSK 134
Cdd:cd00111     1 EWCSV-PPSERIDCGP-PGITQEECEARGCCFDPSI-SGVPWCFYPK 44
PD smart00018
P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.
957-1000 2.12e-11

P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.


Pssm-ID: 197472  Cd Length: 46  Bit Score: 60.48  E-value: 2.12e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 221316699    957 KIRDEEKIDCYPDenGASAENCTARGCIWEaSNSSGVPFCYFVN 1000
Cdd:smart00018    4 SVPPSERINCGPP--GITEAECEARGCCFD-SSISGVPWCFYPN 44
Trefoil cd00111
P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated ...
958-1000 5.68e-11

P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated molecules, largely found in epithelia of gastrointestinal tissues. Function is not known but it was originally identified from mucosal tissues, where it may have a regulatory or structural role and has also been implicated as a growth fractor in other tissues.The domain is found in 1 to 6 copies where it occurs.


Pssm-ID: 238059  Cd Length: 44  Bit Score: 58.89  E-value: 5.68e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 221316699  958 IRDEEKIDCYPdeNGASAENCTARGCIWEaSNSSGVPFCYFVN 1000
Cdd:cd00111     5 VPPSERIDCGP--PGITQEECEARGCCFD-PSISGVPWCFYPK 44
Trefoil pfam00088
Trefoil (P-type) domain;
958-998 3.64e-08

Trefoil (P-type) domain;


Pssm-ID: 459666  Cd Length: 43  Bit Score: 51.17  E-value: 3.64e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 221316699   958 IRDEEKIDC-YPdenGASAENCTARGCIWEASNSSGVPFCYF 998
Cdd:pfam00088    4 VPPSDRFDCgYP---GITQEECEARGCCWDPSVDPGVPWCFY 42
Gal_mutarotas_2 pfam13802
Galactose mutarotase-like; This family is found N-terminal to glycosyl-hydrolase domains, and ...
262-329 2.43e-03

Galactose mutarotase-like; This family is found N-terminal to glycosyl-hydrolase domains, and appears to be similar to the galactose mutarotase superfamily.


Pssm-ID: 463987 [Multi-domain]  Cd Length: 67  Bit Score: 38.22  E-value: 2.43e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 221316699   262 NVYGLGEHVhqqyrHDMNWKTWP--IFNRDT-TPNGNGTNLYGAQTFFLCLEDasGLSFGVFLMNSNAMEV 329
Cdd:pfam13802    3 HVYGLGERA-----GPLNKRGTRyrLWNTDAfGYELDTDPLYKSIPFYISHNG--GRGYGVFWDNPAETWF 66
 
Name Accession Description Interval E-value
GH31_MGAM_SI_GAA cd06602
maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup ...
368-731 0e+00

maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII).


Pssm-ID: 269888  Cd Length: 367  Bit Score: 662.67  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06602     1 GRPAMPPYWSLGFHLCRWGYKNLDELKEVVERYRAAGIPLDVQWNDIDYMDRYRDFTLDPVNFPGLPAFVDDLHANGQHY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNNSSSSkpYGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDGIW 527
Cdd:cd06602    81 VPILDPGISANESGG--YPPYDRGLEMDVFIKNDDG-SPYVGKVWPGYTVFPDFTNPNTQEWWTEEIKDFHDQVPFDGLW 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  528 IDMNEVSNFVDGSVS------GCSTNNLNNPPFTPRIL-DGYLFCKTLCMDAVQH-WGKQYDIHNLYGYSMAVATAEAAK 599
Cdd:cd06602   158 IDMNEPSNFCTGSCGnspnapGCPDNKLNNPPYVPNNLgGGSLSDKTICMDAVHYdGGLHYDVHNLYGLSEAIATYKALK 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  600 TVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAF 679
Cdd:cd06602   238 EIFPGKRPFIISRSTFPGSGKYAGHWLGDNYSTWEDMRYSIPGMLEFNLFGIPMVGADICGFNGNTTEELCARWMQLGAF 317
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 221316699  680 YPFSRNHNGQGYKDQDPASFGADslLLNSSRHYLNIRYTLLPYLYTLFFRAH 731
Cdd:cd06602   318 YPFSRNHNDIGAIDQEPYVWGPS--VADASRKALLIRYSLLPYLYTLFYRAH 367
Glyco_hydro_31 pfam01055
Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. ...
349-820 0e+00

Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.


Pssm-ID: 460044 [Multi-domain]  Cd Length: 443  Bit Score: 624.20  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   349 YVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSV 428
Cdd:pfam01055    1 YFFLGPTPKDVVKQYTELTGRPPLPPYWALGYHQSRWGYKSEEEVLEVVDGFRERDIPLDVIWLDIDYMDGYRDFTWDPE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   429 DFKGFPEFVNELHNNGQKLVIIVDPAISnnsSSSKPYGPYDRGSDMKIWVNSSDGvtPLIGEVWPGQTVFPDYTNPNCAV 508
Cdd:pfam01055   81 RFPDPKGMVDELHAKGQKLVVIIDPGIK---KVDPGYPPYDEGLEKGYFVKNPDG--SLYVGGWPGMSAFPDFTNPEARD 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   509 WWTKEFELFHNQVEFDGIWIDMNEVSNFVDGSvsGCSTNNLNNPPFTPrildgylfcktlcmdavqhwGKQYDIHNLYGY 588
Cdd:pfam01055  156 WWADQLFKFLLDMGVDGIWNDMNEPSVFCGSG--PEDTVAKDNDPGGG--------------------VEHYDVHNLYGL 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   589 SMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEE 668
Cdd:pfam01055  214 LMAKATYEGLREKRPNKRPFVLTRSGFAGSQRYAAHWSGDNTSTWEHLRFSIPGGLSLGLSGIPFWGADIGGFFNPTTPE 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   669 LCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVARPLLHEFYED 748
Cdd:pfam01055  294 LYVRWYQLGAFSPFFRNHSSIDTRRREPWLFGEEV--EEIIRKAIRLRYRLLPYLYTLFYEAHETGLPVMRPLFLEFPDD 371
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 221316699   749 NSTWDVHQQFLWGPGLLITPVLDEGAEKVMAYVPDAVWYDYETGSQVRwRKQKVEMELPGDKIGLHLRGGYI 820
Cdd:pfam01055  372 PNTFDIDDQFMFGPSLLVAPVLEEGATSVDVYLPGGRWYDFWTGERYE-GGGTVPVTAPLDRIPLFVRGGSI 442
Glyco_hydro_31 pfam01055
Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. ...
1215-1717 0e+00

Glycosyl hydrolases family 31; Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.


Pssm-ID: 460044 [Multi-domain]  Cd Length: 443  Bit Score: 606.86  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1215 YVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP- 1293
Cdd:pfam01055    1 YFFLGPTPKDVVKQYTELTGRPPLPPYWALGYHQSRWGYKSEEEVLEVVDGFRERDIPLDVIWLDIDYMDGYRDFTWDPe 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1294 KFAGFPALINRMKADGMRVILILDPAISGNEtQPYPAFTRGVEDDVFIKYPnDGDIVWGKvWPDfpdvvvngsldwdsqv 1373
Cdd:pfam01055   81 RFPDPKGMVDELHAKGQKLVVIIDPGIKKVD-PGYPPYDEGLEKGYFVKNP-DGSLYVGG-WPG---------------- 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1374 elyraYVAFPDFFRNSTAKWWKREIEELYNNpqnpersLKFDGMWIDMNEPSSFVNgavSPGCRDASLNHPPYMPhlesr 1453
Cdd:pfam01055  142 -----MSAFPDFTNPEARDWWADQLFKFLLD-------MGVDGIWNDMNEPSVFCG---SGPEDTVAKDNDPGGG----- 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1454 drglssktlcmesqqilpdgslVQHYNVHNLYGWSQTRPTYEAVQEVTG-QRGVVITRSTFPSSGRWAGHWLGDNTAAWD 1532
Cdd:pfam01055  202 ----------------------VEHYDVHNLYGLLMAKATYEGLREKRPnKRPFVLTRSGFAGSQRYAAHWSGDNTSTWE 259
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1533 QLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTR 1612
Cdd:pfam01055  260 HLRFSIPGGLSLGLSGIPFWGADIGGFFNPTTPELYVRWYQLGAFSPFFRNHSSIDTRRREPWLFGEEVEEIIRKAIRLR 339
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1613 YTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPRARWYDYYTGVDIn 1692
Cdd:pfam01055  340 YRLLPYLYTLFYEAHETGLPVMRPLFLEFPDDPNTFDIDDQFMFGPSLLVAPVLEEGATSVDVYLPGGRWYDFWTGERY- 418
                          490       500
                   ....*....|....*....|....*
gi 221316699  1693 ARGEWKTLPAPLDHINLHVRGGYIL 1717
Cdd:pfam01055  419 EGGGTVPVTAPLDRIPLFVRGGSII 443
GH31_MGAM_SI_GAA cd06602
maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup ...
1234-1627 0e+00

maltase-glucoamylase, sucrase-isomaltase, lysosomal acid alpha-glucosidase; This subgroup includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII).


Pssm-ID: 269888  Cd Length: 367  Bit Score: 592.56  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP-KFAGFPALINRMKADGMRV 1312
Cdd:cd06602     1 GRPAMPPYWSLGFHLCRWGYKNLDELKEVVERYRAAGIPLDVQWNDIDYMDRYRDFTLDPvNFPGLPAFVDDLHANGQHY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1313 ILILDPAISGNETQPYPAFTRGVEDDVFIKYpNDGDIVWGKVWPDfpdvvvngsldwdsqvelyraYVAFPDFFRNSTAK 1392
Cdd:cd06602    81 VPILDPGISANESGGYPPYDRGLEMDVFIKN-DDGSPYVGKVWPG---------------------YTVFPDFTNPNTQE 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1393 WWKREIEELYNNpqnpersLKFDGMWIDMNEPSSFVNGAVSP-----GCRDASLNHPPYMPHLeSRDRGLSSKTLCMESQ 1467
Cdd:cd06602   139 WWTEEIKDFHDQ-------VPFDGLWIDMNEPSNFCTGSCGNspnapGCPDNKLNNPPYVPNN-LGGGSLSDKTICMDAV 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1468 QIlpDGSLvqHYNVHNLYGWSQTRPTYEAVQEV-TGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSL 1546
Cdd:cd06602   211 HY--DGGL--HYDVHNLYGLSEAIATYKALKEIfPGKRPFIISRSTFPGSGKYAGHWLGDNYSTWEDMRYSIPGMLEFNL 286
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1547 FGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKA 1626
Cdd:cd06602   287 FGIPMVGADICGFNGNTTEELCARWMQLGAFYPFSRNHNDIGAIDQEPYVWGPSVADASRKALLIRYSLLPYLYTLFYRA 366

                  .
gi 221316699 1627 H 1627
Cdd:cd06602   367 H 367
GH31_MGAM-like cd06600
maltase-glucoamylase (MGAM)-like; This family includes the following closely related glycosyl ...
368-719 1.24e-137

maltase-glucoamylase (MGAM)-like; This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII). GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. Alpha-glucosidase II is a GH31 enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus solfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source. The MGAM-like family corresponds to subgroup 1 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269886 [Multi-domain]  Cd Length: 256  Bit Score: 427.68  E-value: 1.24e-137
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06600     1 GRPALPPYWAFGYHQSRYSYYDQDKVVEVVDIMQEAGIPYDVMWLDIDYMDSYKDFTWDPVRFPEPKKFVDELHKNGQKL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAIsnnsssskpygpydrgsdmkiwvnssdgvtpligevwpgqtvfpdytnpnCAVWWTKEFELFHNQVEFDGIW 527
Cdd:cd06600    81 VTIVDPGI--------------------------------------------------TREWWAGLISEFLYSQGIDGIW 110
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  528 IDMNEVSNFvdgsvsgcstnnlnnppftprildgylfcktlcmdavqhwgkqYDIHNLYGYSMAVATAEAAKTVfPNKRS 607
Cdd:cd06600   111 IDMNEPSNF-------------------------------------------YKVHNLYGFYEAMATAEGLRTS-HNERP 146
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  608 FILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHN 687
Cdd:cd06600   147 FILSRSTFAGSQKYAAHWTGDNTASWDDLKLSIPLVLGLSLSGIPFVGADIGGFAGDTSEELLVRWYQLGAFYPFSRSHK 226
                         330       340       350
                  ....*....|....*....|....*....|..
gi 221316699  688 GQGYKDQDPASFgaDSLLLNSSRHYLNIRYTL 719
Cdd:cd06600   227 ATDTKDQEPVLF--PEYYKESVREILELRYKL 256
GH31_GANC_GANAB_alpha cd06603
neutral alpha-glucosidase C, neutral alpha-glucosidase AB; This subgroup includes the closely ...
368-861 1.78e-120

neutral alpha-glucosidase C, neutral alpha-glucosidase AB; This subgroup includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.


Pssm-ID: 269889  Cd Length: 467  Bit Score: 388.80  E-value: 1.78e-120
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06603     1 GTPPLPPLFALGYHQCRWNYNDQEDVLEVDANFDEHDIPYDVIWLDIEHTDGKRYFTWDKKKFPDPKKMQEKLASKGRKL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNNSSsskpYGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFD--G 525
Cdd:cd06603    81 VTIVDPHIKRDDD----YFVYKEAKEKDYFVKDSDG-KDFEGWCWPGSSSWPDFLNPEVRDWWASLFSYDKYKGSTEnlY 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  526 IWIDMNEVSNFvdgsvsgcstnnlNNPPFT-PRildgylfcktlcmDAVQHWGKQY-DIHNLYGYSMAVATAEA-AKTVF 602
Cdd:cd06603   156 IWNDMNEPSVF-------------NGPEITmPK-------------DAIHYGGVEHrDVHNIYGLYMHMATFEGlLKRSN 209
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  603 PNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPF 682
Cdd:cd06603   210 GKKRPFVLTRSFFAGSQRYGAVWTGDNMATWEHLKISIPMLLSLSIAGIPFVGADVGGFFGNPDEELLVRWYQAGAFYPF 289
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  683 SRNHNGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGP 762
Cdd:cd06603   290 FRAHAHIDTKRREPWLFGEET--TEIIREAIRLRYRLLPYWYTLFYEASRTGLPIMRPLWYEFPEDESTFDIDDQFMLGD 367
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  763 GLLITPVLDEGAEKVMAYVP-DAVWYDYETGSQVRwRKQKVEMELPGDKIGLHLRGGYIFPTQQ-PNTTTLASRKNPLGL 840
Cdd:cd06603   368 SLLVKPVVEEGATSVTVYLPgGEVWYDYFTGQRVT-GGGTKTVPVPLDSIPVFQRGGSIIPRKErVRRSSKLMRNDPYTL 446
                         490       500
                  ....*....|....*....|.
gi 221316699  841 IIALDENKEAKGELFWDNGET 861
Cdd:cd06603   447 VVALDENGEAEGELYLDDGES 467
GH31_glucosidase_II_MalA cd06604
Alpha-glucosidase II-like; Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a ...
368-734 4.79e-120

Alpha-glucosidase II-like; Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This subgroup also includes the MalA alpha-glucosidase from Sulfolobus solfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.


Pssm-ID: 269890 [Multi-domain]  Cd Length: 339  Bit Score: 382.24  E-value: 4.79e-120
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06604     1 GRPPLPPKWALGYQQSRWSYYPEEEVREVAKGFRERDIPCDAIYLDIDYMDGYRVFTWDKERFPDPKELIKELHEQGFRL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNNssssKPYGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNQvEFDGIW 527
Cdd:cd06604    81 VTIVDPGVKVD----PGYEVYEEGLENDYFVKDPDG-ELYVGKVWPGKSVFPDFTNPEVREWWGDLYKELVDL-GVDGIW 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  528 IDMNEVSNFVDGSVSGCSTNNLNNppftpriLDGylfcktlcmDAVQHwgkqYDIHNLYGYSMAVATAEAAKTVFPNKRS 607
Cdd:cd06604   155 NDMNEPAVFNAPGGTTMPLDAVHR-------LDG---------GKITH----EEVHNLYGLLMARATYEGLRRLRPNKRP 214
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  608 FILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHN 687
Cdd:cd06604   215 FVLSRAGYAGIQRYAAIWTGDNSSSWEHLRLSIPMLLNLGLSGVPFVGADIGGFAGDPSPELLARWYQLGAFFPFFRNHS 294
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699  688 GQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRG 734
Cdd:cd06604   295 AKGTRDQEPWAFGEEV--EEIARKAIELRYRLLPYLYTLFYEAHETG 339
GH31_GANC_GANAB_alpha cd06603
neutral alpha-glucosidase C, neutral alpha-glucosidase AB; This subgroup includes the closely ...
1234-1757 1.56e-104

neutral alpha-glucosidase C, neutral alpha-glucosidase AB; This subgroup includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.


Pssm-ID: 269889  Cd Length: 467  Bit Score: 343.35  E-value: 1.56e-104
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSPKFagFP---ALINRMKADGM 1310
Cdd:cd06603     1 GTPPLPPLFALGYHQCRWNYNDQEDVLEVDANFDEHDIPYDVIWLDIEHTDGKRYFTWDKKK--FPdpkKMQEKLASKGR 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1311 RVILILDPAISGNETqpYPAFTRGVEDDVFIKYPNDGDIVwGKVWPdfpdvvvnGSLDWdsqvelyrayvafPDFFRNST 1390
Cdd:cd06603    79 KLVTIVDPHIKRDDD--YFVYKEAKEKDYFVKDSDGKDFE-GWCWP--------GSSSW-------------PDFLNPEV 134
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1391 AKWWKreieELYNnPQNPERSLKFDGMWIDMNEPSSFvNGAvspgcrDASLnhPPYMPHLESrdrglssktlcmesqqil 1470
Cdd:cd06603   135 RDWWA----SLFS-YDKYKGSTENLYIWNDMNEPSVF-NGP------EITM--PKDAIHYGG------------------ 182
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1471 pdgslVQHYNVHNLYGWSQTRPTYEAV--QEVTGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFG 1548
Cdd:cd06603   183 -----VEHRDVHNIYGLYMHMATFEGLlkRSNGKKRPFVLTRSFFAGSQRYGAVWTGDNMATWEHLKISIPMLLSLSIAG 257
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1549 ISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHT 1628
Cdd:cd06603   258 IPFVGADVGGFFGNPDEELLVRWYQAGAFYPFFRAHAHIDTKRREPWLFGEETTEIIREAIRLRYRLLPYWYTLFYEASR 337
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1629 EGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPR-ARWYDYYTGvDINARGEWKTLPAPLDHI 1707
Cdd:cd06603   338 TGLPIMRPLWYEFPEDESTFDIDDQFMLGDSLLVKPVVEEGATSVTVYLPGgEVWYDYFTG-QRVTGGGTKTVPVPLDSI 416
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|.
gi 221316699 1708 NLHVRGGYILP-WQEPALNTHLSRQKFMGFKIALDDEGTAGGWLFWDDGQS 1757
Cdd:cd06603   417 PVFQRGGSIIPrKERVRRSSKLMRNDPYTLVVALDENGEAEGELYLDDGES 467
GH31_glucosidase_II_MalA cd06604
Alpha-glucosidase II-like; Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a ...
1234-1630 1.27e-102

Alpha-glucosidase II-like; Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This subgroup also includes the MalA alpha-glucosidase from Sulfolobus solfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.


Pssm-ID: 269890 [Multi-domain]  Cd Length: 339  Bit Score: 332.94  E-value: 1.27e-102
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP-KFAGFPALINRMKADGMRV 1312
Cdd:cd06604     1 GRPPLPPKWALGYQQSRWSYYPEEEVREVAKGFRERDIPCDAIYLDIDYMDGYRVFTWDKeRFPDPKELIKELHEQGFRL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1313 ILILDPAISGNetQPYPAFTRGVEDDVFIKYPnDGDIVWGKVWPDFpdvvvngsldwdsqvelyrayVAFPDFFRNSTAK 1392
Cdd:cd06604    81 VTIVDPGVKVD--PGYEVYEEGLENDYFVKDP-DGELYVGKVWPGK---------------------SVFPDFTNPEVRE 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1393 WWKREIEELYNNpqnperslKFDGMWIDMNEPSSFVNgavspgcrdaslNHPPYMPhLESRDRGlssktlcmesqqilpD 1472
Cdd:cd06604   137 WWGDLYKELVDL--------GVDGIWNDMNEPAVFNA------------PGGTTMP-LDAVHRL---------------D 180
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1473 GSLVQHYNVHNLYGWSQTRPTYEAVQEV-TGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISY 1551
Cdd:cd06604   181 GGKITHEEVHNLYGLLMARATYEGLRRLrPNKRPFVLSRAGYAGIQRYAAIWTGDNSSSWEHLRLSIPMLLNLGLSGVPF 260
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 221316699 1552 TGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEG 1630
Cdd:cd06604   261 VGADIGGFAGDPSPELLARWYQLGAFFPFFRNHSAKGTRDQEPWAFGEEVEEIARKAIELRYRLLPYLYTLFYEAHETG 339
PLN02763 PLN02763
hydrolase, hydrolyzing O-glycosyl compounds
258-895 2.72e-97

hydrolase, hydrolyzing O-glycosyl compounds


Pssm-ID: 215408 [Multi-domain]  Cd Length: 978  Bit Score: 338.40  E-value: 2.72e-97
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  258 LPS-TNVYGLGEHVHQQYRHDMNWKTWpifNRDTTPNGNGT-NLYGAQTF-FLCLEdaSGLSFGVFLMNSNAMEVVLQ-- 332
Cdd:PLN02763   70 LPSgTSFYGTGEVSGPLERTGKRVYTW---NTDAWGYGQNTtSLYQSHPWvFVVLP--NGEALGVLADTTRRCEIDLRke 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  333 ---------PAPAITyrtiggildFYVFlgNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAA 403
Cdd:PLN02763  145 siiriiapaSYPVIT---------FGPF--PSPEALLTSLSHAIGTVFMPPKWALGYQQCRWSYESAKRVAEIARTFREK 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  404 QLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNssssKPYGPYDRGSDMKIWVNSSDG 483
Cdd:PLN02763  214 KIPCDVVWMDIDYMDGFRCFTFDKERFPDPKGLADDLHSIGFKAIWMLDPGIKAE----EGYFVYDSGCENDVWIQTADG 289
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  484 vTPLIGEVWPGQTVFPDYTNPNCAVWWTKEFELFHNqVEFDGIWIDMNEVSNFVDGSVSGCSTN-NLNNPPFTPRILDGY 562
Cdd:PLN02763  290 -KPFVGEVWPGPCVFPDFTNKKTRSWWANLVKDFVS-NGVDGIWNDMNEPAVFKTVTKTMPETNiHRGDEELGGVQNHSH 367
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  563 LfcktlcmdavqhwgkqydiHNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPG 642
Cdd:PLN02763  368 Y-------------------HNVYGMLMARSTYEGMLLANKNKRPFVLTRAGFIGSQRYAATWTGDNLSNWEHLHMSIPM 428
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  643 VLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADslLLNSSRHYLNIRYTLLPY 722
Cdd:PLN02763  429 VLQLGLSGQPLSGPDIGGFAGDATPKLFGRWMGVGAMFPFARGHSEQGTIDHEPWSFGEE--CEEVCRLALKRRYRLLPH 506
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  723 LYTLFFRAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVL-DEGAEKVMAYVPDAVW--YDYETGSQvrwrk 799
Cdd:PLN02763  507 FYTLFYKAHTTGLPVMTPIFFADPKDPSLRKVENSFLLGPLLISASTLpDQGSDNLQHVLPKGIWqrFDFDDSHP----- 581
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  800 qkvemELPgdkiGLHLRGGYIFPTQQP-NTTTLASRKNPLGLIIALDENKEAKGELFWDNGETKDtVANKVYLLCEFSVT 878
Cdd:PLN02763  582 -----DLP----LLYLQGGSIIPLGPPiQHVGEASLSDDLTLLIALDENGKAEGVLYEDDGDGFG-YTKGDYLLTHYEAE 651
                         650       660
                  ....*....|....*....|.
gi 221316699  879 QNRLEVNI----SQSTYKDPN 895
Cdd:PLN02763  652 LVSSEVTVrvasTEGSWKRPK 672
YicI COG1501
Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];
259-861 1.83e-96

Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];


Pssm-ID: 441110 [Multi-domain]  Cd Length: 609  Bit Score: 325.58  E-value: 1.83e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  259 PSTNVYGLGEH---VHQQYRHDMNWktwpifNRDTTPNGNGTNLYGAQTFFLCLEDasglsFGVFLmNSNAM---EVVLQ 332
Cdd:COG1501    60 LGEQIYGLGERfttLHKRGRIVVNW------NLDHGGHKDNGNTYAPIPFYVSSKG-----YGVFV-NSASYvtfDVGSA 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  333 PAPAITYRTIGGILDFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHA 412
Cdd:COG1501   128 YSDLVEFTVPGDSLEFYVIEGPSPEDVLEKYTELTGKPPLPPRWAFGYWQSRKSYYDQDQVLAFADEFRDRGFPLDVIHL 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  413 DIDYMDE--RRDFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSsskpygPYDRGsdMKIWVNSSDGvTPLIGE 490
Cdd:COG1501   208 DIRWMDKyyWGDFEWDPRRFPDPKAMVKELHDRGVKLVLWINPYVAPDSA------IFAEG--MANFVKIASG-TVFVGK 278
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  491 VWPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDGIWIDMNEvsnfvdgsvsGCSTNNLNNPPFTPrildgylfcktlcm 570
Cdd:COG1501   279 MWPGTTGLLDFTRPDAREWFWAGLEKELLSIGVDGIKLDMNE----------GWPTDVATFPSNVP-------------- 334
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  571 davqhwgKQYdiHNLYGYSMAVATAEAAKTVFpNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFG 650
Cdd:COG1501   335 -------QQM--RNLYGLLEAKATFEGFRTSR-NNRTFILTRSGFAGGQRYPVIWTGDNTSSWESLEDQLTQGLNLSLSG 404
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  651 IPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHngQGYKDQDPASFG--ADSLLlnssRHYLNIRYTLLPYLYTLFF 728
Cdd:COG1501   405 VPFWTPDIGGFFGSPSRELWIRWFQVGAFSPFARIH--GWASSTEPWFFDeeAKQIV----KEYAQLRYRLLPYIYSLFA 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  729 RAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVLdEGAEKVMAYVPDAVWYDYETGSQVRwRKQKVEMELPG 808
Cdd:COG1501   479 KASTDGTPVIRPLFLEFPDDPTTRFIDDQYMFGEYLLVAPIF-AGTESRLVYLPKGKWYDFWTGELIE-GGQWITVTAPL 556
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|...
gi 221316699  809 DKIGLHLRGGYIFPTQQPNTTTLASRKNPLGLIIALDenKEAKGELFWDNGET 861
Cdd:COG1501   557 DRLPLYVRDGSIIPLGPVSLRPSMQKIDGIELRVYGS--GETAYTLYDDDGET 607
alpha_gluc_MalA NF040948
alpha-glucosidase MalA;
347-824 1.49e-93

alpha-glucosidase MalA;


Pssm-ID: 468879 [Multi-domain]  Cd Length: 626  Bit Score: 317.35  E-value: 1.49e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  347 DFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYD 426
Cdd:NF040948  145 ELYVIEGPTIEEVLETYSELTGKPFLPPKWALGYQISRYSYYPQDAVVEVVDELRKEGFPVSAVYLDIDYMDSYKLFTWD 224
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  427 SVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSsskpYGPYDRGsdMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNPNC 506
Cdd:NF040948  225 KEKFPDPRKFIEELHSRGVKVITIVDPSVKADQN----YEVFRSG--LGKYCETENG-ELYVGKLWPGNSVFPDFLNEET 297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  507 AVWWTKEFELFHNQVEFDGIWIDMNEVSNFV-DGSVSGCSTNNLNNPPFTPRILDGYLFcKTLCMDAVQHwgkqYDIHNL 585
Cdd:NF040948  298 REWWAELVEEWVKQYGVDGIWLDMNEPTDFTeDIERAALGPHQLREDRLLYTFPPGAVH-RLDDGKKVKH----EKVRNA 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  586 YGYSMAVATAEAAKTVfpNKRS-FILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFA-- 662
Cdd:NF040948  373 YPYFEAMATYEGLKRA--GKDEpFILSRSGYAGIQRYAAIWTGDNTSSWDDLKLQLQLVLGLSISGVPYVGCDIGGFAgr 450
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  663 ---LDTPEELCRRWMQLGAFYPFSRNHNGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVAR 739
Cdd:NF040948  451 sfpIDNSPELLVRYYQAALFFPLFRTHKSKDGNDQEPYFLPSKY--KEKVKRVIKLRYKFLPYLYSLAWEAHETGHPIIR 528
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  740 PLLHEFYEDNSTWDVHQQFLWGPGLLITPVLDEGAEKVMAYVPDAVWYDYETGSQV---RWRKQkvEMELPgdkigLHLR 816
Cdd:NF040948  529 PLFYEFQDDEDAYRIEDEYMVGKYLLYAPQIYPKEESRDVYLPRGKWLDFWTGEEYegpSWIES--EAELP-----IYIR 601

                  ....*...
gi 221316699  817 GGYIFPTQ 824
Cdd:NF040948  602 EGSAVPLD 609
YicI COG1501
Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];
1127-1758 3.23e-93

Alpha-glucosidase/xylosidase, GH31 family [Carbohydrate transport and metabolism];


Pssm-ID: 441110 [Multi-domain]  Cd Length: 609  Bit Score: 315.95  E-value: 3.23e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1127 PSKYLYGFGE---TEHRSYRRDLEWHTwgmfsrdQPPGYKK--NSYGVHPYYMGLEEDGsahgvLLLNSNAM---DVTFQ 1198
Cdd:COG1501    60 LGEQIYGLGErftTLHKRGRIVVNWNL-------DHGGHKDngNTYAPIPFYVSSKGYG-----VFVNSASYvtfDVGSA 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1199 PLPALTYRTTGGVLDFYVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYS 1278
Cdd:COG1501   128 YSDLVEFTVPGDSLEFYVIEGPSPEDVLEKYTELTGKPPLPPRWAFGYWQSRKSYYDQDQVLAFADEFRDRGFPLDVIHL 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1279 DIDYMERQL--DFTLSP-KFAGFPALINRMKADGMRVILILDPAIsGNETQPYPAFTRGveddvFIKYPNdGDIVWGKVW 1355
Cdd:COG1501   208 DIRWMDKYYwgDFEWDPrRFPDPKAMVKELHDRGVKLVLWINPYV-APDSAIFAEGMAN-----FVKIAS-GTVFVGKMW 280
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1356 PDfpdvvvngsldwdsqvelyraYVAFPDFFRNSTAKWWKREIEELYnnpqnpeRSLKFDGMWIDMNEpssfvngavspG 1435
Cdd:COG1501   281 PG---------------------TTGLLDFTRPDAREWFWAGLEKEL-------LSIGVDGIKLDMNE-----------G 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1436 CRDASLNHPPYMPHlesrdrglssktlcmesqqilpdgslvqhyNVHNLYGWSQTRPTYEAVQEVTGQRGVVITRSTFPS 1515
Cdd:COG1501   322 WPTDVATFPSNVPQ------------------------------QMRNLYGLLEAKATFEGFRTSRNNRTFILTRSGFAG 371
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1516 SGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTigTRRQDPV 1595
Cdd:COG1501   372 GQRYPVIWTGDNTSSWESLEDQLTQGLNLSLSGVPFWTPDIGGFFGSPSRELWIRWFQVGAFSPFARIHGW--ASSTEPW 449
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1596 SWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLErNARNVTA 1675
Cdd:COG1501   450 FFDEEAKQIVKEYAQLRYRLLPYIYSLFAKASTDGTPVIRPLFLEFPDDPTTRFIDDQYMFGEYLLVAPIFA-GTESRLV 528
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1676 YFPRARWYDYYTGVDINArGEWKTLPAPLDHINLHVRGGYILPWQePALNtHLSRQKFMGFKIALDDEGTAGGWLFWDDG 1755
Cdd:COG1501   529 YLPKGKWYDFWTGELIEG-GQWITVTAPLDRLPLYVRDGSIIPLG-PVSL-RPSMQKIDGIELRVYGSGETAYTLYDDDG 605

                  ...
gi 221316699 1756 QSI 1758
Cdd:COG1501   606 ETV 608
GH31 cd06589
glycosyl hydrolase family 31 (GH31); GH31 enzymes occur in prokaryotes, eukaryotes, and ...
368-713 3.75e-87

glycosyl hydrolase family 31 (GH31); GH31 enzymes occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues have been identified as the catalytic nucleophile and the acid/base, respectively.


Pssm-ID: 269876 [Multi-domain]  Cd Length: 265  Bit Score: 285.40  E-value: 3.75e-87
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMD---ERRDFTYDSVDFKGFPEFVNELHNNG 444
Cdd:cd06589     1 GRPPLLPKWALGFWNSRYGYYSEDEVEELVDRYREEGIPLDGFVLDSDWMDwggNWGGFTWNREKFPDPKGMIDELHDKG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  445 QKLVIIVDPAIsnnsssskpygpydrgsdmkiwvnssdgvtpligevwpgqtvfpdytnpncAVWWTKEFELFHNQVEFD 524
Cdd:cd06589    81 VKLGLIVKPRL---------------------------------------------------RDWWWENIKKLLLEQGVD 109
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  525 GIWIDMNEVSNFVDgsvsgcstnnlnnppftprildgylfcktlcmDAVQHWGKQYDIHNLYGYSMAVATAEAAKTVFPN 604
Cdd:cd06589   110 GWWTDMGEPLPFDD--------------------------------ATFHNGGKAQKIHNAYPLNMAEATYEGQKKTFPN 157
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  605 KRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTP-EELCRRWMQLGAFYPFS 683
Cdd:cd06589   158 KRPFILSRSGYAGAQRYPAIWSGDNTTTWDSLAFQIRAGLSASLSGVGYWGHDIGGFTGGDPdKELYTRWVQFGAFSPIF 237
                         330       340       350
                  ....*....|....*....|....*....|
gi 221316699  684 RNHNGQGYKDQDPasFGADSLLLNSSRHYL 713
Cdd:cd06589   238 RLHGDNSPRDKEP--WVYGEEALAIFRKYL 265
GH31_MGAM-like cd06600
maltase-glucoamylase (MGAM)-like; This family includes the following closely related glycosyl ...
1234-1615 1.50e-80

maltase-glucoamylase (MGAM)-like; This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes Pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII). GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. Alpha-glucosidase II is a GH31 enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus solfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source. The MGAM-like family corresponds to subgroup 1 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269886 [Multi-domain]  Cd Length: 256  Bit Score: 266.28  E-value: 1.50e-80
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSPK-FAGFPALINRMKADGMRV 1312
Cdd:cd06600     1 GRPALPPYWAFGYHQSRYSYYDQDKVVEVVDIMQEAGIPYDVMWLDIDYMDSYKDFTWDPVrFPEPKKFVDELHKNGQKL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1313 ILILDPAIsgnetqpypaftrgveddvfikypndgdivwgkvwpdfpdvvvngsldwdsqvelyrayvafpdffrnsTAK 1392
Cdd:cd06600    81 VTIVDPGI---------------------------------------------------------------------TRE 91
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1393 WWKREIEELYNnpqnperSLKFDGMWIDMNEPSSFvngavspgcrdaslnhppymphlesrdrglssktlcmesqqilpd 1472
Cdd:cd06600    92 WWAGLISEFLY-------SQGIDGIWIDMNEPSNF--------------------------------------------- 119
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1473 gslvqhYNVHNLYGWSQTRPTYEAVQEVTGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYT 1552
Cdd:cd06600   120 ------YKVHNLYGFYEAMATAEGLRTSHNERPFILSRSTFAGSQKYAAHWTGDNTASWDDLKLSIPLVLGLSLSGIPFV 193
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 221316699 1553 GADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTL 1615
Cdd:cd06600   194 GADIGGFAGDTSEELLVRWYQLGAFYPFSRSHKATDTKDQEPVLFPEYYKESVREILELRYKL 256
PLN02763 PLN02763
hydrolase, hydrolyzing O-glycosyl compounds
1132-1775 3.79e-75

hydrolase, hydrolyzing O-glycosyl compounds


Pssm-ID: 215408 [Multi-domain]  Cd Length: 978  Bit Score: 272.15  E-value: 3.79e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1132 YGFGETE---HRSYRRDLEWHT--WGmfsrdqppgYKKNS---YGVHPYYMGLEEDGSAHGVLLLNSNAMDVTFQ----- 1198
Cdd:PLN02763   77 YGTGEVSgplERTGKRVYTWNTdaWG---------YGQNTtslYQSHPWVFVVLPNGEALGVLADTTRRCEIDLRkesii 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1199 ------PLPALTyrttggvldFYVFlgPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQND---SEIASLYDEmvaA 1269
Cdd:PLN02763  148 riiapaSYPVIT---------FGPF--PSPEALLTSLSHAIGTVFMPPKWALGYQQCRWSYESAkrvAEIARTFRE---K 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1270 QIPYDVQYSDIDYMERQLDFTLSPKFAGFP-ALINRMKADGMRVILILDPAISGNETqpYPAFTRGVEDDVFIKyPNDGD 1348
Cdd:PLN02763  214 KIPCDVVWMDIDYMDGFRCFTFDKERFPDPkGLADDLHSIGFKAIWMLDPGIKAEEG--YFVYDSGCENDVWIQ-TADGK 290
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1349 IVWGKVWPdfpdvvvngsldwdsqvelyrAYVAFPDFFRNSTAKWWKREIEELYNNpqnperslKFDGMWIDMNEPSSFv 1428
Cdd:PLN02763  291 PFVGEVWP---------------------GPCVFPDFTNKKTRSWWANLVKDFVSN--------GVDGIWNDMNEPAVF- 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1429 ngavspgcRDASLNHPPYMPHLESRDRGlssktlcmesqqilpdgSLVQHYNVHNLYGWSQTRPTYEAVQEV-TGQRGVV 1507
Cdd:PLN02763  341 --------KTVTKTMPETNIHRGDEELG-----------------GVQNHSHYHNVYGMLMARSTYEGMLLAnKNKRPFV 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1508 ITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTI 1587
Cdd:PLN02763  396 LTRAGFIGSQRYAATWTGDNLSNWEHLHMSIPMVLQLGLSGQPLSGPDIGGFAGDATPKLFGRWMGVGAMFPFARGHSEQ 475
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1588 GTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFL-VSPVL 1666
Cdd:PLN02763  476 GTIDHEPWSFGEECEEVCRLALKRRYRLLPHFYTLFYKAHTTGLPVMTPIFFADPKDPSLRKVENSFLLGPLLIsASTLP 555
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1667 ERNARNVTAYFPRARWYDYYTGVDinargeWKTLPApldhinLHVRGGYILPWQEPALntHLSRQKF---MGFKIALDDE 1743
Cdd:PLN02763  556 DQGSDNLQHVLPKGIWQRFDFDDS------HPDLPL------LYLQGGSIIPLGPPIQ--HVGEASLsddLTLLIALDEN 621
                         650       660       670
                  ....*....|....*....|....*....|..
gi 221316699 1744 GTAGGWLFWDDGQSIDtYGKGLYYLASFSASQ 1775
Cdd:PLN02763  622 GKAEGVLYEDDGDGFG-YTKGDYLLTHYEAEL 652
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
150-260 1.84e-47

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


Pssm-ID: 465286  Cd Length: 113  Bit Score: 165.35  E-value: 1.84e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699   150 GFTARLKNLPSSP-VFGSNVDNVLLTAEYQTSNRFHFKLTDQTNNRFEVPHEHVQ-SFSGNAAASLTYQVEISRQPFSIK 227
Cdd:pfam16863    1 GLTADLTLAGSPCnLYGNDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPEELLPrPSPSSSASDSLYEFEYTNEPFGFK 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 221316699   228 VTRRSNNRVLFDSSIGPLLFADQFLQLSTRLPS 260
Cdd:pfam16863   81 VTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
GH31_lyase_GLase cd06601
alpha-1,4-glucan lyase; GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 ...
368-730 2.62e-47

alpha-1,4-glucan lyase; GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269887 [Multi-domain]  Cd Length: 347  Bit Score: 173.75  E-value: 2.62e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKL 447
Cdd:cd06601     1 GRSRMKPRYVFGYHQGCYGYSSRESLEVVVQSYRDANIPLDGLHIDVDFQDNYRTFTTSKDKFPNPKEMFSNLHAQGFKC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  448 VIIVDPAISNnsssskpygpydrgsdmkiwvnssdgvtPLIGEVWPGQTV-----FPDYTNPNCAVWWTKEFE-LFHNQV 521
Cdd:cd06601    81 STNITPIITD----------------------------PYIGGVNYGGGLgspgfYPDLGRPEVREWWGQQYKyLFDMGL 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  522 EFdgIWIDMnevsnfvdgsvsgcstnnlNNPPFTPRILDGYLFCKTL-------CMDAVQHWGKQ--YDIHNLYGYSMAV 592
Cdd:cd06601   133 EM--VWQDM-------------------TTPAIAPHKINGYGDMKTFplrllvtDDSVKNEHTYKpaATLWNLYAYNLHK 191
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  593 AT-----AEAAKtvfPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFAL--DT 665
Cdd:cd06601   192 ATyhglnRLNAR---PNRRNFIIGRGGYAGAQRFAGLWTGDNASTWDFLQINIPQVLNLGLSGVPISGSDIGGFASgsDE 268
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 221316699  666 PE------ELCRRWMQLGAFYPFSRNH----NGQGYKDQDPASFGADSLLLNSSRHYLNIRYTLLPYLYTLFFRA 730
Cdd:cd06601   269 NEgkwcdpELLIRWVQAGAFLPWFRNHydryIKKKQQEKLYEPYYYYEPVLPICRKYVELRYRLMQVFYDAMYEN 343
GH31 cd06589
glycosyl hydrolase family 31 (GH31); GH31 enzymes occur in prokaryotes, eukaryotes, and ...
1234-1598 7.62e-42

glycosyl hydrolase family 31 (GH31); GH31 enzymes occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues have been identified as the catalytic nucleophile and the acid/base, respectively.


Pssm-ID: 269876 [Multi-domain]  Cd Length: 265  Bit Score: 155.20  E-value: 7.62e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTL----SPKFAGFPALINRMKADG 1309
Cdd:cd06589     1 GRPPLLPKWALGFWNSRYGYYSEDEVEELVDRYREEGIPLDGFVLDSDWMDWGGNWGGftwnREKFPDPKGMIDELHDKG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1310 MRVILILDPAIsgnetqpypaftrgveddvfikypndgdivwgkvwpdfpdvvvngsldwdsqvelyrayvafpdffrns 1389
Cdd:cd06589    81 VKLGLIVKPRL--------------------------------------------------------------------- 91
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1390 tAKWWKREIEELYNnpqnperSLKFDGMWIDMNEPSSFVNGAVSPGcrdaslnhppymphlesrdrglssktlcmesqqi 1469
Cdd:cd06589    92 -RDWWWENIKKLLL-------EQGVDGWWTDMGEPLPFDDATFHNG---------------------------------- 129
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1470 lpdgslVQHYNVHNLYGWSQTRPTYEAVQEVTG-QRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFG 1548
Cdd:cd06589   130 ------GKAQKIHNAYPLNMAEATYEGQKKTFPnKRPFILSRSGYAGAQRYPAIWSGDNTTTWDSLAFQIRAGLSASLSG 203
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|.
gi 221316699 1549 ISYTGADICGFFQ-DAEYEMCVRWMQLGAFYPFSRNHNtiGTRRQDPVSWD 1598
Cdd:cd06589   204 VGYWGHDIGGFTGgDPDKELYTRWVQFGAFSPIFRLHG--DNSPRDKEPWV 252
GH31_transferase_CtsZ cd06598
CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z)-like; CtsZ is a bacterial ...
368-729 3.45e-40

CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z)-like; CtsZ is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain; CtsY belongs to a different subfamily. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269884  Cd Length: 332  Bit Score: 152.84  E-value: 3.45e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLP-----YDVQ-HADIDYMDERR--DFTYDSVDFKGFPEFVNE 439
Cdd:cd06598     1 GRPPLPPKWAFGLWQSEFGYDNWAEVDELVDTLRQKDFPldgvvLDLYwFGGIIASPDGPmgDLDWDRKAFPDPAKMIAD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  440 LHNNGQKLVIIVDPAISNNSSSskpygpYDRGSDMKIWVNSSDG-VTPLIGEVWPGQTVFPDYTNPNCAVWWtKEFELFH 518
Cdd:cd06598    81 LKQQGVGTILIEEPYVLKNSDE------YDELVKKGLLAKDKAGkPEPTLFNFWFGEGGMIDWSDPEARAWW-HDRYKDL 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  519 NQVEFDGIWIDMNEvsnfvdgsvsgcstnnlnnPPFTPRildgylfcktlcmDAVQHWGKQYDIHNLYGYSMAVATAEAA 598
Cdd:cd06598   154 IDMGVAGWWTDLGE-------------------PEMHPP-------------DMVHADGDAADVHNIYNLLWAKSIYDGY 201
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  599 KTVFPNKRSFILTRSTFAGSGKF-AAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTP--EELCRRWMQ 675
Cdd:cd06598   202 QRNFPEQRPFIMSRSGTAGSQRYgVIPWSGDIGRTWGGLASQINLQLHMSLSGIDYYGSDIGGFARGETldPELYTRWFQ 281
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 221316699  676 LGAFYPFSRNHnGQGYKDQDPASFGADSllLNSSRHYLNIRYTLLPYLYTLFFR 729
Cdd:cd06598   282 YGAFDPPVRPH-GQNLCNPETAPDREGT--KAINRENIKLRYQLLPYYYSLAYR 332
GH31_xylosidase_YicI cd06593
alpha-xylosidase YicI-like; YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) ...
368-719 1.96e-39

alpha-xylosidase YicI-like; YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269879 [Multi-domain]  Cd Length: 308  Bit Score: 149.64  E-value: 1.96e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERR--DFTYDSVDFKGFPEFVNELHNNGQ 445
Cdd:cd06593     1 GRPPLPPAWSFGLWLSRSFYYSEEEVLEVADGMRERGIPCDVIHLDCFWMKEDWwcDFEWDEERFPDPEGMIARLKEKGF 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  446 KLVIIVDPAISNNSSsskpygPYDRGSDMKIWVNSSDGVTPLIGEVWPGQTVFPDYTNPNCAVWWT-KEFELFHNQVefD 524
Cdd:cd06593    81 KVCLWINPYISQDSP------LFKEAAEKGYLVKNPDGSPWHQWDGWQPGMGIIDFTNPEAVAWYKeKLKRLLDMGV--D 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  525 GIWIDMNEVsnfvdgsvsgcstnnlnnppftprildgylfcktLCMDAVQHWGKQYD-IHNLYGYSMAVATAEAAKTVFP 603
Cdd:cd06593   153 VIKTDFGER----------------------------------IPEDAVYYDGSDGRkMHNLYPLLYNKAVYEATKEVKG 198
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  604 nKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFS 683
Cdd:cd06593   199 -EEAVLWARSAWAGSQRYPVHWGGDSESTFEGMAASLRGGLSLGLSGFGFWSHDIGGFEGTPSPELYKRWTQFGLLSSHS 277
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 221316699  684 RNHnGQGYKdqDPASFGADSllLNSSRHYLNIRYTL 719
Cdd:cd06593   278 RLH-GSTPR--EPWEYGEEA--LDVVRKFAKLRYRL 308
NtCtMGAM_N pfam16863
N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like ...
1014-1128 8.33e-39

N-terminal barrel of NtMGAM and CtMGAM, maltase-glucoamylase; NtCtMGAM_N is a beta-barrel-like structure just N-terminal to the catalytic domain of maltase-glucoamylase in eukaryotes. It contributes to the architecture of the substrate-binding site, by donating a loop that comes into close contact with two regions in the catalytic domain thereby creating the site. This family is frequently found at the N-terminus of Glycosyl hydrolase 31, pfam01055.to which it contributes as above.


Pssm-ID: 465286  Cd Length: 113  Bit Score: 140.69  E-value: 8.33e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  1014 GATADISLKSSvYANAFPsTPVNPLRLDVTYHKNEMLQFKIYDPNKNRYEVP-VPLNIPSmPSSTPEGQLYDVLIKKNPF 1092
Cdd:pfam16863    1 GLTADLTLAGS-PCNLYG-NDIETLKLTVEYQTDNRLHVKITDPNNKRYEVPeELLPRPS-PSSSASDSLYEFEYTNEPF 77
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 221316699  1093 GIEIRRKSTGTIIWDSQLLGFTFSDMFIRISTRLPS 1128
Cdd:pfam16863   78 GFKVTRKSTGEVLFDTSGGPLVFEDQFLQLSTRLPS 113
GH31_glycosidase_Aec37 cd06599
E.coli Aec37-like; Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family ...
368-683 1.49e-37

E.coli Aec37-like; Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269885 [Multi-domain]  Cd Length: 319  Bit Score: 144.66  E-value: 1.49e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDN----MREVVERNRAAQLPYDVQHADIDY----MDERRDFTYDSVDFKGFPEFVNE 439
Cdd:cd06599     1 GRPALPPRWSLGYLGSTMYYTEAPDaqeqILDFIDTCREHDIPCDGFHLSSGYtsieDGKRYVFNWNKDKFPDPKAFFRK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  440 LHNNGQKLVIIVDPAISNNSssskPYgpYDRGSDMKIWVNSSDGVTPLIGEVWPGQTVFPDYTNPNCAVWWTKEfeLFHN 519
Cdd:cd06599    81 FHERGIRLVANIKPGLLTDH----PH--YDELAEKGAFIKDDDGGEPAVGRFWGGGGSYLDFTNPEGREWWKEG--LKEQ 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  520 QVEF--DGIWIDMNEVSnfvdgsvsgcstnnlnnppftprILDGYLFCKTLCMDAVQHWGKQydihnLYGYSMAVATAEA 597
Cdd:cd06599   153 LLDYgiDSVWNDNNEYE-----------------------IWDDDAACCGFGKGGPISELRP-----IQPLLMARASREA 204
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  598 AKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPE-ELCRRWMQL 676
Cdd:cd06599   205 QLEHAPNKRPFVISRSGCAGIQRYAQTWSGDNRTSWKTLKYNIAMGLGMSLSGVANYGHDIGGFAGPAPEpELFVRWVQN 284

                  ....*...
gi 221316699  677 GAFYP-FS 683
Cdd:cd06599   285 GIFQPrFS 292
PRK10658 PRK10658
putative alpha-glucosidase; Provisional
343-807 1.84e-32

putative alpha-glucosidase; Provisional


Pssm-ID: 236731 [Multi-domain]  Cd Length: 665  Bit Score: 136.18  E-value: 1.84e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  343 GGILDFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLS-----RYEYGT----LDNMREvveRNraaqLPYDVQHAD 413
Cdd:PRK10658  233 GEYLEYFVIDGPTPKEVLDRYTALTGRPALPPAWSFGLWLTtsfttNYDEATvnsfIDGMAE---RD----LPLHVFHFD 305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  414 IDYMDERR--DFTYDSVDFKGfPE-FVNELHNNGQKLVIIVDPAISNNSSSskpygpYDRGSDMKIWVNSSDGvtpligE 490
Cdd:PRK10658  306 CFWMKEFQwcDFEWDPRTFPD-PEgMLKRLKAKGLKICVWINPYIAQKSPL------FKEGKEKGYLLKRPDG------S 372
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  491 VW------PGQTVFpDYTNPNCAVWWTKEFELFhnqvefdgiwIDMNevsnfVDgsvsgC-STNnlnnppFTPRIldgyl 563
Cdd:PRK10658  373 VWqwdkwqPGMAIV-DFTNPDACKWYADKLKGL----------LDMG-----VD-----CfKTD------FGERI----- 420
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  564 fcktlCMDAVQHWGKQ-YDIHNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPG 642
Cdd:PRK10658  421 -----PTDVVWFDGSDpQKMHNYYTYLYNKTVFDVLKETRGEGEAVLFARSATVGGQQFPVHWGGDCYSNYESMAESLRG 495
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  643 VLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPFSRNHNGQGYKDqdPASFGADSLllNSSRHYLNIRYTLLPY 722
Cdd:PRK10658  496 GLSLGLSGFGFWSHDIGGFENTATADVYKRWCAFGLLSSHSRLHGSKSYRV--PWAYDEEAV--DVVRFFTKLKCRLMPY 571
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  723 LYTLFFRAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVLDEGAEkVMAYVPDAVWYDYETGSQV---RWRK 799
Cdd:PRK10658  572 LYREAAEAHERGTPMMRAMVLEFPDDPACDYLDRQYMLGDSLLVAPVFSEAGD-VEYYLPEGRWTHLLTGEEVeggRWHK 650

                  ....*....
gi 221316699  800 QKV-EMELP 807
Cdd:PRK10658  651 EQHdFLSLP 659
GH31_transferase_CtsZ cd06598
CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z)-like; CtsZ is a bacterial ...
1234-1625 3.35e-32

CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z)-like; CtsZ is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain; CtsY belongs to a different subfamily. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269884  Cd Length: 332  Bit Score: 129.34  E-value: 3.35e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYD-----------VQYSDIDYMERqLDFTLspkfAGFP--- 1299
Cdd:cd06598     1 GRPPLPPKWAFGLWQSEFGYDNWAEVDELVDTLRQKDFPLDgvvldlywfggIIASPDGPMGD-LDWDR----KAFPdpa 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1300 ALINRMKADGMRVILILDPAISGNEtqpyPAFTRGVEDDVFIKY------PNDGDIVWGKvwpdfpdvvvNGSLDWdsqv 1373
Cdd:cd06598    76 KMIADLKQQGVGTILIEEPYVLKNS----DEYDELVKKGLLAKDkagkpePTLFNFWFGE----------GGMIDW---- 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1374 elyrayvafpdfFRNSTAKWWKREIEELYNnpqnpersLKFDGMWIDMNEPSsfvngavspgcrdaslNHPPYMPHLesr 1453
Cdd:cd06598   138 ------------SDPEARAWWHDRYKDLID--------MGVAGWWTDLGEPE----------------MHPPDMVHA--- 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1454 drglssktlcmesqqilpDGSlvqHYNVHNLYG--WSQTrpTYEA-VQEVTGQRGVVITRSTFPSSGRW-AGHWLGDNTA 1529
Cdd:cd06598   179 ------------------DGD---AADVHNIYNllWAKS--IYDGyQRNFPEQRPFIMSRSGTAGSQRYgVIPWSGDIGR 235
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1530 AWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEY--EMCVRWMQLGAFYPFSRNHnTIGTRRQDPVSWDVAFVNISRT 1607
Cdd:cd06598   236 TWGGLASQINLQLHMSLSGIDYYGSDIGGFARGETLdpELYTRWFQYGAFDPPVRPH-GQNLCNPETAPDREGTKAINRE 314
                         410
                  ....*....|....*...
gi 221316699 1608 VLQTRYTLLPYLYTLMHK 1625
Cdd:cd06598   315 NIKLRYQLLPYYYSLAYR 332
GH31_NET37 cd06592
glucosidase NET37; NET37 (also known as KIAA1161) is a human lamina-associated nuclear ...
422-786 1.72e-31

glucosidase NET37; NET37 (also known as KIAA1161) is a human lamina-associated nuclear envelope transmembrane protein. A member of the glycosyl hydrolase family 31 (GH31) , it has been shown to be required for myogenic differentiation of C2C12 cells. Related proteins are found in eukaryotes and prokaryotes. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269878 [Multi-domain]  Cd Length: 364  Bit Score: 128.11  E-value: 1.72e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  422 DFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSSskpygpYDRGSDMKIWVNSSDGVTPLIGEVWPGQTVFPDY 501
Cdd:cd06592    49 DFEFDPEKFPDPKGMIDKLHEMGFRVTLWVHPFINPDSPN------FRELRDKGYLVKEDSGGPPLIVKWWNGYGAVLDF 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  502 TNPNCAVWWTKEFELFHNQVEFDGIWIDMNEVSNFvdgsvsgcstnnLNNPPFTPRILDGYLFCKTlcmdavqhWGKQYD 581
Cdd:cd06592   123 TNPEARDWFKERLRELQEDYGIDGFKFDAGEASYL------------PADPATFPSGLNPNEYTTL--------YAELAA 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  582 IhnlYGYSMAVATAEAaktvfpNKRSFILTRSTFAGSgkfaaHWlgdntATWDDLRWSIPGVLEFNLFGIPMVGPD-ICG 660
Cdd:cd06592   183 E---FGLLNEVRSGWK------SQGLPLFVRMSDKDS-----HW-----GYWNGLRSLIPTALTQGLLGYPFVLPDmIGG 243
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  661 FALD---TPEELCRRWMQLGAFYP---FSrnHNGQGYKDQDpasfgadslLLNSSRHYLNIRYTLLPYLYTLFFRAHSRG 734
Cdd:cd06592   244 NAYGnfpPDKELYIRWLQLSAFMPamqFS--VAPWRNYDEE---------VVDIARKLAKLREKLLPYIYELAAEAVDTG 312
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 221316699  735 DTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVLDEGAEKVMAYVPDAVW 786
Cdd:cd06592   313 EPIIRPLWWIAPEDEEALTIDDQFLLGDDILVAPVLEKGARSRDVYLPKGRW 364
PRK10658 PRK10658
putative alpha-glucosidase; Provisional
1212-1699 2.02e-30

putative alpha-glucosidase; Provisional


Pssm-ID: 236731 [Multi-domain]  Cd Length: 665  Bit Score: 130.02  E-value: 2.02e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1212 LDFYVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSE--IASLYDEMVAAQIPYDVQYSDIDYMeRQL-- 1287
Cdd:PRK10658  236 LEYFVIDGPTPKEVLDRYTALTGRPALPPAWSFGLWLTTSFTTNYDEatVNSFIDGMAERDLPLHVFHFDCFWM-KEFqw 314
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1288 -DFTLSPkfAGFP---ALINRMKADGMRVILILDPAISgnetQPYPAFTRGVEDDVFIKYPNdgdivwGKVWpdfpdvvv 1363
Cdd:PRK10658  315 cDFEWDP--RTFPdpeGMLKRLKAKGLKICVWINPYIA----QKSPLFKEGKEKGYLLKRPD------GSVW-------- 374
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1364 ngslDWDsqveLYRAYVAFPDFFRNSTAKWWKREIEELynnpqnperslkfdgmwIDMnepssfvngavspgcrdaslnh 1443
Cdd:PRK10658  375 ----QWD----KWQPGMAIVDFTNPDACKWYADKLKGL-----------------LDM---------------------- 407
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1444 ppymphlesrdrGLSS-KTLCMESqqiLP------DGSLVQhyNVHNLYGWSQTRPTYEAVQEVTGQR-GVVITRSTFPS 1515
Cdd:PRK10658  408 ------------GVDCfKTDFGER---IPtdvvwfDGSDPQ--KMHNYYTYLYNKTVFDVLKETRGEGeAVLFARSATVG 470
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1516 SGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRrqdpV 1595
Cdd:PRK10658  471 GQQFPVHWGGDCYSNYESMAESLRGGLSLGLSGFGFWSHDIGGFENTATADVYKRWCAFGLLSSHSRLHGSKSYR----V 546
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1596 SW--DVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARnV 1673
Cdd:PRK10658  547 PWayDEEAVDVVRFFTKLKCRLMPYLYREAAEAHERGTPMMRAMVLEFPDDPACDYLDRQYMLGDSLLVAPVFSEAGD-V 625
                         490       500
                  ....*....|....*....|....*.
gi 221316699 1674 TAYFPRARWYDYYTGVDINArGEWKT 1699
Cdd:PRK10658  626 EYYLPEGRWTHLLTGEEVEG-GRWHK 650
GH31_CPE1046 cd06596
Clostridium CPE1046-like; CPE1046 is an uncharacterized Clostridium perfringens protein with a ...
586-792 5.00e-30

Clostridium CPE1046-like; CPE1046 is an uncharacterized Clostridium perfringens protein with a glycosyl hydrolase family 31 (GH31) domain. The domain architecture of CPE1046 and its orthologs includes a C-terminal fibronectin type 3 (FN3) domain and a coagulation factor 5/8 type C domain in addition to the GH31 domain. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269882  Cd Length: 334  Bit Score: 123.22  E-value: 5.00e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  586 YGYSMAVATAEAAKTVFPNK---RSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFA 662
Cdd:cd06596   122 AGYSFALNGVEDAADGIENNsnaRPFIWTVDGWAGTQRYAVIWTGDQSGSWEYIRFHIPTYIGSGLSGQAYATSDVDGIF 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  663 LDTPEELCRRwMQLGAFYPFSRNHNGQGYKDQDPASFGADSLLLNssRHYLNIRYTLLPYLYTLFFRAHSRGDTVARPLL 742
Cdd:cd06596   202 GGSPETYTRD-LQWKAFTPVLMNMSGWAANDKQPWVFGEPYTSIN--RKYLKLKMRLMPYIYTYAREASVTGLPMVRAMF 278
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 221316699  743 HEFYEDNSTW--DVHQQFLWGPGLLITPVLDEGAEKVMA----YVPDAVWYDYETG 792
Cdd:cd06596   279 LEYPNDPTAYgtATQYQFMWGPDFLVAPVYQNTAAGNDVrngiYLPAGTWIDYWTG 334
GH31_xylosidase_XylS cd06591
xylosidase XylS-like; XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in ...
368-686 4.73e-29

xylosidase XylS-like; XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269877 [Multi-domain]  Cd Length: 322  Bit Score: 119.97  E-value: 4.73e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVQHADIDYMDERR--DFTYDSVDFKGFPEFVNELHNNGQ 445
Cdd:cd06591     1 GKAPMLPKWALGFWQSKERYKTQEELLEVAREYRERGIPLDVIVQDWFYWTEQGwgDMKFDPERFPDPKGMVDELHKMNV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  446 KLVIIVDPAISNNSSSskpygpYDRGSDMKIWVNSSDGVTPLigevwPGQTVFPDYTNPNCAVWWTKEFELFHNQVEFDG 525
Cdd:cd06591    81 KLMISVWPTFGPGSEN------YKELDEKGLLLRTNRGNGGF-----GGGTAFYDATNPEAREIYWKQLKDNYFDKGIDA 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  526 IWIDMNEVSNFVDGsvsgcstnnlnnppftprilDGYLFCKTlcmdavqHWGKQYDIHNLYGYSMAVATAEAAKTVFPNK 605
Cdd:cd06591   150 WWLDATEPELDPYD--------------------FDNYDGRT-------ALGPGAEVGNAYPLMHAKGIYEGQRATGPDK 202
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  606 RSFILTRSTFAGSGKF-AAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGF-------ALDTPE--ELCRRWMQ 675
Cdd:cd06591   203 RVVILTRSAFAGQQRYgAAVWSGDISSSWETLRRQIPAGLNFGASGIPYWTTDIGGFfggdpepGEDDPAyrELYVRWFQ 282
                         330
                  ....*....|.
gi 221316699  676 LGAFYPFSRNH 686
Cdd:cd06591   283 FGAFCPIFRSH 293
GH31_NET37 cd06592
glucosidase NET37; NET37 (also known as KIAA1161) is a human lamina-associated nuclear ...
1238-1682 4.22e-28

glucosidase NET37; NET37 (also known as KIAA1161) is a human lamina-associated nuclear envelope transmembrane protein. A member of the glycosyl hydrolase family 31 (GH31) , it has been shown to be required for myogenic differentiation of C2C12 cells. Related proteins are found in eukaryotes and prokaryotes. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269878 [Multi-domain]  Cd Length: 364  Bit Score: 118.09  E-value: 4.22e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1238 MVPYWSLGFQLcrYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP-KFAGFPALINRMKADGMRVILIL 1316
Cdd:cd06592     1 RPPIWSTWAEY--KYNINQEKVLEYAEEIRANGFPPSVIEIDDGWQTYYGDFEFDPeKFPDPKGMIDKLHEMGFRVTLWV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1317 DPAISGNEtqpyPAFTRGVEDDVFIKYPNDGDIVWGKVWpdfpdvvvNGsldwdsqvelyraYVAFPDFFRNSTAKWWKR 1396
Cdd:cd06592    79 HPFINPDS----PNFRELRDKGYLVKEDSGGPPLIVKWW--------NG-------------YGAVLDFTNPEARDWFKE 133
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1397 EIEELynnpqnpERSLKFDGMWIDMNEPSsfvngavspgcrdaslnhppYMPHLESRDRGLSSktlcmesqqilPDgslv 1476
Cdd:cd06592   134 RLREL-------QEDYGIDGFKFDAGEAS--------------------YLPADPATFPSGLN-----------PN---- 171
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1477 qHYnvHNLYGwsQTRPTYEAVQEVT----GQRGVVITRSTFPSSgRWaGHWLGdntaawdqLKKSIIGMMEFSLFGISYT 1552
Cdd:cd06592   172 -EY--TTLYA--ELAAEFGLLNEVRsgwkSQGLPLFVRMSDKDS-HW-GYWNG--------LRSLIPTALTQGLLGYPFV 236
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1553 GADICG----FFQDAEYEMCVRWMQLGAFYP---FSrnhntigtrrqdPVSWDVAF---VNISRTVLQTRYTLLPYLYTL 1622
Cdd:cd06592   237 LPDMIGgnayGNFPPDKELYIRWLQLSAFMPamqFS------------VAPWRNYDeevVDIARKLAKLREKLLPYIYEL 304
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1623 MHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPRARW 1682
Cdd:cd06592   305 AAEAVDTGEPIIRPLWWIAPEDEEALTIDDQFLLGDDILVAPVLEKGARSRDVYLPKGRW 364
GH31_lyase_GLase cd06601
alpha-1,4-glucan lyase; GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 ...
1234-1630 2.11e-27

alpha-1,4-glucan lyase; GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269887 [Multi-domain]  Cd Length: 347  Bit Score: 115.59  E-value: 2.11e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQLDFTLSP-KFAGFPALINRMKADGMRV 1312
Cdd:cd06601     1 GRSRMKPRYVFGYHQGCYGYSSRESLEVVVQSYRDANIPLDGLHIDVDFQDNYRTFTTSKdKFPNPKEMFSNLHAQGFKC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1313 ilildpaiSGNETQpypaftrgveddvFIKYPNDGDIVWGkvwpdfpdvvvnGSLDWDSQvelyrayvaFPDFFRNSTAK 1392
Cdd:cd06601    81 --------STNITP-------------IITDPYIGGVNYG------------GGLGSPGF---------YPDLGRPEVRE 118
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1393 WWKREIEELYnnpqnperSLKFDGMWIDMNEPssfvngAVSPGCRDASlnhpPYMPHLESRdrglssktLCMESQQILPD 1472
Cdd:cd06601   119 WWGQQYKYLF--------DMGLEMVWQDMTTP------AIAPHKINGY----GDMKTFPLR--------LLVTDDSVKNE 172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1473 GSLVQHYNVHNLYGWSQTRPTYEAVQEVTG---QRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGI 1549
Cdd:cd06601   173 HTYKPAATLWNLYAYNLHKATYHGLNRLNArpnRRNFIIGRGGYAGAQRFAGLWTGDNASTWDFLQINIPQVLNLGLSGV 252
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1550 SYTGADICGFFQDAE--------YEMCVRWMQLGAFYPFSRNH----NTIGTRRQDPVSWDV--AFVNISRTVLQTRYTL 1615
Cdd:cd06601   253 PISGSDIGGFASGSDenegkwcdPELLIRWVQAGAFLPWFRNHydryIKKKQQEKLYEPYYYyePVLPICRKYVELRYRL 332
                         410
                  ....*....|....*
gi 221316699 1616 LPYLYTLMHKAHTEG 1630
Cdd:cd06601   333 MQVFYDAMYENTQNG 347
PRK10426 PRK10426
alpha-glucosidase; Provisional
1482-1704 3.12e-27

alpha-glucosidase; Provisional


Pssm-ID: 236691 [Multi-domain]  Cd Length: 635  Bit Score: 119.71  E-value: 3.12e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1482 HNLYG--WSQTrpTYEAVQEvTGQRG--VVITRSTFPSSGRWAG-HWLGDNTAAW---DQLKKSIIGMMEFSLFGISYTG 1553
Cdd:PRK10426  381 HNAWPalWAKC--NYEALEE-TGKLGeiLFFMRAGYTGSQKYSTlFWAGDQNVDWsldDGLASVVPAALSLGMSGHGLHH 457
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1554 ADICGFFQDAEY----EMCVRWMQLGAFYPFSRNHNtiGTRRQDPVSWD------VAFVNISRTvlqtrYTLL-PYLYTL 1622
Cdd:PRK10426  458 SDIGGYTTLFGMkrtkELLLRWCEFSAFTPVMRTHE--GNRPGDNWQFDsdaetiAHFARMTRV-----FTTLkPYLKEL 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1623 MHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLERNARNVTAYFPRARWYDYYTGVDInaRGEWKTLPA 1702
Cdd:PRK10426  531 VAEAAKTGLPVMRPLFLHYEDDAATYTLKYQYLLGRDLLVAPVHEEGRTDWTVYLPEDKWVHLWTGEAF--AGGEITVEA 608

                  ..
gi 221316699 1703 PL 1704
Cdd:PRK10426  609 PI 610
GH31_transferase_CtsY cd06597
CtsY (cyclic tetrasaccharide-synthesizing enzyme Y)-like; CtsY is a bacterial ...
368-717 5.38e-27

CtsY (cyclic tetrasaccharide-synthesizing enzyme Y)-like; CtsY is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain; CtsZ belongs to a different subfamily. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269883 [Multi-domain]  Cd Length: 326  Bit Score: 113.95  E-value: 5.38e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDVqhADID-YMDERRDFTYDSVDFKgFPEF---VNELHNN 443
Cdd:cd06597     1 GRAALPPKWAFGHWVSANEWNSQAEVLELVEEYLAYDIPVGA--VVIEaWSDEATFYIFNDATGK-WPDPkgmIDSLHEQ 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  444 GQKLVIIVDPAISNNSSSSKPYGP-YDRGSDMKIWVNSSDGVTPLIGEVWPGQTVFPDYTNPNCAVWWtkefelfHNQVE 522
Cdd:cd06597    78 GIKVILWQTPVVKTDGTDHAQKSNdYAEAIAKGYYVKNGDGTPYIPEGWWFGGGSLIDFTNPEAVAWW-------HDQRD 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  523 --FDGIWIDmnevsNF-VDGSvsgcstnnlnnppftprilDGYLFcktlcMDAVQHWGKQYDI-HNLYGYSMAVATAEAA 598
Cdd:cd06597   151 ylLDELGID-----GFkTDGG-------------------EPYWG-----EDLIFSDGKKGREmRNEYPNLYYKAYFDYI 201
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  599 KTVFPNKRSFilTRSTFAGSGKFAAHWLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPE-ELCRRWMQLG 677
Cdd:cd06597   202 REIGNDGVLF--SRAGDSGAQRYPIGWVGDQDSTFEGLQSALKAGLSAAWSGYPFWGWDIGGFSGPLPTaELYLRWTQLA 279
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 221316699  678 AFYPFSRNH---NGQGYKDQDPASFGA---DSLLLNSSRHYLNIRY 717
Cdd:cd06597   280 AFSPIMQNHsekNHRPWSEERRWNVAErtgDPEVLDIYRKYVKLRM 325
GH31_CPE1046 cd06596
Clostridium CPE1046-like; CPE1046 is an uncharacterized Clostridium perfringens protein with a ...
1494-1688 5.41e-27

Clostridium CPE1046-like; CPE1046 is an uncharacterized Clostridium perfringens protein with a glycosyl hydrolase family 31 (GH31) domain. The domain architecture of CPE1046 and its orthologs includes a C-terminal fibronectin type 3 (FN3) domain and a coagulation factor 5/8 type C domain in addition to the GH31 domain. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269882  Cd Length: 334  Bit Score: 113.98  E-value: 5.41e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1494 YEAVQEVTGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQDAEyEMCVRWMQ 1573
Cdd:cd06596   135 ADGIENNSNARPFIWTVDGWAGTQRYAVIWTGDQSGSWEYIRFHIPTYIGSGLSGQAYATSDVDGIFGGSP-ETYTRDLQ 213
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1574 LGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQTRYTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDS- 1652
Cdd:cd06596   214 WKAFTPVLMNMSGWAANDKQPWVFGEPYTSINRKYLKLKMRLMPYIYTYAREASVTGLPMVRAMFLEYPNDPTAYGTATq 293
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 221316699 1653 -QFLLGPAFLVSPVLERNARNVTA----YFPRARWYDYYTG 1688
Cdd:cd06596   294 yQFMWGPDFLVAPVYQNTAAGNDVrngiYLPAGTWIDYWTG 334
GH31_xylosidase_YicI cd06593
alpha-xylosidase YicI-like; YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) ...
1234-1615 7.03e-27

alpha-xylosidase YicI-like; YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269879 [Multi-domain]  Cd Length: 308  Bit Score: 113.05  E-value: 7.03e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYME--RQLDFTLSPK-FAGFPALINRMKADGM 1310
Cdd:cd06593     1 GRPPLPPAWSFGLWLSRSFYYSEEEVLEVADGMRERGIPCDVIHLDCFWMKedWWCDFEWDEErFPDPEGMIARLKEKGF 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1311 RVILILDPAISgnetQPYPAFTRGVEDDVFIKYPNdgdivwGKVWpdfpdvvvngsldwdSQVELYRAYVAFPDFFRNST 1390
Cdd:cd06593    81 KVCLWINPYIS----QDSPLFKEAAEKGYLVKNPD------GSPW---------------HQWDGWQPGMGIIDFTNPEA 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1391 AKWWKREIEELYNnpqnpersLKFDGMWIDMNEpssfvngavspgcrdaslnhppYMPHlesrdrglssktlcmesQQIL 1470
Cdd:cd06593   136 VAWYKEKLKRLLD--------MGVDVIKTDFGE----------------------RIPE-----------------DAVY 168
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1471 PDGSLVQhyNVHNLYGWSQTRPTYEAVQEVTGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGIS 1550
Cdd:cd06593   169 YDGSDGR--KMHNLYPLLYNKAVYEATKEVKGEEAVLWARSAWAGSQRYPVHWGGDSESTFEGMAASLRGGLSLGLSGFG 246
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 221316699 1551 YTGADICGFFQDAEYEMCVRWMQLGAFYPFSRNHntiGTRRQDPVSWDVAFVNISRTVLQTRYTL 1615
Cdd:cd06593   247 FWSHDIGGFEGTPSPELYKRWTQFGLLSSHSRLH---GSTPREPWEYGEEALDVVRKFAKLRYRL 308
PRK10426 PRK10426
alpha-glucosidase; Provisional
259-786 7.45e-26

alpha-glucosidase; Provisional


Pssm-ID: 236691 [Multi-domain]  Cd Length: 635  Bit Score: 115.48  E-value: 7.45e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  259 PSTNVYGLGEhvhqQYRH-DMNWKTWPIF------NRDTTPN------------GNGTNLYGAQTFFLcledaSGLSFGV 319
Cdd:PRK10426   80 PDEHIYGCGE----QFSYfDLRGKPFPLWtseqgvGRNKQTYvtwqadckenagGDYYWTYFPQPTFV-----SSQKYYC 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  320 FLMNSNAMEVVLQPAPAITYRTIGGILDFYVFLGNTPEQVVQEYLELIGR-PALPSyWALGFHLSRYEYGTlDNMREVVE 398
Cdd:PRK10426  151 HVDNSAYMNFDFSAPEYHELELWEDKATLRFECADTYISLLEKLTALFGRqPELPD-WAYDGVTLGIQGGT-EVVQKKLD 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  399 RNRAAQLPYD---VQhadiDYMDERR---------DFTYDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNsssskpyG 466
Cdd:PRK10426  229 TMRNAGVKVNgiwAQ----DWSGIRMtsfgkrlmwNWKWDSERYPQLDSRIKQLNEEGIQFLGYINPYLASD-------G 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  467 P-YDRGSDMKIWVNSSDGVTPLI--GEVWPGqtvFPDYTNPNcAVWWTKEFeLFHNQVEFdgiwidmnevsnfvdgsvsG 543
Cdd:PRK10426  298 DlCEEAAEKGYLAKDADGGDYLVefGEFYAG---VVDLTNPE-AYEWFKEV-IKKNMIGL-------------------G 353
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  544 CStnnlnnppftprildGYL--FCKTLCMDAVQHWGKQYDI-HNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGK 620
Cdd:PRK10426  354 CS---------------GWMadFGEYLPTDAYLHNGVSAEImHNAWPALWAKCNYEALEETGKLGEILFFMRAGYTGSQK 418
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  621 FA-AHWLGDNTATW---DDLRWSIPGVLEFNLFGIPMVGPDICG----FALDTPEELCRRWMQLGAFYPFSRNHNGQgYK 692
Cdd:PRK10426  419 YStLFWAGDQNVDWsldDGLASVVPAALSLGMSGHGLHHSDIGGyttlFGMKRTKELLLRWCEFSAFTPVMRTHEGN-RP 497
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  693 DQDPASFGADSLLLNSSRhYLNIRYTLLPYLYTLFFRAHSRGDTVARPLLHEFYEDNSTWDVHQQFLWGPGLLITPVLDE 772
Cdd:PRK10426  498 GDNWQFDSDAETIAHFAR-MTRVFTTLKPYLKELVAEAAKTGLPVMRPLFLHYEDDAATYTLKYQYLLGRDLLVAPVHEE 576
                         570
                  ....*....|....
gi 221316699  773 GAEKVMAYVPDAVW 786
Cdd:PRK10426  577 GRTDWTVYLPEDKW 590
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
252-368 1.70e-25

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 103.03  E-value: 1.70e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  252 LQLSTRLP-STNVYGLGEHVHqqyRHDMNWKTWPIFNRDT-TPNGNGTNLYGAQTFFLCLEdasglSFGVFLMNSNAMEV 329
Cdd:cd14752    10 LRLSFKLPpDEHFYGLGERFG---GLNKRGKRYRLWNTDQgGYRGSTDPLYGSIPFYLSSK-----GYGVFLDNPSRTEF 81
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 221316699  330 VLQPA--PAITYRTIGGILDFYVFLGNTPEQVVQEYLELIG 368
Cdd:cd14752    82 DFGSEdsDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
GH31_N cd14752
N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to ...
1113-1234 6.03e-24

N-terminal domain of glycosyl hydrolase family 31 (GH31); This family is found N-terminal to the glycosyl-hydrolase domain of Glycoside hydrolase family 31 (GH31). GH31 includes the glycoside hydrolases alpha-glucosidase (EC 3.2.1.20), alpha-1,3-glucosidase (EC 3.2.1.84), alpha-xylosidase (EC 3.2.1.177), sucrase-isomaltase (EC 3.2.1.48 and EC 3.2.1.10), as well as alpha-glucan lyase (EC 4.2.2.13). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite-1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes. The lysosomal acid alpha-glucosidase (GAA) is essential for glycogen degradation and a deficiency or malfunction of this enzyme causes glycogen storage disease II, also known as Pompe disease. In the endoplasmic reticulum, alpha-glucosidase II catalyzes the second step in the N-linked oligosaccharide processing pathway that constitutes part of the quality control system for glycoprotein folding and maturation. The intestinal enzymes sucrase-isomaltase (SI) and maltase-glucoamylase (MGAM) play key roles in the final stage of carbohydrate digestion, making alpha-glucosidase inhibitors useful in the treatment of type 2 diabetes. GH31 alpha-glycosidases are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues of the catalytic domain have been identified as the catalytic nucleophile and the acid/base, respectively. A loop of the N-terminal beta-sandwich domain is part of the active site pocket.


Pssm-ID: 270212 [Multi-domain]  Cd Length: 122  Bit Score: 98.80  E-value: 6.03e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1113 FTFSDmfIRISTRLP-SKYLYGFGEtehRSYRRDLEWHTWGMFSRDQPPGY--KKNSYGVHPYYMGLEedgsAHGVLLLN 1189
Cdd:cd14752     5 VRITP--LRLSFKLPpDEHFYGLGE---RFGGLNKRGKRYRLWNTDQGGYRgsTDPLYGSIPFYLSSK----GYGVFLDN 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699 1190 SNAMDVTFQP--LPALTYRTTGGVLDFYVFLGPTPELVTQQYTELIG 1234
Cdd:cd14752    76 PSRTEFDFGSedSDELTFSSEGGDLDYYFFAGPTPKEVIEQYTELTG 122
GH31_glycosidase_Aec37 cd06599
E.coli Aec37-like; Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family ...
1234-1581 1.49e-22

E.coli Aec37-like; Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269885 [Multi-domain]  Cd Length: 319  Bit Score: 100.75  E-value: 1.49e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGY----QNDSEIASLYDEMVAAQIPYDV-----QYSDIDYMERQLdFTL-SPKFAGFPALIN 1303
Cdd:cd06599     1 GRPALPPRWSLGYLGSTMYYteapDAQEQILDFIDTCREHDIPCDGfhlssGYTSIEDGKRYV-FNWnKDKFPDPKAFFR 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1304 RMKADGMRVILILDPAIsgneTQPYPAFTRGVEDDVFIKYPNDGDIVWGKVWPDfpdvvvNGSldwdsqvelyrayvaFP 1383
Cdd:cd06599    80 KFHERGIRLVANIKPGL----LTDHPHYDELAEKGAFIKDDDGGEPAVGRFWGG------GGS---------------YL 134
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1384 DFFRNSTAKWWKREIEELYnnpqnpersLKF--DGMWIDMNEPSSFVNGAVSPGCRdaslnhpPYMPHLESRdrglSSKT 1461
Cdd:cd06599   135 DFTNPEGREWWKEGLKEQL---------LDYgiDSVWNDNNEYEIWDDDAACCGFG-------KGGPISELR----PIQP 194
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1462 LCMesqqilpdgslvqhynvhnlygwsqTRPTYEAVQEV-TGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIG 1540
Cdd:cd06599   195 LLM-------------------------ARASREAQLEHaPNKRPFVISRSGCAGIQRYAQTWSGDNRTSWKTLKYNIAM 249
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 221316699 1541 MMEFSLFGISYTGADICGFFQDA-EYEMCVRWMQLGAFYP-FS 1581
Cdd:cd06599   250 GLGMSLSGVANYGHDIGGFAGPApEPELFVRWVQNGIFQPrFS 292
GH31_xylosidase_XylS cd06591
xylosidase XylS-like; XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in ...
1234-1590 2.64e-21

xylosidase XylS-like; XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.


Pssm-ID: 269877 [Multi-domain]  Cd Length: 322  Bit Score: 96.86  E-value: 2.64e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1234 GRPVMVPYWSLGFQLCRYGYQNDSEIASLYDEMVAAQIPYDVQYSDIDYMERQL--DFTLSPKFagFP---ALINRMKAD 1308
Cdd:cd06591     1 GKAPMLPKWALGFWQSKERYKTQEELLEVAREYRERGIPLDVIVQDWFYWTEQGwgDMKFDPER--FPdpkGMVDELHKM 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1309 GMRVILILDPAIsGNETQPYPAFtrgVEDDVFIKypndgdivwGKVWPDFPDvvvngsldwdsqvelyrAYVAFPDFFRN 1388
Cdd:cd06591    79 NVKLMISVWPTF-GPGSENYKEL---DEKGLLLR---------TNRGNGGFG-----------------GGTAFYDATNP 128
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1389 STAKWWKREIEELYnnpqnpeRSLKFDGMWIDMNEPSsfvngavspgcrdaslnhppYMPHLESRDRGLSSktlcmesqq 1468
Cdd:cd06591   129 EAREIYWKQLKDNY-------FDKGIDAWWLDATEPE--------------------LDPYDFDNYDGRTA--------- 172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1469 ilpDGSLVQhynVHNLYGWSQTRPTYEAVQEVTGQRGVVI-TRSTFPSSGRW-AGHWLGDNTAAWDQLKKSIIGMMEFSL 1546
Cdd:cd06591   173 ---LGPGAE---VGNAYPLMHAKGIYEGQRATGPDKRVVIlTRSAFAGQQRYgAAVWSGDISSSWETLRRQIPAGLNFGA 246
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 221316699 1547 FGISYTGADICGFF--------QDAEY-EMCVRWMQLGAFYPFSRNHntiGTR 1590
Cdd:cd06591   247 SGIPYWTTDIGGFFggdpepgeDDPAYrELYVRWFQFGAFCPIFRSH---GTR 296
Trefoil pfam00088
Trefoil (P-type) domain;
90-133 3.24e-15

Trefoil (P-type) domain;


Pssm-ID: 459666  Cd Length: 43  Bit Score: 71.20  E-value: 3.24e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 221316699    90 CPVVNELERINCIPdQPPTKATCDQRGCCWNPQGAVSVPWCYYS 133
Cdd:pfam00088    1 CSSVPPSDRFDCGY-PGITQEECEARGCCWDPSVDPGVPWCFYP 43
PD smart00018
P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.
88-136 2.12e-14

P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.


Pssm-ID: 197472  Cd Length: 46  Bit Score: 68.95  E-value: 2.12e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 221316699     88 AECpVVNELERINCIPDqPPTKATCDQRGCCWNPQGaVSVPWCYYSKNH 136
Cdd:smart00018    1 AQC-SVPPSERINCGPP-GITEAECEARGCCFDSSI-SGVPWCFYPNTV 46
GH31_u1 cd06595
glycosyl hydrolase family 31 (GH31); uncharacterized subgroup; This family represents an ...
368-723 8.30e-14

glycosyl hydrolase family 31 (GH31); uncharacterized subgroup; This family represents an uncharacterized GH31 enzyme subgroup found in bacteria and eukaryotes. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269881 [Multi-domain]  Cd Length: 304  Bit Score: 74.16  E-value: 8.30e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  368 GRPALPSYWALGFHLSRYEYGTLDNMREVVERNRAAQLPYDV------QHA-DIDYMDERRDFTYDSVDFKGFPEFVNEL 440
Cdd:cd06595     2 GKPPLIPRYALGNWWSRYWAYSDDDILDLVDNFKRNEIPLSVlvldmdWHItDKKYKNGWTGYTWNKELFPDPKGFLDWL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  441 HNNGQKLVIIVDPAISnnsssskPYGPYDRGSDMKiwvnSSDGVTPLIGEVWPgqtvFpDYTNPN-CAVWwtkeFELFHN 519
Cdd:cd06595    82 HERGLRVGLNLHPAEG-------IRPHEEAYAEFA----KYLGIDPAKIIPIP----F-DVTDPKfLDAY----FKLLIH 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  520 QVEFDGI---WIDMNEvsnfvDGSVSGCSTNNLNnppftprildgylfcktlcmdavqhWGKQYdiHNLYGYSmavatae 596
Cdd:cd06595   142 PLEKQGVdfwWLDWQQ-----GKDSPLAGLDPLW-------------------------WLNHY--HYLDSGR------- 182
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  597 aaktvFPNKRSFILTRSTFAGSGKFAAHWLGDNTATWDDLRWsIPgvlEFNL----FGIPMVGPDICGFALDTPE-ELCR 671
Cdd:cd06595   183 -----NGKRRPLILSRWGGLGSHRYPIGFSGDTEVSWETLAF-QP---YFTAtaanVGYSWWSHDIGGHKGGIEDpELYL 253
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 221316699  672 RWMQLGAFYPFSRNHNGQG-YKDQDPASFGADSllLNSSRHYLNIRYTLLPYL 723
Cdd:cd06595   254 RWVQFGVFSPILRLHSDKGpYYKREPWLWDAKT--FEIAKDYLRLRHRLIPYL 304
Trefoil cd00111
P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated ...
88-134 1.29e-13

P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated molecules, largely found in epithelia of gastrointestinal tissues. Function is not known but it was originally identified from mucosal tissues, where it may have a regulatory or structural role and has also been implicated as a growth fractor in other tissues.The domain is found in 1 to 6 copies where it occurs.


Pssm-ID: 238059  Cd Length: 44  Bit Score: 66.60  E-value: 1.29e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 221316699   88 AECPVvNELERINCIPdQPPTKATCDQRGCCWNPQGaVSVPWCYYSK 134
Cdd:cd00111     1 EWCSV-PPSERIDCGP-PGITQEECEARGCCFDPSI-SGVPWCFYPK 44
GH31_transferase_CtsY cd06597
CtsY (cyclic tetrasaccharide-synthesizing enzyme Y)-like; CtsY is a bacterial ...
1482-1616 4.44e-13

CtsY (cyclic tetrasaccharide-synthesizing enzyme Y)-like; CtsY is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain; CtsZ belongs to a different subfamily. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269883 [Multi-domain]  Cd Length: 326  Bit Score: 72.35  E-value: 4.44e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1482 HNLYGWSQTRPTYEAVQEVTGQrGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMMEFSLFGISYTGADICGFFQ 1561
Cdd:cd06597   186 RNEYPNLYYKAYFDYIREIGND-GVLFSRAGDSGAQRYPIGWVGDQDSTFEGLQSALKAGLSAAWSGYPFWGWDIGGFSG 264
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 221316699 1562 DA-EYEMCVRWMQLGAFYPFSRNHNT-IGTRRQDPVSWDVAFVNISRTVLQT--RYTLL 1616
Cdd:cd06597   265 PLpTAELYLRWTQLAAFSPIMQNHSEkNHRPWSEERRWNVAERTGDPEVLDIyrKYVKL 323
PD smart00018
P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.
957-1000 2.12e-11

P or trefoil or TFF domain; Proposed role in renewal and pathology of mucous epithelia.


Pssm-ID: 197472  Cd Length: 46  Bit Score: 60.48  E-value: 2.12e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 221316699    957 KIRDEEKIDCYPDenGASAENCTARGCIWEaSNSSGVPFCYFVN 1000
Cdd:smart00018    4 SVPPSERINCGPP--GITEAECEARGCCFD-SSISGVPWCFYPN 44
Trefoil cd00111
P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated ...
958-1000 5.68e-11

P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated molecules, largely found in epithelia of gastrointestinal tissues. Function is not known but it was originally identified from mucosal tissues, where it may have a regulatory or structural role and has also been implicated as a growth fractor in other tissues.The domain is found in 1 to 6 copies where it occurs.


Pssm-ID: 238059  Cd Length: 44  Bit Score: 58.89  E-value: 5.68e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 221316699  958 IRDEEKIDCYPdeNGASAENCTARGCIWEaSNSSGVPFCYFVN 1000
Cdd:cd00111     5 VPPSERIDCGP--PGITQEECEARGCCFD-PSISGVPWCFYPK 44
GH31_glucosidase_YihQ cd06594
alpha-glucosidase YihQ-like; YihQ is a bacterial alpha-glucosidase with a conserved glycosyl ...
425-688 1.31e-09

alpha-glucosidase YihQ-like; YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. These latter two belong to different GH31 subfamilies than YihQ. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.


Pssm-ID: 269880 [Multi-domain]  Cd Length: 325  Bit Score: 61.83  E-value: 1.31e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  425 YDSVDFKGFPEFVNELHNNGQKLVIIVDPAISNNSSSskpyGPYDRGSDMKIWVNSSDGvTPLIGEVWPGQTVFPDYTNP 504
Cdd:cd06594    65 WDEELYPGWDELVKELKEQGIRVLGYINPFLANVGPL----YSYKEAEEKGYLVKNKTG-EPYLVDFGEFDAGLVDLTNP 139
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  505 NCAVWwtkefelfhnqveFDGIWIDMNEVSNfvdgsvsgcstnnlnnppftpriLDGYL--FCKTLCMDAVQHWGKQ-YD 581
Cdd:cd06594   140 EARRW-------------FKEVIKENMIDFG-----------------------LSGWMadFGEYLPFDAVLHSGEDaAL 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699  582 IHNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAA-HWLGDNTATW---DDLRWSIPGVLEFNLFGIPMVGPD 657
Cdd:cd06594   184 YHNRYPELWARLNREAVEEAGKEGEIVFFMRSGYTGSPRYSTlFWAGDQNVDWsrdDGLKSVIPGALSSGLSGFSLTHSD 263
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 221316699  658 ICGF-ALDTP-------EELCRRWMQLGAFYPFSRNHNG 688
Cdd:cd06594   264 IGGYtTLFNPlvgykrsKELLMRWAEMAAFTPVMRTHEG 302
GH31_u1 cd06595
glycosyl hydrolase family 31 (GH31); uncharacterized subgroup; This family represents an ...
1504-1619 2.25e-08

glycosyl hydrolase family 31 (GH31); uncharacterized subgroup; This family represents an uncharacterized GH31 enzyme subgroup found in bacteria and eukaryotes. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.


Pssm-ID: 269881 [Multi-domain]  Cd Length: 304  Bit Score: 57.60  E-value: 2.25e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1504 RGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIigmmEF----SLFGISYTGADICGFFQDAE-YEMCVRWMQLGAFY 1578
Cdd:cd06595   187 RPLILSRWGGLGSHRYPIGFSGDTEVSWETLAFQP----YFtataANVGYSWWSHDIGGHKGGIEdPELYLRWVQFGVFS 262
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 221316699 1579 PFSRNHNTIGTR-RQDPVSWDVAFVNISRTVLQTRYTLLPYL 1619
Cdd:cd06595   263 PILRLHSDKGPYyKREPWLWDAKTFEIAKDYLRLRHRLIPYL 304
GH31_glucosidase_YihQ cd06594
alpha-glucosidase YihQ-like; YihQ is a bacterial alpha-glucosidase with a conserved glycosyl ...
1481-1584 3.47e-08

alpha-glucosidase YihQ-like; YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. These latter two belong to different GH31 subfamilies than YihQ. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.


Pssm-ID: 269880 [Multi-domain]  Cd Length: 325  Bit Score: 57.21  E-value: 3.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 221316699 1481 VHNLYG--WSQTrpTYEAVQEVTGQRGVVI-TRSTFPSSGRWAG-HWLGDNTAAW---DQLKKSIIGMMEFSLFGISYTG 1553
Cdd:cd06594   184 YHNRYPelWARL--NREAVEEAGKEGEIVFfMRSGYTGSPRYSTlFWAGDQNVDWsrdDGLKSVIPGALSSGLSGFSLTH 261
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 221316699 1554 ADICG----FFQDAEY----EMCVRWMQLGAFYPFSRNH 1584
Cdd:cd06594   262 SDIGGyttlFNPLVGYkrskELLMRWAEMAAFTPVMRTH 300
Trefoil pfam00088
Trefoil (P-type) domain;
958-998 3.64e-08

Trefoil (P-type) domain;


Pssm-ID: 459666  Cd Length: 43  Bit Score: 51.17  E-value: 3.64e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 221316699   958 IRDEEKIDC-YPdenGASAENCTARGCIWEASNSSGVPFCYF 998
Cdd:pfam00088    4 VPPSDRFDCgYP---GITQEECEARGCCWDPSVDPGVPWCFY 42
Gal_mutarotas_2 pfam13802
Galactose mutarotase-like; This family is found N-terminal to glycosyl-hydrolase domains, and ...
262-329 2.43e-03

Galactose mutarotase-like; This family is found N-terminal to glycosyl-hydrolase domains, and appears to be similar to the galactose mutarotase superfamily.


Pssm-ID: 463987 [Multi-domain]  Cd Length: 67  Bit Score: 38.22  E-value: 2.43e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 221316699   262 NVYGLGEHVhqqyrHDMNWKTWP--IFNRDT-TPNGNGTNLYGAQTFFLCLEDasGLSFGVFLMNSNAMEV 329
Cdd:pfam13802    3 HVYGLGERA-----GPLNKRGTRyrLWNTDAfGYELDTDPLYKSIPFYISHNG--GRGYGVFWDNPAETWF 66
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH