MUC21 mucin 21, cell surface associated [ Homo sapiens (human) ]

Gene ID: 394263, updated on 5-Mar-2024

Summary

Official Symbol
MUC21 (provided by HGNC)
Official Full Name
mucin 21, cell surface associated (provided by HGNC)
Primary source
HGNC:HGNC:21661
See related
Ensembl:ENSG00000204544; MIM:616991; AllianceGenome:HGNC:21661
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
MUC-21; KMQK697; C6orf205
Summary
This gene encodes a large membrane-bound glycoprotein that is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. The encoded protein contains an N-terminal signal sequence, an extracellular mucin domain, a stem domain, a transmembrane domain, and a C-terminal cytoplasmic tail domain. The mucin domain contains O-glycosylation sites and is polymorphic, with isoforms containing a variable number of nonidentical proline-, threonine-, and serine-rich tandem repeats of 15 amino acids each. Aberrant expression of this gene is associated with lung adenocarcinoma. [provided by RefSeq, May 2017]
Expression
Restricted expression toward esophagus (RPKM 485.6)

Genomic context

Location:
6p21.33
Exon count:
3
Annotation release  Status             Assembly                         Chr  Location
RS_2023_10          current            GRCh38.p14 (GCF_000001405.40)    6    NC_000006.12 (30983718..30989903)
RS_2023_10          current            T2T-CHM13v2.0 (GCF_009914755.1)  6    NC_060930.1 (30848372..30854562)
105.20220307        previous assembly  GRCh37.p13 (GCF_000001405.25)    6    NC_000006.11 (30951495..30957680)
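
As a rough illustration, the start..end ranges in the Location column can be converted into span lengths. This sketch assumes the coordinates are 1-based and inclusive, which is the usual convention for such records but is an assumption here. The helper name `span_length` is hypothetical and not part of any NCBI tool.

```python
import re

def span_length(location: str) -> int:
    """Return the length of a 'start..end' range, assuming 1-based inclusive coordinates."""
    match = re.search(r"(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"no start..end range found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

# Location values copied from the table above.
locations = {
    "GRCh38.p14": "NC_000006.12 (30983718..30989903)",
    "T2T-CHM13v2.0": "NC_060930.1 (30848372..30854562)",
    "GRCh37.p13": "NC_000006.11 (30951495..30957680)",
}

for assembly, location in locations.items():
    print(assembly, span_length(location))
```

Under that assumption, the GRCh38.p14 and GRCh37.p13 ranges both span 6186 bases, while the T2T-CHM13v2.0 range spans 6191.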

Chromosome 6 - NC_000006.12

Genomic context describing neighboring genes:

  • mucin like 3
  • H3K4me1 hESC enhancer GRCh37_chr6:30923489-30924169
  • HLA complex group 21
  • N-ethylmaleimide-sensitive factor attachment protein, gamma pseudogene 2
  • CDK7 strongly-dependent group 2 enhancer GRCh37_chr6:30954612-30955811
  • NANOG-H3K27ac hESC enhancer GRCh37_chr6:30960303-30961066
  • mucin 22
  • H3K27ac hESC enhancer GRCh37_chr6:30997211-30997711
  • Sharpr-MPRA regulatory region 1306
  • H3K4me1 hESC enhancer GRCh37_chr6:31021318-31021818
  • HLA complex group 22 (non-protein coding)

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed on tissue samples from 95 human individuals representing 27 different tissues in order to determine the tissue specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

Phenotypes

EBI GWAS Catalog

Description

  • A genome-wide association study identifies two new risk loci for Graves' disease.
  • Genome-wide association study of pre-eclampsia detects novel maternal single nucleotide polymorphisms and copy-number variants in subsets of the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study cohort.
  • Genome-wide association study of Stevens-Johnson Syndrome and Toxic Epidermal Necrolysis in Europe.
  • Genome-wide joint meta-analysis of SNP and SNP-by-smoking interaction identifies novel loci for pulmonary function.
  • GWAS of blood cell traits identifies novel associated loci and epistatic interactions in Caucasian and African-American children.
  • Multiple loci are associated with white blood cell phenotypes.

General gene information

Markers

Clone Names

  • MGC125609

Gene Ontology (provided by GOA)

Process                                                Evidence code
involved_in negative regulation of cell-cell adhesion  IBA (Inferred from Biological aspect of Ancestor)

Component                     Evidence code
located_in Golgi lumen        TAS (Traceable Author Statement)
is_active_in plasma membrane  IBA (Inferred from Biological aspect of Ancestor)
located_in plasma membrane    TAS (Traceable Author Statement)

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
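
The version comparison described above can be sketched in a few lines. This is a minimal illustration, assuming RefSeq identifiers follow the `accession.version` pattern seen in this record (for example NM_001010909.5). The function names are hypothetical, and the annotated-build versions in the usage example are invented placeholders, since this record does not list them.

```python
def split_accession(accession: str) -> tuple[str, int]:
    """Split 'NM_001010909.5' into ('NM_001010909', 5)."""
    base, _, version = accession.partition(".")
    return base, int(version)

def find_version_mismatches(curated, annotated):
    """Map each shared accession to (curated version, annotated version) when they differ."""
    annotated_versions = dict(split_accession(acc) for acc in annotated)
    mismatches = {}
    for acc in curated:
        base, version = split_accession(acc)
        other = annotated_versions.get(base)
        if other is not None and other != version:
            mismatches[base] = (version, other)
    return mismatches

# Curated versions are taken from this record; the annotated-build
# versions below are hypothetical and only serve to show a mismatch.
curated = ["NM_001010909.5", "NR_130720.3"]
annotated = ["NM_001010909.4", "NR_130720.3"]
print(find_version_mismatches(curated, annotated))
```

Accessions present in only one of the two lists are ignored in this sketch; a stricter check might report them separately.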

Genomic

  1. NG_054937.1 RefSeqGene

    Range
    5011..11196

mRNA and Protein(s)

  1. NM_001010909.5 → NP_001010909.2  mucin-21 isoform 1 precursor

    Status: REVIEWED

    Description
    Transcript Variant: This variant (1) represents the shortest transcript and encodes isoform (1).
    Source sequence(s)
    AK298642, AL669830, BC105735, BC105737, DA763771, DC373882
    Consensus CDS
    CCDS34388.1
    UniProtKB/Swiss-Prot
    B0UZT7, B4DQ55, C9JMK2, D9N007, Q0VGF1, Q3B7T2, Q5SS94, Q5SSG8, Q6UXC5
    Related
    ENSP00000365473.3, ENST00000376296.3
    Conserved Domains (2) summary
    pfam05647
    Location:3297
    Epiglycanin_TR; Tandem-repeating region of mucin, epiglycanin-like
    pfam14654
    Location:464..562
    Epiglycanin_C; Mucin, catalytic, TM and cytoplasmic tail region
  2. NM_001322370.2 → NP_001309299.1  mucin-21 isoform 2 precursor

    Status: REVIEWED

    Description
    Transcript Variant: This variant (3) represents an alternate allele located on the reference alternate locus, ALT_REF_LOCI_4. The encoded isoform (2) is longer than isoform (1).
    Source sequence(s)
    AL669830, AY358415
    UniProtKB/TrEMBL
    A0A0G2JKD1
    Conserved Domains (2) summary
    pfam05647
    Location:122..177
    Epiglycanin_TR; Tandem-repeating region of mucin, epiglycanin-like
    pfam14654
    Location:494..592
    Epiglycanin_C; Mucin, catalytic, TM and cytoplasmic tail region
  3. NM_001322371.2 → NP_001309300.2  mucin-21 isoform 3 precursor

    Status: REVIEWED

    Description
    Transcript Variant: This variant (4) represents an alternate allele located on the reference alternate locus, ALT_REF_LOCI_6. The encoded isoform (3) is longer than isoform (1).
    Source sequence(s)
    AK299471, AL669830
    UniProtKB/TrEMBL
    B4DRX4
    Conserved Domains (2) summary
    pfam05647
    Location:122..177
    Epiglycanin_TR; Tandem-repeating region of mucin, epiglycanin-like
    pfam14654
    Location:524..622
    Epiglycanin_C; Mucin, catalytic, TM and cytoplasmic tail region

RNA

  1. NR_130720.3 RNA Sequence

    Status: REVIEWED

    Description
    Transcript Variant: This variant (2) uses an alternate splice site in the 5' terminal exon compared to variant 1. This variant is represented as non-coding because the predicted protein does not meet RefSeq quality criteria.
    Source sequence(s)
    AK298642, AL669830, BC105735, BC105736, DA763771, DC373882

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000006.12 Reference GRCh38.p14 Primary Assembly

    Range
    30983718..30989903

Reference GRCh38.p14 ALT_REF_LOCI_1

Genomic

  1. NT_167244.2 Reference GRCh38.p14 ALT_REF_LOCI_1

    Range
    2313208..2319398

Reference GRCh38.p14 ALT_REF_LOCI_2

Genomic

  1. NT_113891.3 Reference GRCh38.p14 ALT_REF_LOCI_2

    Range
    2463279..2469484

Reference GRCh38.p14 ALT_REF_LOCI_3

Genomic

  1. NT_167245.2 Reference GRCh38.p14 ALT_REF_LOCI_3

    Range
    2240011..2246201

Reference GRCh38.p14 ALT_REF_LOCI_4

Genomic

  1. NT_167246.2 Reference GRCh38.p14 ALT_REF_LOCI_4

    Range
    2293951..2300241

Reference GRCh38.p14 ALT_REF_LOCI_5

Genomic

  1. NT_167247.2 Reference GRCh38.p14 ALT_REF_LOCI_5

    Range
    2327846..2334046

Reference GRCh38.p14 ALT_REF_LOCI_6

Genomic

  1. NT_167248.2 Reference GRCh38.p14 ALT_REF_LOCI_6

    Range
    2238874..2245269

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060930.1 Alternate T2T-CHM13v2.0

    Range
    30848372..30854562