Analysis of the gene-dense major histocompatibility complex class III region and its comparison to mouse

Genome Res. 2003 Dec;13(12):2621-36. doi: 10.1101/gr.1736803.

Abstract

In mammals, the Major Histocompatibility Complex class I and II gene clusters are separated by an approximately 700-kb stretch of sequence called the MHC class III region, which has been associated with susceptibility to numerous diseases. To facilitate understanding of this medically important and architecturally interesting portion of the genome, we have sequenced and analyzed both the human and mouse class III regions. The cross-species comparison has facilitated the identification of 60 genes in human and 61 in mouse, including a potential RNA gene for which the introns are more conserved across species than the exons. Delineation of global organization, gene structure, alternative splice forms, protein similarities, and potential cis-regulatory elements leads to several conclusions: (1) The human MHC class III region is the most gene-dense region of the human genome: >14% of the sequence is coding, approximately 72% of the region is transcribed, and there is an average of 8.5 genes per 100 kb. (2) Gene sizes, number of exons, and intergenic distances are for the most part similar in both species, implying that interspersed repeats have had little impact in disrupting the tight organization of this densely packed set of genes. (3) The region contains a heterogeneous mixture of genes, only a few of which have a clearly defined and proven function. Although many of the genes are of ancient origin, some appear to exist only in mammals and fish, implying they might be specific to vertebrates. (4) Conserved noncoding sequences are found primarily in or near the 5'-UTR or the first intron of genes, and seldom in the intergenic regions. Many of these conserved blocks are likely to be cis-regulatory elements.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alternative Splicing / genetics
  • Animals
  • Conserved Sequence
  • Evolution, Molecular
  • Genes*
  • Humans
  • Major Histocompatibility Complex / genetics*
  • Mice
  • Molecular Sequence Data
  • Polymorphism, Genetic / genetics
  • Protein Biosynthesis
  • Proteins / genetics
  • RNA, Untranslated / genetics
  • Sequence Analysis, DNA / methods

Substances

  • Proteins
  • RNA, Untranslated

Associated data

  • GENBANK/AC007080
  • GENBANK/AF019413
  • GENBANK/AF030001
  • GENBANK/AF049850
  • GENBANK/AF109719
  • GENBANK/AF109905
  • GENBANK/AF109906
  • GENBANK/AF129756
  • GENBANK/AF134726
  • GENBANK/U89335
  • GENBANK/U89336
  • GENBANK/U89337