Alternative splice variants encoding unstable protein domains exist in the human brain

J Mol Biol. 2004 Nov 5;343(5):1207-20. doi: 10.1016/j.jmb.2004.09.028.

Abstract

Alternative splicing is recognized as a major mechanism by which protein diversity is increased without significantly increasing genome size in animals, and it has crucial medical implications, as many alternative splice variants are known to cause diseases. Assessing its significance requires knowing what structural changes alternative splicing introduces into the encoded proteins, yet this problem has not been adequately explored. We therefore systematically examined the structures of the proteins encoded by the alternative splice variants in the HUGE protein database derived from long (>4 kb) human brain cDNAs. Limiting our analyses to reliable alternative splice junctions, we found that these junctions have a slight tendency to avoid the interior of SCOP domains and a strong, statistically significant tendency to coincide with SCOP domain boundaries. These findings reflect the occurrence of some alternative splicing events that use protein structural units as cassettes. However, 50 cases were identified in which SCOP domains are disrupted in the middle by alternative splicing. In six of these cases, insertions are introduced at the molecular surface, presumably affecting protein function, while in 11 of the cases the alternatively spliced variants were found to encode pairs of stable and unstable proteins. The mRNAs encoding such unstable proteins are much less abundant than those encoding stable proteins and tend not to have corresponding mRNAs in non-primate species. We propose that most unstable proteins encoded by alternative splice variants lack normal functions and are evolutionary dead ends.
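The comparison of splice junction positions with SCOP domain boundaries can be illustrated with a minimal sketch. It is not the authors' pipeline: the interval representation of domains, the function names, the example coordinates, and the tolerance of 5 residues are all assumptions made for illustration, and no statistical test is included.

```python
from collections import Counter
from typing import Iterable, Tuple

# Hypothetical representation: a domain is an inclusive (start, end)
# pair of residue positions on the protein sequence.
Domain = Tuple[int, int]


def classify_junction(pos: int, domains: Iterable[Domain], tolerance: int = 5) -> str:
    """Label a splice junction position relative to annotated domains.

    The tolerance (in residues) is an assumed value, not taken from the paper.
    """
    domains = list(domains)
    # A junction within `tolerance` residues of any domain end counts as a boundary hit.
    for start, end in domains:
        if abs(pos - start) <= tolerance or abs(pos - end) <= tolerance:
            return "boundary"
    # Otherwise, a junction strictly inside a domain disrupts its interior.
    for start, end in domains:
        if start < pos < end:
            return "interior"
    # Anything else lies in a linker or unannotated region.
    return "outside"


def summarize(junctions: Iterable[int], domains: Iterable[Domain], tolerance: int = 5) -> Counter:
    """Count junctions per category for one protein."""
    domains = list(domains)
    return Counter(classify_junction(p, domains, tolerance) for p in junctions)


# Made-up coordinates, for illustration only.
domains = [(10, 120), (150, 300)]
junctions = [12, 60, 121, 135, 200, 298]
counts = summarize(junctions, domains)
print(counts)
```

Counts of this kind would then need to be compared against a null expectation (for example, junctions placed at random along the same proteins) before any tendency could be called statistically significant.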

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Amino Acid Sequence
  • Brain / metabolism*
  • Humans
  • Nerve Tissue Proteins / genetics*
  • Nerve Tissue Proteins / metabolism
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • Protein Structure, Tertiary
  • RNA / genetics
  • RNA / metabolism
  • RNA Splice Sites
  • Sequence Alignment
  • Sequence Analysis, Protein
  • Sequence Homology, Amino Acid

Substances

  • Nerve Tissue Proteins
  • Protein Isoforms
  • RNA Splice Sites
  • RNA