Detection, validation, and downstream analysis of allelic variation in gene expression

Genetics. 2010 Jan;184(1):119-28. doi: 10.1534/genetics.109.107474. Epub 2009 Nov 2.

Abstract

Common sequence variants within a gene often generate important differences in expression of corresponding mRNAs. This high level of local (allelic) control-or cis modulation-rivals that produced by gene targeting, but expression is titrated finely over a range of levels. We are interested in exploiting this allelic variation to study gene function and downstream consequences of differences in expression dosage. We have used several bioinformatics and molecular approaches to estimate error rates in the discovery of cis modulation and to analyze some of the biological and technical confounds that contribute to the variation in gene expression profiling. Our analysis of SNPs and alternative transcripts, combined with eQTL maps and selective gene resequencing, revealed that between 17 and 25% of apparent cis modulation is caused by SNPs that overlap probes rather than by genuine quantitative differences in mRNA levels. This estimate climbs to 40-50% when qualitative differences between isoform variants are included. We have developed an analytical approach to filter differences in expression and improve the yield of genuine cis-modulated transcripts to approximately 80%. This improvement is important because the resulting variation can be successfully used to study downstream consequences of altered expression on higher-order phenotypes. Using a systems genetics approach we show that two validated cis-modulated genes, Stk25 and Rasd2, are likely to control expression of downstream targets and affect disease susceptibility.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • 3' Untranslated Regions / genetics
  • Alleles*
  • Alternative Splicing
  • Animals
  • Computational Biology*
  • Databases, Genetic
  • Gene Expression / genetics*
  • Gene Expression Profiling*
  • Humans
  • Mice
  • Oligonucleotide Array Sequence Analysis
  • Polymerase Chain Reaction
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Reproducibility of Results
  • Sequence Analysis, DNA
  • Transcription, Genetic

Substances

  • 3' Untranslated Regions
  • RNA, Messenger