Non-EST-based prediction of novel alternatively spliced cassette exons with cell signaling function in Caenorhabditis elegans and human

Nucleic Acids Res. 2007;35(10):3192-202. doi: 10.1093/nar/gkm187. Epub 2007 Apr 22.

Abstract

To better understand the complex role that alternative splicing plays in intracellular signaling, it is important to catalog the numerous splice variants involved in signal transduction. Therefore, we developed PASE (Prediction of Alternative Signaling Exons), a computational tool to identify novel alternative cassette exons that code for kinase phosphorylation or signaling protein-binding sites. We first applied PASE to the Caenorhabditis elegans genome. In this organism, our algorithm had an overall specificity of > or =76.4%, including 33 novel cassette exons that we experimentally verified. We then used PASE to analyze the human genome and made 804 predictions, of which 308 were found as alternative exons in the transcript database. We experimentally tested 384 of the remaining unobserved predictions and discovered 26 novel human exons for a total specificity of > or =41.5% in human. By using a test set of known alternatively spliced signaling exons, we determined that the sensitivity of PASE is approximately 70%. GO term analysis revealed that our exon predictions were found in the introns of known signal transduction genes more often than expected by chance, indicating PASE enriches for splice variants that function in signaling pathways. Overall, PASE was able to uncover 59 novel alternative cassette exons in C. elegans and humans through a genome-wide ab initio prediction method that enriches for exons involved in signaling.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Alternative Splicing*
  • Amino Acid Motifs
  • Animals
  • Caenorhabditis elegans / genetics
  • Caenorhabditis elegans Proteins / chemistry
  • Caenorhabditis elegans Proteins / genetics*
  • Caenorhabditis elegans Proteins / metabolism
  • Computational Biology / methods*
  • Estrogen Receptor alpha / genetics
  • Estrogen Receptor alpha / metabolism
  • Exons*
  • Expressed Sequence Tags / chemistry
  • Genomics / methods*
  • Humans
  • Intracellular Signaling Peptides and Proteins / chemistry
  • Intracellular Signaling Peptides and Proteins / genetics*
  • Intracellular Signaling Peptides and Proteins / metabolism
  • Low Density Lipoprotein Receptor-Related Protein-1 / genetics
  • Low Density Lipoprotein Receptor-Related Protein-1 / metabolism
  • Protein Kinases / chemistry
  • Protein Kinases / genetics
  • Protein Kinases / metabolism
  • RNA Splice Sites
  • Signal Transduction

Substances

  • Caenorhabditis elegans Proteins
  • Estrogen Receptor alpha
  • Intracellular Signaling Peptides and Proteins
  • Low Density Lipoprotein Receptor-Related Protein-1
  • RNA Splice Sites
  • Protein Kinases