Sequence-based discovery of the human and rodent peroxisomal proteome

Appl Bioinformatics. 2005;4(2):93-104. doi: 10.2165/00822942-200504020-00003.

Abstract

Background: Peroxisomes are metabolic organelles present in virtually all eukaryotic cells. They contain enzymes involved in hydrogen peroxide-based respiration and lipid metabolism. At present, only a small number of peroxisomal enzymes that are associated with oxidative stress response and metabolic disorders have been characterised biochemically. Therefore, we devised a sequence-based, multistep knowledge discovery strategy to identify potential novel peroxisomal protein candidates in small rodent model organisms and human.

Methods: Screening of 130,629 putative translations of GenBank rodent and primate mRNA sequences was limited to the classical type-1 peroxisomal targeting signal [SA]-K-L. This motif is over-represented among peroxisomal proteins and has a high targeting efficiency. Subsequent steps of identifying co-occurring motifs, secondary structure properties, orthologues and variants, in combination with literature searching and visual inspection by domain experts, aimed at reduction of both false positive and negative validation targets.

Results: Our method yielded 117 known peroxisome-targeted proteins and 29 novel candidate proteins. Of special interest were the mouse C530046K17Rik and 1300019N10Rik protein sequences that contain domains associated with enzymatic functions. C530046K17Rik showed no similarity to any known sequence of the animal kingdom, but weak similarity to the possible Leishmania quinone oxidoreductase and a putative cyanobacterium nicotinamide adenine dinucleotide phosphate (NADP)-dependent oxidoreductase. 1300019N10Rik contains two protease-related domains, glutamyl endopeptidase I and trypsin-like serine and cysteine proteases, which may have unique specificities to achieve efficient breakdown of proteins in the peroxisomes.

Conclusion: One mouse C57BL/6J strain-specific isocitrate dehydrogenase 1 isoform might be suitable to investigate potential phenotypes associated with the deficit of the intraperoxisomal reduced form of NADP (NADPH) and 2-oxoglutarate. Our biological knowledge discovery strategy enabled not only the identification of peroxisomal enzymes already described in the literature, but also the prediction of several novel proteins with possible roles in peroxisomal biochemistry and metabolism that are currently under experimental validation.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Conserved Sequence
  • Enzymes / chemistry*
  • Enzymes / metabolism*
  • Humans
  • Mice
  • Molecular Sequence Data
  • Peroxisomes / chemistry*
  • Peroxisomes / metabolism*
  • Proteome / analysis
  • Proteome / chemistry*
  • Proteome / metabolism*
  • Rats
  • Sequence Alignment / methods
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid
  • Species Specificity

Substances

  • Enzymes
  • Proteome