Emergence of young human genes after a burst of retroposition in primates

PLoS Biol. 2005 Nov;3(11):e357. doi: 10.1371/journal.pbio.0030357. Epub 2005 Oct 11.

Abstract

The origin of new genes through gene duplication is fundamental to the evolution of lineage- or species-specific phenotypic traits. In this report, we estimate the number of functional retrogenes on the lineage leading to humans generated by the high rate of retroposition (retroduplication) in primates. Extensive comparative sequencing and expression studies coupled with evolutionary analyses and simulations suggest that a significant proportion of recent retrocopies represent bona fide human genes. We estimate that at least one new retrogene per million years emerged on the human lineage during the past approximately 63 million years of primate evolution. Detailed analysis of a subset of the data shows that the majority of retrogenes are specifically expressed in testis, whereas their parental genes show broad expression patterns. Consistently, most retrogenes evolved functional roles in spermatogenesis. Proteins encoded by X chromosome-derived retrogenes were strongly preserved by purifying selection following the duplication event, supporting the view that they may act as functional autosomal substitutes during X-inactivation of late spermatogenesis genes. Also, some retrogenes acquired a new or more adapted function driven by positive selection. We conclude that retroduplication significantly contributed to the formation of recent human genes and that most new retrogenes were progressively recruited during primate evolution by natural and/or sexual selection to enhance male germline function.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Biological Evolution*
  • Cell Lineage
  • Computer Simulation
  • Evolution, Molecular
  • Genome
  • Genome, Human
  • Humans
  • Kinetics
  • Likelihood Functions
  • Male
  • Molecular Sequence Data
  • Open Reading Frames
  • Peptides
  • Phenotype
  • Phylogeny
  • Polymerase Chain Reaction
  • Primates
  • Retroelements / genetics*
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sequence Analysis, DNA
  • Sex Factors
  • Spermatogenesis
  • Testis / metabolism
  • Time Factors
  • Tissue Distribution

Substances

  • Peptides
  • Retroelements

Associated data

  • GENBANK/DQ120612
  • GENBANK/DQ120613
  • GENBANK/DQ120614
  • GENBANK/DQ120615
  • GENBANK/DQ120616
  • GENBANK/DQ120617
  • GENBANK/DQ120618
  • GENBANK/DQ120619
  • GENBANK/DQ120620