Homologs of Drosophila P transposons were mobile in zebrafish but have been domesticated in a common ancestor of chicken and human

Mol Biol Evol. 2005 Apr;22(4):833-44. doi: 10.1093/molbev/msi068. Epub 2004 Dec 22.

Abstract

A substantial fraction of vertebrate and invertebrate genomes is composed of mobile elements and their derivatives. One of the most intensively studied transposon families, the P elements of Drosophila, was thought to exist exclusively in the genomes of dipteran insects. Based on the data provided by the human genome project, in 2001 our group has identified a P element-homologous sequence in the human genome. This P element-homologous human gene, named Phsa, is 19,533 nucleotides long, comprises six exons and five introns, and encodes a protein of still unknown function with a length of 903 amino acid residues. The N-terminal THAP domain of the putative Phsa protein shows similarities to the site-specific DNA-binding domain of the Drosophila P element transposase. In the present study, FISH analysis and the screening of a human lambda genomic library revealed a single copy of Phsa located on the long arm of chromosome 4, upstream of a gene coding for the hypothetical protein DKFZp686L1814. The same gene arrangement was found for the homologous gene Pgga in the genome of chicken, thus, displaying Pgga at orthologous position on the long arm of chromosome 4. The single-copy gene status and the absence of terminal inverted repeats and target-site duplications indicate that Phsa and Pgga constitute domesticated stationary sequences. In contrast, a considerable number of P-homologous sequences with terminal inverted repeats and intact target-site duplications could be identified in zebrafish, strongly indicating that Pdre elements were mobile within the zebrafish genome. Pdre elements are the first P-like transposons identified in a vertebrate species. With respect to Phsa, gene expression studies showed that Phsa is expressed in a broad range of human tissues, suggesting that the putative Phsa protein plays a not yet understood but essential role in a specific metabolic pathway. We demonstrate that P-homologous DNA sequences occur in the genomes of 21 analyzed vertebrates but only as rudiments in the rodents. Finally, the evolutionary history of P element-homologous vertebrate sequences is discussed in the context of the "molecular domestication" hypothesis versus the "source gene hypothesis."

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Chickens / genetics*
  • DNA Primers
  • DNA Transposable Elements*
  • Genome
  • Humans
  • In Situ Hybridization, Fluorescence
  • Molecular Sequence Data
  • Phylogeny
  • Polymerase Chain Reaction
  • Sequence Homology, Amino Acid
  • Zebrafish / genetics*

Substances

  • DNA Primers
  • DNA Transposable Elements

Associated data

  • GENBANK/AJ717666
  • GENBANK/AJ717667
  • GENBANK/AJ717668
  • GENBANK/AJ717669
  • GENBANK/AJ717670
  • GENBANK/AJ717671
  • GENBANK/AJ717672
  • GENBANK/AJ717673
  • GENBANK/AJ717674
  • GENBANK/AJ717675
  • GENBANK/AJ717676
  • GENBANK/AJ717677
  • GENBANK/AJ717678
  • GENBANK/AJ717679
  • GENBANK/AJ717680
  • GENBANK/AJ717681
  • GENBANK/AJ717682
  • GENBANK/AJ717683
  • GENBANK/AJ717684
  • GENBANK/AJ717685