Diversification and molecular evolution of ATOH8, a gene encoding a bHLH transcription factor

PLoS One. 2011;6(8):e23005. doi: 10.1371/journal.pone.0023005. Epub 2011 Aug 4.

Abstract

ATOH8 is a bHLH domain transcription factor implicated in the development of the nervous system, kidney, pancreas, retina and muscle. In the present study, we collected sequence of ATOH8 orthologues from 18 vertebrate species and 24 invertebrate species. The reconstruction of ATOH8 phylogeny and sequence analysis showed that this gene underwent notable divergences during evolution. For those vertebrate species investigated, we analyzed the gene structure and regulatory elements of ATOH8. We found that the bHLH domain of vertebrate ATOH8 was highly conserved. Mammals retained some specific amino acids in contrast to the non-mammalian orthologues. Mammals also developed another potential isoform, verified by a human expressed sequence tag (EST). Comparative genomic analyses of the regulatory elements revealed a replacement of the ancestral TATA box by CpG-islands in the eutherian mammals and an evolutionary tendency for TATA box reduction in vertebrates in general. We furthermore identified the region of the effective promoter of human ATOH8 which could drive the expression of EGFP reporter in the chicken embryo. In the opossum, both the coding region and regulatory elements of ATOH8 have some special features, such as the unique extended C-terminus encoded by the third exon and absence of both CpG islands and TATA elements in the regulatory region. Our gene mapping data showed that in human, ATOH8 was hosted in one chromosome which is a fusion product of two orthologous chromosomes in non-human primates. This unique chromosomal environment of human ATOH8 probably subjects its expression to the regulation at chromosomal level. We deduce that the great interspecific differences found in both ATOH8 gene sequence and its regulatory elements might be significant for the fine regulation of its spatiotemporal expression and roles of ATOH8, thus orchestrating its function in different tissues and organisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Basic Helix-Loop-Helix Transcription Factors / classification
  • Basic Helix-Loop-Helix Transcription Factors / genetics*
  • Basic Helix-Loop-Helix Transcription Factors / metabolism
  • Bayes Theorem
  • Cats
  • Cattle
  • Chick Embryo
  • Chromosome Mapping
  • Chromosomes, Human, Pair 2 / genetics
  • Evolution, Molecular*
  • Gene Expression Regulation
  • Genetic Variation*
  • Green Fluorescent Proteins / genetics
  • Green Fluorescent Proteins / metabolism
  • Humans
  • In Situ Hybridization, Fluorescence
  • Invertebrates / genetics
  • Mice
  • Molecular Sequence Data
  • Phylogeny
  • Primates
  • Rats
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Sequence Homology, Amino Acid
  • Species Specificity
  • Vertebrates / genetics

Substances

  • Basic Helix-Loop-Helix Transcription Factors
  • Green Fluorescent Proteins

Associated data

  • GENBANK/FN868883
  • GENBANK/FN868884
  • GENBANK/FN868885
  • GENBANK/FN868886
  • GENBANK/FN868887
  • GENBANK/FN868888
  • GENBANK/FN868889
  • GENBANK/FN868890
  • GENBANK/FN868891