Reconstruction and analysis of human Alu genes

J Mol Evol. 1991 Feb;32(2):105-21. doi: 10.1007/BF02515383.

Abstract

The existing classification of human Alu sequences is revised and expanded using a novel methodology and a larger set of sequence data. Our study confirms that there are two major Alu subfamilies, Alu-J and Alu-S. The Alu-S subfamily consists of at least five distinct subfamilies referred to as Alu-Sx, Alu-Sq, Alu-Sp, Alu-Sc, and Alu-Sb. The Alu-Sp and Alu-Sq subfamilies have been revealed by this study. Alu subfamilies differ from one another in a number of positions called diagnostic. In this paper the diagnostic positions are defined in quantitative terms and are used to evaluate statistical significance of the observed subfamilies. Each Alu subfamily most likely represents pseudogenes retroposed from evolving functional source Alu genes. Evidence presented in this paper indicates that Alu-Sp and Alu-Sc pseudogenes were retroposed from different source genes, during overlapping periods of time, and at different rates. Our analysis also indicates that the previously identified Alu-type transcript BC200 comes from an active Alu gene that might have existed even before the origin of dimeric Alu sequences. The source genes for Alu pseudogene families are reconstructed. It is assumed that diagnostic differences between reconstructed source genes reflect mutations that have occurred in true source Alu genes under natural selection. Some of these mutations are compensatory and are used to reconstruct a common secondary structure of Alu RNAs transcribed from the source genes. The biological function of Alu RNA is discussed in the context of its homology to the elongation-arresting domain of 7SL RNA.(ABSTRACT TRUNCATED AT 250 WORDS)

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Base Composition
  • Base Sequence
  • Biological Evolution
  • Consensus Sequence
  • DNA / genetics*
  • DNA / physiology
  • Deoxyribonucleases, Type II Site-Specific*
  • Gene Rearrangement
  • Humans
  • Molecular Sequence Data
  • Multigene Family*
  • Nucleic Acid Conformation
  • Pseudogenes
  • RNA / genetics
  • Sequence Homology, Nucleic Acid
  • Virus Diseases / genetics

Substances

  • RNA
  • DNA
  • endodeoxyribonuclease AluI
  • Deoxyribonucleases, Type II Site-Specific