Comparative sequence analyses reveal rapid and divergent evolutionary changes of the WFDC locus in the primate lineage

Genome Res. 2007 Mar;17(3):276-86. doi: 10.1101/gr.6004607. Epub 2007 Jan 31.

Abstract

The initial comparison of the human and chimpanzee genome sequences revealed 16 genomic regions with an unusually high density of rapidly evolving genes. One such region is the whey acidic protein (WAP) four-disulfide core domain locus (or WFDC locus), which contains 14 WFDC genes organized in two subloci on human chromosome 20q13. WAP protease inhibitors have roles in innate immunity and/or the regulation of a group of endogenous proteolytic enzymes called kallikreins. In human, the centromeric WFDC sublocus also contains the rapidly evolving seminal genes, semenogelin 1 and 2 (SEMG1 and SEMG2). The rate of SEMG2 evolution in primates has been proposed to correlate with female promiscuity and semen coagulation, perhaps related to post-copulatory sperm competition. We mapped and sequenced the centromeric WFDC sublocus in 12 primate species that collectively represent four different mating systems. Our analyses reveal a 130-kb region with a notably complex evolutionary history that has included nested duplications, deletions, and significant interspecies divergence of both coding and noncoding sequences; together, this has led to striking differences of this region among primates and between primates and rodents. Further, this region contains six closely linked genes (WFDC12, PI3, SEMG1, SEMG2, SLPI, and MATN4) that show strong patterns of adaptive selection, although an unambiguous correlation between gene mutation rates and mating systems could not be established.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Intramural

MeSH terms

  • Animals
  • Base Sequence
  • Chromosome Mapping
  • Chromosomes, Human, Pair 20 / genetics*
  • Elafin
  • Evolution, Molecular*
  • Female
  • Genetic Variation*
  • Humans
  • Male
  • Milk Proteins / genetics
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Primates / genetics*
  • Protein Precursors / genetics
  • Selection, Genetic*
  • Semen / metabolism
  • Seminal Vesicle Secretory Proteins / genetics
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sexual Behavior, Animal / physiology

Substances

  • Elafin
  • Milk Proteins
  • PI3 protein, human
  • Protein Precursors
  • Seminal Vesicle Secretory Proteins
  • seminal vesicle-specific antigen
  • whey acidic proteins