Differing evolutionary histories of WFDC8 (short-term balancing) in Europeans and SPINT4 (incomplete selective sweep) in Africans

Mol Biol Evol. 2011 Oct;28(10):2811-22. doi: 10.1093/molbev/msr106. Epub 2011 May 2.

Abstract

The whey acidic protein four-disulfide core (WFDC) gene cluster on human chromosome 20q13 harbors 15 small serine protease inhibitor genes with roles in innate immunity, reproduction, and regulation of the endogenous kallikrein proteases. The WFDC cluster has emerged as a prime example of rapid diversification and adaptive evolution in primates. This study sought a better understanding of the evolutionary history of WFDC genes in humans and focused on the signatures of adaptive selection found in populations of European (Utah residents with ancestry from northern and western Europe [CEU]) and African (Yoruba from Ibadan, Nigeria [YRI]) ancestry in a genome-wide scan for putative targets of recent adaptive selection. Our approach included resequencing coding and noncoding regions of WFDC6, EPPIN, and WFDC8 in 20 CEU individuals and of SPINT4 in 20 YRI individuals. We generated 302 kb and 60 kb of high-quality sequence data from the CEU and YRI populations, respectively, enabling the identification of 72 single nucleotide polymorphisms. Using classic neutrality tests together with empirical and haplotype-based analyses, we pinpointed WFDC8 as the likely target of short-term balancing selection in the CEU population and SPINT4 as the likely target of recent positive selection (an incomplete selective sweep) in the YRI population. Putative candidate variants targeted by selection include 44A (rs7273669A) for WFDC8, which may downregulate gene expression by abolishing the binding site of two transcription factors; and a haplotype configuration [Ser73+98A] (rs6017667A-rs6032474A) for SPINT4, which may simultaneously affect protein function and gene regulation. We propose that the evolution of WFDC8 and SPINT4 has been shaped by complex selective scenarios arising from the interdependence of variant fitness and ecological variables.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Biological
  • Black People / genetics*
  • Evolution, Molecular*
  • Humans
  • Immunity, Innate
  • Milk Proteins / genetics*
  • Multigene Family*
  • Polymorphism, Single Nucleotide
  • Selection, Genetic
  • Serine Proteinase Inhibitors / genetics*
  • White People / genetics*

Substances

  • Milk Proteins
  • Serine Proteinase Inhibitors
  • whey acidic proteins