Comparative genomic analysis of three strains of Ehrlichia ruminantium reveals an active process of genome size plasticity

J Bacteriol. 2006 Apr;188(7):2533-42. doi: 10.1128/JB.188.7.2533-2542.2006.

Abstract

Ehrlichia ruminantium is the causative agent of heartwater, a major tick-borne disease of livestock in Africa that has been introduced into the Caribbean and is threatening to emerge and spread on the American mainland. We sequenced the complete genomes of two strains of E. ruminantium of differing phenotypes, strains Gardel (Erga; 1,499,920 bp), from the island of Guadeloupe, and Welgevonden (Erwe; 1,512,977 bp), originating in South Africa and maintained in Guadeloupe in a different cell environment. Comparative genomic analysis of these two strains was performed with the recently published genome of the parent strain of Erwe (Erwo) and with those of other Rickettsiales (Anaplasma, Wolbachia, and Rickettsia spp.). Gene order is highly conserved among the E. ruminantium strains and with A. marginale. In contrast, there is very little conservation of gene order with members of the Rickettsiaceae. However, gene order may be locally conserved, as illustrated by the tuf operons. Eighteen truncated protein-encoding sequences (CDSs) differentiate Erga from Erwe/Erwo, whereas four other truncated CDSs differentiate Erwe from Erwo. Moreover, E. ruminantium displays the lowest coding ratio observed among bacteria, owing to unusually long intergenic regions. This is related to an active process of genome expansion/contraction targeted at tandem repeats in noncoding regions and based on the addition or removal of ca. 150-bp tandem units. This process seems to be specific to E. ruminantium and is not observed in the other Rickettsiales.
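
The two quantities the abstract relies on, genome size difference and coding ratio, can be illustrated with a minimal Python sketch. It is not the authors' method. The function names and the interval format (1-based, inclusive CDS coordinates) are assumptions made for illustration, and the toy intervals in the usage note are invented, not taken from the genomes. Only the two genome lengths and the approximate 150-bp unit size come from the abstract. Expressing the size difference in 150-bp units is scale arithmetic only: the abstract does not state that the whole difference is due to tandem units.

```python
# Genome lengths reported in the abstract.
ERGA_BP = 1_499_920   # strain Gardel
ERWE_BP = 1_512_977   # strain Welgevonden
TANDEM_UNIT_BP = 150  # "ca. 150-bp" tandem unit, approximate


def size_difference_bp(length_a, length_b):
    """Absolute difference in length between two genomes, in bp."""
    return abs(length_a - length_b)


def coding_ratio(cds_intervals, genome_length):
    """Fraction of the genome covered by at least one CDS.

    cds_intervals: iterable of (start, end) pairs, 1-based and inclusive
    (an assumed format). Overlapping CDSs are counted only once.
    """
    covered = 0
    current_end = 0  # last position already counted
    for start, end in sorted(cds_intervals):
        start = max(start, current_end + 1)  # skip bases already counted
        if end >= start:
            covered += end - start + 1
            current_end = end
    return covered / genome_length


diff = size_difference_bp(ERGA_BP, ERWE_BP)
print(f"Erwe - Erga size difference: {diff} bp")
print(f"Equivalent to roughly {diff / TANDEM_UNIT_BP:.0f} units of {TANDEM_UNIT_BP} bp")
```

With invented toy coordinates, `coding_ratio([(1, 10), (5, 20), (31, 40)], 100)` counts 30 covered bases out of 100. Long intergenic regions lower this ratio because they add to the genome length without adding covered bases.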

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Conserved Sequence
  • Ehrlichia ruminantium / classification*
  • Ehrlichia ruminantium / genetics*
  • Evolution, Molecular*
  • Gene Order
  • Genetic Variation / genetics*
  • Genome, Bacterial*
  • Molecular Sequence Data
  • Mutagenesis / genetics*
  • Phenotype
  • Species Specificity
  • Tandem Repeat Sequences / genetics

Associated data

  • GENBANK/CR925677
  • GENBANK/CR925678