Tropheryma whipplei Twist: a human pathogenic Actinobacteria with a reduced genome

Genome Res. 2003 Aug;13(8):1800-9. doi: 10.1101/gr.1474603.

Abstract

The human pathogen Tropheryma whipplei is the only known reduced genome species (<1 Mb) within the Actinobacteria [high G+C Gram-positive bacteria]. We present the sequence of the 927303-bp circular genome of T. whipplei Twist strain, encoding 808 predicted protein-coding genes. Specific genome features include deficiencies in amino acid metabolisms, the lack of clear thioredoxin and thioredoxin reductase homologs, and a mutation in DNA gyrase predicting a resistance to quinolone antibiotics. Moreover, the alignment of the two available T. whipplei genome sequences (Twist vs. TW08/27) revealed a large chromosomal inversion the extremities of which are located within two paralogous genes. These genes belong to a large cell-surface protein family defined by the presence of a common repeat highly conserved at the nucleotide level. The repeats appear to trigger frequent genome rearrangements in T. whipplei, potentially resulting in the expression of different subsets of cell surface proteins. This might represent a new mechanism for evading host defenses. The T. whipplei genome sequence was also compared to other reduced bacterial genomes to examine the generality of previously detected features. The analysis of the genome sequence of this previously largely unknown human pathogen is now guiding the development of molecular diagnostic tools and more convenient culture conditions.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Actinomycetales / genetics*
  • Actinomycetales / pathogenicity*
  • Actinomycetales Infections / genetics
  • Amino Acids / metabolism
  • Base Composition
  • DNA, Bacterial / analysis
  • Energy Metabolism / genetics
  • GC Rich Sequence / genetics
  • Gene Transfer, Horizontal / genetics
  • Genes, Bacterial / genetics
  • Genes, Bacterial / physiology
  • Genome, Bacterial*
  • Humans
  • Molecular Sequence Data
  • Multigene Family / genetics
  • Predictive Value of Tests
  • Pseudogenes / genetics
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Analysis, DNA

Substances

  • Amino Acids
  • DNA, Bacterial

Associated data

  • GENBANK/AE014184