Genomic DNA sequence for human C-reactive protein

J Biol Chem. 1985 Oct 25;260(24):13377-83.

Abstract

The gene for the prototype acute phase reactant, C-reactive protein, has been isolated from two lambda phage libraries containing inserted human DNA fragments using synthetic oligonucleotide probes. Nucleotide sequence analysis indicates that after coding for a signal peptide of 18 amino acids and the first two amino acids of the mature protein, there is an intron of 278 base pairs followed by the nucleotide sequence for the remaining 204 amino acids. The intron is unusual in that it contains on the positive strand a poly(A) stretch 16 nucleotides long and a poly(GT) region 30 nucleotides long which could adopt the Z-form of DNA. The nucleotide sequence reported here confirms the amino acid sequence of mature C-reactive protein as originally reported except that it codes for an additional 19 amino acids beginning at position 62. Thus DNA sequence analysis predicts that the mature protein consists of 206 amino acids rather than 187 as originally reported. The mRNA cap site is located 104 nucleotides from the start of the signal peptide and there is a 3' noncoding region 1.2 kilobase pairs in length. The gene has a typical promoter containing the sequences TATAAAT and CAAT 29 and 81 base pairs upstream, respectively, of the cap site.

MeSH terms

  • Amino Acid Sequence
  • Bacteriophage lambda / genetics
  • Base Sequence
  • C-Reactive Protein / genetics*
  • DNA / genetics*
  • DNA Restriction Enzymes
  • DNA, Recombinant
  • Humans
  • Nucleic Acid Hybridization
  • Oligodeoxyribonucleotides / genetics
  • RNA Caps / genetics
  • RNA, Messenger / genetics

Substances

  • DNA, Recombinant
  • Oligodeoxyribonucleotides
  • RNA Caps
  • RNA, Messenger
  • C-Reactive Protein
  • DNA
  • DNA Restriction Enzymes

Associated data

  • GENBANK/M11725