Identification of high-molecular-weight proteins with multiple EGF-like motifs by motif-trap screening

Genomics. 1998 Jul 1;51(1):27-34. doi: 10.1006/geno.1998.5341.

Abstract

To identify large proteins with EGF-like motifs in a systematic manner, we developed a computer-assisted method called motif-trap screening. The method exploits 5'-end single-pass sequence data obtained from a pool of cDNAs whose sizes exceed 5 kb. Using this screening procedure, we identified five known and nine new genes for proteins with multiple EGF-like motifs from 8000 redundant human brain cDNA clones. The new genes were found to encode a novel mammalian homologue of Drosophila fat protein, two seven-transmembrane proteins containing multiple cadherin and EGF-like motifs, two mammalian homologues of Drosophila slit protein, an unidentified LDL receptor-like protein, and three totally uncharacterized proteins. The organization of the domains in these proteins, together with their expression profiles and fine chromosomal locations, points to their biological significance and demonstrates that motif-trap screening is a powerful tool for discovering new genes that have been difficult to identify by conventional methods.
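
The abstract does not spell out how the computer-assisted screen was implemented. As a rough illustration only, the idea of trapping clones by motif content can be sketched as follows: translate each 5'-end read in all six frames and keep the clones whose translation contains the motif more than once. Everything in this sketch is an assumption made for illustration rather than the authors' procedure: the function names (`translate`, `six_frame`, `count_motifs`, `trap`), the `min_hits` threshold, and the simplified cysteine-spacing pattern `EGF_LIKE`, which is far cruder than a curated EGF-domain profile.

```python
import re

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Hypothetical, simplified cysteine-spacing pattern used only for this sketch.
# "[^*]" means any residue except a stop codon.
EGF_LIKE = re.compile(r"C[^*]C[^*]{5}G[^*]{2}C")

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate DNA in frame 0; ambiguous codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame(seq):
    """Yield the six conceptual translations of a read."""
    seq = seq.upper()
    for strand in (seq, reverse_complement(seq)):
        for offset in range(3):
            yield translate(strand[offset:])


def count_motifs(read):
    """Largest number of non-overlapping motif hits in any single frame."""
    return max(len(EGF_LIKE.findall(peptide)) for peptide in six_frame(read))


def trap(reads, min_hits=2):
    """Return IDs of reads with at least `min_hits` motif hits in one frame."""
    return [rid for rid, seq in reads.items() if count_motifs(seq) >= min_hits]
```

Here `trap` takes a dictionary mapping clone IDs to single-pass read sequences. Requiring at least two hits in the same frame mirrors the abstract's focus on proteins with multiple EGF-like motifs. A real screen would also have to tolerate the sequencing errors and frameshifts typical of single-pass reads, which this sketch ignores.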

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Chromosome Mapping
  • Cloning, Molecular / methods*
  • DNA, Complementary / genetics
  • Epidermal Growth Factor / genetics*
  • Humans
  • Molecular Sequence Data
  • Molecular Weight
  • Protein Conformation
  • Proteins / genetics*
  • Rats
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Amino Acid*

Substances

  • DNA, Complementary
  • Proteins
  • Epidermal Growth Factor

Associated data

  • GENBANK/AB011527
  • GENBANK/AB011528
  • GENBANK/AB011530
  • GENBANK/AB011531
  • GENBANK/AB011532
  • GENBANK/AB011535
  • GENBANK/AB011536
  • GENBANK/AB011537
  • GENBANK/AB011538
  • GENBANK/AB011539
  • GENBANK/AB011540
  • GENBANK/AB011541
  • GENBANK/AB011542