Identification of Rat Genes by TWINSCAN Gene Prediction, RT–PCR, and Direct Sequencing

  1. Jia Qian Wu1,
  2. David Shteynberg2,
  3. Manimozhiyan Arumugam2,
  4. Richard A. Gibbs1, and
  5. Michael R. Brent2,3
  1. 1 Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
  2. 2 Laboratory for Computational Genomics, Washington University, St. Louis, Missouri 63130, USA

Abstract

The publication of a draft sequence of a third mammalian genome—that of the rat—suggests a need to rethink genome annotation. New mammalian sequences will not receive the kind of labor-intensive annotation efforts that are currently being devoted to human. In this paper, we demonstrate an alternative approach: reverse transcription-polymerase chain reaction (RT–PCR) and direct sequencing based on dual-genome de novo predictions from TWINSCAN. We tested 444 TWINSCAN-predicted rat genes that showed significant homology to known human genes implicated in disease but that were partially or completely missed by methods based on protein-to-genome mapping. Using primers in exons flanking a single predicted intron, we were able to verify the existence of 59% of these predicted genes. We then attempted to amplify the complete predicted open reading frames of 136 genes that were verified in the single-intron experiment. Spliced sequences were amplified in 46 cases (34%). We conclude that this procedure for elucidating gene structures with native cDNA sequences is cost-effective and will become even more so as it is further optimized.

Footnotes

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1959604.

  • 3 Corresponding author. E-MAIL brent{at}cse.wustl.edu; FAX (314) 935-7302.

    • Accepted November 17, 2003.
    • Received September 11, 2003.
| Table of Contents

Preprint Server