Sequence Evaluation of Four Pooled-Tissue Normalized Bovine cDNA Libraries and Construction of a Gene Index for Cattle

  1. Timothy P.L. Smith1,3,
  2. William M. Grosse1,
  3. Brad A. Freking1,
  4. Andrew J. Roberts1,
  5. Roger T. Stone1,
  6. Eduardo Casas1,
  7. James E. Wray1,
  8. Joseph White2,
  9. Jennifer Cho2,
  10. Scott C. Fahrenkrug1,
  11. Gary L. Bennett1,
  12. Michael P. Heaton1,
  13. William W. Laegreid1,
  14. Gary A. Rohrer1,
  15. Carol G. Chitko-McKown1,
  16. Geo Pertea2,
  17. Ingeborg Holt2,
  18. Svetlana Karamycheva2,
  19. Feng Liang2,
  20. John Quackenbush2, and
  21. John W. Keele1
  1. 1United States Department of Agriculture, Agricultural Research Service, United States Meat Animal Research Center, Clay Center, Nebraska 68933, USA; 2The Institute for Genomic Research, Rockville, Maryland 20850, USA

Abstract

An essential component of functional genomics studies is the sequence of DNA expressed in tissues of interest. To provide a resource of bovine-specific expressed sequence data and facilitate this powerful approach in cattle research, four normalized cDNA libraries were produced and arrayed for high-throughput sequencing. The libraries were made with RNA pooled from multiple tissues to increase efficiency of normalization and maximize the number of independent genes for which sequence data were obtained. Target tissues included those with highest likelihood to have impact on production parameters of animal health, growth, reproductive efficiency, and carcass merit. Success of normalization and inter- and intralibrary redundancy were assessed by collecting 6000–23,000 sequences from each of the libraries (68,520 total sequences deposited in GenBank). Sequence comparison and assembly of these sequences was performed in combination with 56,500 other bovine EST sequences present in the GenBank dbEST database to construct a cattle Gene Index (available from The Institute for Genomic Research at http://www.tigr.org/tdb/tgi.shtml). The 124,381 bovine ESTs present in GenBank at the time of the analysis form 16,740 assemblies that are listed and annotated on the Web site. Analysis of individual library sequence data indicates that the pooled-tissue approach was highly effective in preparing libraries for efficient deep sequencing.

Footnotes

  • 3 Corresponding author.

  • E-MAIL smith{at}emailmarc.usda.gov; FAX 402-762-4390.

  • Article and publication are at www.genome.org/cgi/doi/10.1101/gr.170101.

    • Received November 10, 2000.
    • Accepted January 24, 2001.
| Table of Contents

Preprint Server