Computer-Based Methods for the Mouse Full-Length cDNA Encyclopedia: Real-Time Sequence Clustering for Construction of a Nonredundant cDNA Library

  1. Hideaki Konno1,3,
  2. Yoshifumi Fukunishi2,3,
  3. Kazuhiro Shibata2,
  4. Masayoshi Itoh2,
  5. Piero Carninci2,
  6. Yuichi Sugahara2, and
  7. Yoshihide Hayashizaki1,2,3
  1. 1Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Yokohama 230–0045, Japan; 2Genome Science Laboratory, RIKEN Tsukuba Institute, Tsukuba 305–0074, Japan; 3Core Research for Evolutional Science and Technology, of Japan Science and Technology Corporation, Tsukuba 305–0074, Japan

Abstract

We developed computer-based methods for constructing a nonredundant mouse full-length cDNA library. Our cDNA library construction process comprises assessment of library quality, sequencing the 3′ ends of inserts and clustering, and completing a re-array to generate a nonredundant library from a redundant one. After the cDNA libraries are generated, we sequence the 5′ ends of the inserts to check the quality of the library; then we determine the sequencing priority of each library. Selected libraries undergo large-scale sequencing of the 3′ ends of the inserts and clustering of the tag sequences. After clustering, the nonredundant library is constructed from the original libraries, which have redundant clones. All libraries, plates, clones, sequences, and clusters are uniquely identified, and all information is saved in the database according to this identifier. At press time, our system has been in place for the past two years; we have clustered 939,725 3′ end sequences into 127,385 groups from 227 cDNA libraries/sublibraries (seehttp://genome.gse.riken.go.jp/).

[The sequence data described in this paper have been submitted to the DDBJ data library under accession nos. AV00011–AV175734, AV204013AV382295, andBB561685BB609425.]

Footnotes

  • 3 Corresponding author.

  • E-MAIL fukunisi{at}rtc.riken.go.jp; FAX: 81-(0)298-36-9098.

  • Article published online before print: Genome Res.,10.1101/gr.145701.

  • Article and publication are at www.genome.org/cgi/doi/10.1101/gr.145701.

    • Received October 5, 2000.
    • Accepted November 21, 2000.
| Table of Contents

Preprint Server