Comparing low coverage random shotgun sequence data from Brassica oleracea and Oryza sativa genome sequence for their ability to add to the annotation of Arabidopsis thaliana

Manpreet S. Katari1,2, Vivekanand Balija1, Richard K. Wilson3, Robert A. Martienssen1, and W. Richard McCombie1,4

1 Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
2 Graduate Program in Genetics, State University of New York at Stony Brook, Stony Brook, New York 11794, USA
3 The Genome Sequencing Center, Washington University School of Medicine, St. Louis, Missouri 63108, USA

Abstract

Since the completion of the Arabidopsis thaliana genome sequence, there has been an ongoing effort to annotate the genome as accurately as possible. Comparing genome sequences of related species complements the current annotation strategies by identifying genes and improving gene structure. A total of 595,321 Brassica oleracea shotgun reads were sequenced by TIGR (The Institute for Genomic Research) and the collaboration of Washington University and Cold Spring Harbor. Vicogenta (a genome viewer based on GMOD and GBrowse) was created to view the current annotation and sequence alignments for Arabidopsis. Brassica reads were compared with the Arabidopsis genome and proteome databases using BLAST. Hypothetical genes and conserved unannotated regions on the short arm of chromosome 4 from Arabidopsis were experimentally verified using RT–PCR. We were able to improve the Arabidopsis annotation by identifying 25 genes that were missed, and confirming expression of 43 hypothetical genes in Arabidopsis. We were also able to detect conservation in genes whose transcription is normally suppressed due to methylation. We also examined how useful the O. sativa genome and ESTs from other species are, compared with Brassica, in improving the Arabidopsis annotation.

Footnotes

  • [Supplemental material is available online at www.genome.org. Vicogenta is available at http://mccombielab.cshl.org/katari/vicogenta.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3239105.

  • 4 Corresponding author. E-mail mccombie{at}cshl.edu; fax (516) 422-4109.

    • Received September 8, 2004.
    • Accepted February 3, 2005.