Comparison of CAGE and RNA-seq transcriptome profiling using clonally amplified and single-molecule next-generation sequencing

  1. The FANTOM Consortium
  1. 1RIKEN Preventive Medicine and Diagnosis Innovation Program, Saitama 351-0198, Japan;
  2. 2RIKEN Omics Science Center, RIKEN Yokohama Institute, Yokohama, Kanagawa, 230-0045, Japan;4
  3. 3RIKEN Center for Life Science Technologies (CLST), Division of Genomic Technologies (DGT), Kanagawa, 230-0045, Japan

    Abstract

    CAGE (cap analysis gene expression) and RNA-seq are two major technologies used to identify transcript abundances as well as structures. They measure expression by sequencing from either the 5′ end of capped molecules (CAGE) or tags randomly distributed along the length of a transcript (RNA-seq). Library protocols for clonally amplified (Illumina, SOLiD, 454 Life Sciences [Roche], Ion Torrent), second-generation sequencing platforms typically employ PCR preamplification prior to clonal amplification, while third-generation, single-molecule sequencers can sequence unamplified libraries. Although these transcriptome profiling platforms have been demonstrated to be individually reproducible, no systematic comparison has been carried out between them. Here we compare CAGE, using both second- and third-generation sequencers, and RNA-seq, using a second-generation sequencer based on a panel of RNA mixtures from two human cell lines to examine power in the discrimination of biological states, detection of differentially expressed genes, linearity of measurements, and quantification reproducibility. We found that the quantified levels of gene expression are largely comparable across platforms and conclude that CAGE and RNA-seq are complementary technologies that can be used to improve incomplete gene models. We also found systematic bias in the second- and third-generation platforms, which is likely due to steps such as linker ligation, cleavage by restriction enzymes, and PCR amplification. This study provides a perspective on the performance of these platforms, which will be a baseline in the design of further experiments to tackle complex transcriptomes uncovered in a wide range of cell types.

    Footnotes

    • 4 RIKEN Omics Science Center ceased to exist as of April 1, 2013, due to RIKEN reorganization.

    • 5 Corresponding authors

      E-mail kawaji{at}gsc.riken.jp

      E-mail carninci{at}riken.jp

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.156232.113.

      Freely available online through the Genome Research Open Access option.

    • Received February 15, 2013.
    • Accepted December 10, 2013.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 3.0 Unported), as described at http://creativecommons.org/licenses/by/3.0.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server