Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division

  1. Mark Gerstein1,2,3,4,9
  1. 1Program in Computational Biology and Bioinformatics,
  2. 2Department of Molecular Biophysics and Biochemistry,
  3. 3Program in Neurodevelopment and Regeneration,
  4. 4Department of Computer Science, Yale University, New Haven, Connecticut 06520, USA;
  5. 5Department of Pathology, Brigham & Women's Hospital, Boston, Massachusetts 02115, USA;
  6. 6Harvard Medical School, Boston, Massachusetts 02115, USA
    • Present addresses: 7Department of Biological Sciences, University at Buffalo, State University of New York, Buffalo, New York 14260, USA;

    • 8 The Jackson Laboratory Institute for Genomic Medicine, Farmington, Connecticut 06030, USA.

    Abstract

    In primates and other animals, reverse transcription of mRNA followed by genomic integration creates retroduplications. Expressed retroduplications are either “retrogenes” coding for functioning proteins, or expressed “processed pseudogenes,” which can function as noncoding RNAs. To date, little is known about the variation in retroduplications in terms of their presence or absence across individuals in the human population. We have developed new methodologies that allow us to identify “novel” retroduplications (i.e., those not present in the reference genome), to find their insertion points, and to genotype them. Using these methods, we catalogued and analyzed 174 retroduplication variants in almost one thousand humans, which were sequenced as part of Phase 1 of The 1000 Genomes Project Consortium. The accuracy of our data set was corroborated by (1) multiple lines of sequencing evidence for retroduplication (e.g., depth of coverage in exons vs. introns), (2) experimental validation, and (3) the fact that we can reconstruct a correct phylogenetic tree of human subpopulations based solely on retroduplications. We also show that parent genes of retroduplication variants tend to be expressed at the M-to-G1 transition in the cell cycle and that M-to-G1 expressed genes have more copies of fixed retroduplications than genes expressed at other times. These findings suggest that cell division is coupled to retrotransposition and, perhaps, is even a requirement for it.

    Footnotes

    • 9 Corresponding authors

      E-mail mark.gerstein{at}yale.edu

      E-mail charles.lee{at}jax.org

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.154625.113.

      Freely available online through the Genome Research Open Access option.

    • Received January 7, 2013.
    • Accepted September 4, 2013.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server