Integration target site selection by a resurrected human endogenous retrovirus

  1. Troy Brady1,
  2. Young Nam Lee2,
  3. Keshet Ronen1,
  4. Nirav Malani1,
  5. Charles C. Berry3,
  6. Paul D. Bieniasz2,4 and
  7. Frederic D. Bushman1,5
  1. 1Department of Microbiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA;
  2. 2Aaron Diamond AIDS Research Center, Rockefeller University, New York, New York 10065, USA;
  3. 3Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, California 92093, USA;
  4. 4Howard Hughes Medical Institute and Laboratory of Retrovirology, Rockefeller University, New York, New York 10065, USA

    Abstract

    At least 8% of the human genome was formed by integration of retroviral DNA sequences. Here we analyze the forces directing the accumulation of human endogenous retroviruses (HERVs) by comparing de novo HERV integration targeting with the distribution of fixed HERV elements in the human genome. All known genomic HERVs are inactive due to mutation, but we were able to study integration targeting using a reconstituted consensus HERV-K (designated HERV-KCon). We found that HERV-KCon integrated preferentially in transcription units, in gene-rich regions, and near features associated with active transcription units and associated regulatory regions. In contrast, genomic HERV-K proviruses are found preferentially outside transcription units. The minority of genomic HERVKs present inside transcription units are in opposite transcriptional orientation relative to the host gene, the orientation predicted to be minimally disruptive to host mRNA synthesis, but de novo HERV-KCon integration within transcription units showed no orientation bias. We also found that the youngest HERV-K elements in the human genome showed a distribution intermediate between de novo HERV-KCon integration sites and older fixed HERV-Ks. These findings indicate that accumulation of HERVs in the human germline is a two-step process: integration targeting biases direct initial accumulation, then purifying selection leads to loss of proviruses disrupting gene function.

    Keywords:

    Keywords

    Footnotes

    | Table of Contents

    Life Science Alliance