A sequence-level map of chromosomal breakpoints in the MCF-7 breast cancer cell line yields insights into the evolution of a cancer genome

  1. Oliver A. Hampton1,3,5,
  2. Petra Den Hollander4,5,
  3. Christopher A. Miller1,3,
  4. David A. Delgado4,5,
  5. Jian Li1,3,
  6. Cristian Coarfa1,2,
  7. Ronald A. Harris1,2,
  8. Stephen Richards2,
  9. Steven E. Scherer2,
  10. Donna M. Muzny2,
  11. Richard A. Gibbs2,3,
  12. Adrian V. Lee4,5,6 and
  13. Aleksandar Milosavljevic1,2,3,5,7
  1. 1 Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, Texas 77030, USA;
  2. 2 Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA;
  3. 3 Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas 77030, USA;
  4. 4 Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA;
  5. 5 Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA;
  6. 6 Department of Medicine, Baylor College of Medicine, Houston, Texas 77030, USA

    Abstract

    By applying a method that combines end-sequence profiling and massively parallel sequencing, we obtained a sequence-level map of chromosomal aberrations in the genome of the MCF-7 breast cancer cell line. A total of 157 distinct somatic breakpoints of two distinct types, dispersed and clustered, were identified. A total of 89 breakpoints are evenly dispersed across the genome. A majority of dispersed breakpoints are in regions of low copy repeats (LCRs), indicating a possible role for LCRs in chromosome breakage. The remaining 68 breakpoints form four distinct clusters of closely spaced breakpoints that coincide with the four highly amplified regions in MCF-7 detected by array CGH located in the 1p13.1-p21.1, 3p14.1-p14.2, 17q22-q24.3, and 20q12-q13.33 chromosomal cytobands. The clustered breakpoints are not significantly associated with LCRs. Sequences flanking most (95%) breakpoint junctions are consistent with double-stranded DNA break repair by nonhomologous end-joining or template switching. A total of 79 known or predicted genes are involved in rearrangement events, including 10 fusions of coding exons from different genes and 77 other rearrangements. Four fusions result in novel expressed chimeric mRNA transcripts. One of the four expressed fusion products (RAD51C-ATXN7) and one gene truncation (BRIP1 or BACH1) involve genes coding for members of protein complexes responsible for homology-driven repair of double-stranded DNA breaks. Another one of the four expressed fusion products (ARFGEF2-SULF2) involves SULF2, a regulator of cell growth and angiogenesis. We show that knock-down of SULF2 in cell lines causes tumorigenic phenotypes, including increased proliferation, enhanced survival, and increased anchorage-independent growth.

    Footnotes

    • 7 Corresponding author.

      E-mail amilosav{at}bcm.edu; fax (713) 798-4373.

    • [Supplemental material is available online at www.genome.org and through the Breast Cancer project page at www.genboree.org. All MCF-7 BAC clones are available from Amplicon Express under name HTA and plate/row/column names as indicated. The sequence data from this study have been submitted to the NCBI Trace and Short Read Archives (http://www.ncbi.nlm.nih.gov) under accession nos. 2172834909–2172901416 and 2172904832–2172911164, and SRR006762–SRR006767, respectively].

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.080259.108.

      • Received April 29, 2008.
      • Accepted November 19, 2008.
    • Freely available online through the Genome Research open access option.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server