Gene copy number variation spanning 60 million years of human and primate evolution

  1. Laura Dumas1,
  2. Young H. Kim2,
  3. Anis Karimpour-Fard3,
  4. Michael Cox1,4,5,
  5. Janet Hopkins1,4,5,
  6. Jonathan R. Pollack2, and
  7. James M. Sikela1,4,5,6
  1. 1 Human Medical Genetics Program, University of Colorado at Denver and Health Sciences Center, Aurora, Colorado 80045, USA;
  2. 2 Department of Pathology, Stanford University, Stanford, California 94305, USA;
  3. 3 Department of Preventative Medicine and Biometrics, University of Colorado at Denver and Health Sciences Center, Aurora, Colorado 80045, USA;
  4. 4 Neuroscience Program, University of Colorado at Denver and Health Sciences Center, Aurora, Colorado 80045, USA;
  5. 5 Department of Pharmacology, University of Colorado at Denver and Health Sciences Center, Aurora, Colorado 80045, USA

Abstract

Given the evolutionary importance of gene duplication to the emergence of species-specific traits, we have extended the application of cDNA array-based comparative genomic hybridization (aCGH) to survey gene duplications and losses genome-wide across 10 primate species, including human. Using human cDNA arrays that contained 41,126 cDNAs, corresponding to 24,473 unique human genes, we identified 4159 genes that likely represent most of the major lineage-specific gene copy number gains and losses that have occurred in these species over the past 60 million years. We analyzed 1,233,780 gene-to-gene data points and found that gene gains typically outnumbered losses (ratio of gains/losses = 2.34) and these frequently cluster in complex and dynamic genomic regions that are likely to serve as gene nurseries. Almost one-third of all human genes (6696) exhibit an aCGH- predicted change in copy number in one or more of these species, and within-species gene amplification is also evident. Many of the genes identified here are likely to be important to lineage-specific traits including, for example, human-specific duplications of the AQP7 gene, which represent intriguing candidates to underlie the key physiological adaptations in thermoregulation and energy utilization that permitted human endurance running.

Footnotes

  • 6 Corresponding author.

    6 E-mail james.sikela{at}uchsc.edu; fax (303) 724-3663.

  • [Supplemental material is available online at www.genome.org. Full array-based comparative genomic hybridization data for all primates surveyed has been deposited in the Stanford Microarray Database (SMD) at http://www.stanford.edu/microarray.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6557307

    • Received April 5, 2007.
    • Accepted June 14, 2007.
| Table of Contents

Preprint Server