Coelacanth genome sequence reveals the evolutionary history of vertebrate genes

  1. James P. Noonan1,4,
  2. Jane Grimwood2,
  3. Joshua Danke3,
  4. Jeremy Schmutz2,
  5. Mark Dickson2,
  6. Chris T. Amemiya3, and
  7. Richard M. Myers1,2,5
  1. 1 Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120, USA
  2. 2 Stanford Human Genome Center, Stanford University School of Medicine, Palo Alto, California 94304, USA
  3. 3 Benaroya Research Institute at Virginia Mason, Seattle, Washington 98101, USA

Abstract

The coelacanth is one of the nearest living relatives of tetrapods. However, a teleost species such as zebrafish or Fugu is typically used as the outgroup in current tetrapod comparative sequence analyses. Such studies are complicated by the fact that teleost genomes have undergone a whole-genome duplication event, as well as individual gene-duplication events. Here, we demonstrate the value of coelacanth genome sequence by complete sequencing and analysis of the protocadherin gene cluster of the Indonesian coelacanth, Latimeria menadoensis. We found that coelacanth has 49 protocadherin cluster genes organized in the same three ordered subclusters, α, β, and γ, as the 54 protocadherin cluster genes in human. In contrast, whole-genome and tandem duplications have generated two zebrafish protocadherin clusters comprised of at least 97 genes. Additionally, zebrafish protocadherins are far more prone to homogenizing gene conversion events than coelacanth protocadherins, suggesting that recombination- and duplication-driven plasticity may be a feature of teleost genomes. Our results indicate that coelacanth provides the ideal outgroup sequence against which tetrapod genomes can be measured. We therefore present L. menadoensis as a candidate for whole-genome sequencing.

Footnotes

  • [Supplemental material is available online at www.genome.org and http://www-shgc.stanford.edu/myerslab/. The BAC sequence data from this study have been submitted to GenBank under accession nos. AC150283, AC150284, and AC150308-AC150310.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2972804. Article published online before print in November 2004.

  • 4 Present address: Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

  • 5 Corresponding author. E-mail myers{at}shgc.stanford.edu; fax (650) 725-9689.

    • Accepted September 17, 2004.
    • Received July 13, 2004.
| Table of Contents

Preprint Server