A genome-wide study of dual coding regions in human alternatively spliced genes

  1. Han Liang1 and
  2. Laura F. Landweber2,3
  1. 1 Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
  2. 2 Departments of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA

Abstract

Alternative splicing is a major mechanism for gene product regulation in many multicellular organisms. By using different exon combinations, some coding regions can encode amino acids in multiple reading frames in different transcripts. Here we performed a systematic search through a set of high-quality human transcripts and show that ∼7% of alternatively spliced genes contain dual (multiple) coding regions. By using a conservative criterion, we found that in these regions most secondary reading frames evolved recently in mammals, and a significant proportion of them may be specific to primates. Based on the presence of in-frame stop codons in orthologous sequences in other animals, we further classified ancestral and derived reading frames in these regions. Our results indicated that ancestral reading frames are usually under stronger selection than are derived reading frames. Ancestral reading frames mainly influence the coding properties of these dual coding regions. Compared with coding regions of the whole genome, ancestral reading frames largely maintain similar nucleotide composition at each codon position and amino acid usage, while derived reading frames are significantly different. Our results also indicated that prior to acquisition of a new reading frame, the suppression of in-frame stop codons in the ancestral state is mainly achieved by one-step transition substitutions at the first or second codon position. Finally, the selective forces imposed on these dual coding regions will also be discussed.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4246506.

  • 3 Corresponding author. E-mail lfl{at}princeton.edu; fax (609) 258-7892.

    • Accepted October 23, 2005.
    • Received June 7, 2005.
| Table of Contents

Preprint Server