Distribution and Characterization of Regulatory Elements in the Human Genome

  1. Jacek Majewski1 and
  2. Jurg Ott
  1. Rockefeller University, New York, New York 10021, USA

Abstract

The regulation of transcription and subsequent gene splicing are crucial to correct gene expression. Although a number of regulatory sequences involved in both processes are known, it is not clear how general their functions are in the genomic context, nor how the regulatory regions are distributed throughout the genome. Here we study the distribution of known mutagenic elements within human introns and exons to deduce the properties of regions essential for splicing and transcription. We show that intronic splicing regulators are generally found close to the splice sites, but may be found as far as 200 nucleotides away from the splice junctions. Similarly, sequences important for splicing may be located as far as 125 nucleotides away from the junctions, within exons. We characterize several types of simple repetitive sequences and low-complexity regions that are overrepresented close to both intron ends and are likely to play important roles in the splicing process. We show that the first introns within most genes play a particularly important regulatory role that is most likely, however, to be involved in transcription control. We also study the distribution of two known regulatory motifs, the GGG trinucleotide and the CpG dinucleotide, and deduce their respective importance to splicing and transcription regulation.

Footnotes

  • 1 Corresponding author.

  • E-MAIL majewski{at}complex.rockefeller.edu; FAX (212) 327-7996.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.606402.

    • Received July 10, 2002.
    • Accepted October 10, 2002.
| Table of Contents

Preprint Server