A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles

  1. Li-Wei Chang1,2,
  2. Rakesh Nagarajan3,
  3. Jeffrey A. Magee3,
  4. Jeffrey Milbrandt3, and
  5. Gary D. Stormo1,4
  1. 1 Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
  2. 2 Department of Biomedical Engineering, Washington University, St. Louis, Missouri 63130, USA
  3. 3 Department of Pathology and Immunology, Division of Laboratory Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA

Abstract

An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (2) the identification of genes that are regulated by one or several transcription factors. In this study, a systematic and statistical approach was taken to accomplish these tasks by establishing an integrated model considering all of the promoters and characterized transcription factors (TFs) in the genome. A promoter analysis pipeline (PAP) was developed to implement this approach. PAP was tested using coregulated gene clusters collected from the literature. In most test cases, PAP identified the transcription regulators of the input genes accurately. When compared with chromatin immunoprecipitation experiment data, PAP's predictions are consistent with the experimental observations. When PAP was used to analyze one published expression-profiling data set and two novel coregulated gene sets, PAP was able to generate biologically meaningful hypotheses. Therefore, by taking a systematic approach of considering all promoters and characterized TFs in our model, we were able to make more reliable predictions about the regulation of gene expression in mammalian organisms.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4303406.

  • 4 Corresponding author. E-mail stormo{at}genetics.wustl.edu; fax (314) 362-7855.

    • Accepted December 2, 2005.
    • Received June 16, 2005.
| Table of Contents

Preprint Server