Comparing Bacterial Genomes Through Conservation Profiles

  1. Maria J. Martı́n1,
  2. Javier Herrero2,
  3. Alvaro Mateos2, and
  4. Joaquin Dopazo2,3
  1. 1EMBL Outstation—The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK; 2Bioinformatics Unit, Centro Nacional de Investigaciones Oncológicas (CNIO), Melchor Fernández Almagro 3, 28029 Madrid, Spain

Abstract

We constructed two-dimensional representations of profiles of gene conservation across different genomes using the genome ofEscherichia coli as a model. These profiles permit both the visualization at the genome level of different traits in the organism studied and, at the same time, reveal features related to the genomes analyzed (such as defective genomes or genomes that lack a particular system). Conserved genes are not uniformly distributed along theE. coli genome but tend to cluster together. The study of gene distribution patterns across genomes is important for the understanding of how sets of genes seem to be dependent on each other, probably having some functional link. This provides additional evidence that can be used for the elucidation of the function of unannotated genes. Clustering these patterns produces families of genes which can be arranged in a hierarchy of closeness. In this way, functions can be defined at different levels of generality depending on the level of the hierarchy that is studied. The combined study of conservation and phenotypic traits opens up the possibility of defining phenotype/genotype associations, and ultimately inferring the gene or genes responsible for a particular trait.

Footnotes

  • 3 Corresponding author.

  • E-MAIL jdopazo{at}cnio.es; FAX +34 912246972.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.678303. Article published online before print in April 2003.

    • Received August 2, 2002.
    • Accepted February 4, 2003.
| Table of Contents

Preprint Server