A formal perturbation equation between genotype and phenotype determines the Evolutionary Action of protein-coding variations on fitness

  1. Olivier Lichtarge1,2,3,4
  1. 1Department of Molecular and Human Genetics,
  2. 2Department of Biochemistry & Molecular Biology,
  3. 3Department of Pharmacology, Baylor College of Medicine, Houston, Texas 77030, USA;
  4. 4Computational and Integrative Biomedical Research Center, Baylor College of Medicine, Houston, Texas 77030, USA
  1. Corresponding author: lichtarge{at}bcm.edu

Abstract

The relationship between genotype mutations and phenotype variations determines health in the short term and evolution over the long term, and it hinges on the action of mutations on fitness. A fundamental difficulty in determining this action, however, is that it depends on the unique context of each mutation, which is complex and often cryptic. As a result, the effect of most genome variations on molecular function and overall fitness remains unknown and stands apart from population genetics theories linking fitness effect to polymorphism frequency. Here, we hypothesize that evolution is a continuous and differentiable physical process coupling genotype to phenotype. This leads to a formal equation for the action of coding mutations on fitness that can be interpreted as a product of the evolutionary importance of the mutated site with the difference in amino acid similarity. Approximations for these terms are readily computable from phylogenetic sequence analysis, and we show mutational, clinical, and population genetic evidence that this action equation predicts the effect of point mutations in vivo and in vitro in diverse proteins, correlates disease-causing gene mutations with morbidity, and determines the frequency of human coding polymorphisms, respectively. Thus, elementary calculus and phylogenetics can be integrated into a perturbation analysis of the evolutionary relationship between genotype and phenotype that quantitatively links point mutations to function and fitness and that opens a new analytic framework for equations of biology. In practice, this work explicitly bridges molecular evolution with population genetics with applications from protein redesign to the clinical assessment of human genetic variations.

Footnotes

  • [Supplemental material is available for this article.]

  • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.176214.114.

    Freely available online through the Genome Research Open Access option.

  • Received March 28, 2014.
  • Accepted September 11, 2014.

This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

| Table of Contents
OPEN ACCESS ARTICLE

Preprint Server