Trimming, Weighting, and Grouping SNPs in Human Case-Control Association Studies

  1. Josephine Hoh,
  2. Anja Wille, and
  3. Jurg Ott1
  1. Laboratory of Statistical Genetics, Rockefeller University, New York, New York 10021, USA

Abstract

The search for genes underlying complex traits has been difficult and often disappointing. The main reason for these difficulties is that several genes, each with rather small effect, might be interacting to produce the trait. Therefore, we must search the whole genome for a good chance to find these genes. Doing this with tens of thousands of SNP markers, however, greatly increases the overall probability of false-positive results, and current methods limiting such error probabilities to acceptable levels tend to reduce the power of detecting weak genes. Investigating large numbers of SNPs inevitably introduces errors (e.g., in genotyping), which will distort analysis results. Here we propose a simple strategy that circumvents many of these problems. We develop a set-association method to blend relevant sources of information such as allelic association and Hardy-Weinberg disequilibrium. Information is combined over multiple markers and genes in the genome, quality control is improved by trimming, and an appropriate testing strategy limits the overall false-positive rate. In contrast to other available methods, our method to detect association to sets of SNP markers in different genes in a real data application has shown remarkable success.

Footnotes

  • 1 Corresponding author.

  • E-MAIL ott{at}linkage.rockefeller.edu; FAX (212) 327-7996.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.204001.

    • Received July 6, 2001.
    • Accepted October 10, 2001.
| Table of Contents

Preprint Server