Assembly and analysis of 100 full MHC haplotypes from the Danish population

  1. Mikkel H. Schierup1,4
  1. 1Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C., Denmark;
  2. 2Department of Clinical Medicine, Aarhus University, 8200 Aarhus N., Denmark;
  3. 3Department of Molecular Medicine, Aarhus University Hospital, Skejby, 8200 Aarhus N., Denmark;
  4. 4Department of Bioscience, Aarhus University, 8000 Aarhus C., Denmark
  5. 6Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen N, Denmark
  6. 7Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C, Denmark
  7. 8iSEQ, Centre for Integrative Sequencing, Aarhus University, 8000 Aarhus C, Denmark
  8. 9DTU Bioinformatics, Department of Bio and Health Informatics, Technical University of Denmark, Kemitorvet, 2800 Kongens Lyngby, Denmark
  9. 10BGI-Europe, Ole Maaløes Vej 3, 2200 Copenhagen N, Denmark
  10. 11Department of Clinical Medicine, Aarhus University, 8000 Aarhus C, Denmark
  11. 12Novo Nordisk Foundation Center for Basic Metabolic Research, Section of Metabolic Genetics, University of Copenhagen, 2100 Copenhagen Ø, Denmark
  12. 13Department of Biomedicine, Aarhus University, 8000 Aarhus C, Denmark
  13. 14The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8000 Aarhus, Denmark
  14. 15BGI-Shenzhen, Shenzhen 518083, China
  15. 16School of Bioscience and Biotechnology, South China University of Technology, Guangzhou 510006, China
  16. 17Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, 2100 Copenhagen Ø, Denmark
  17. 18Department of Psychology, University of Oslo, 0317 Oslo, Norway
  18. 19NORMENT, KG Jebsen Centre for Psychosis Research, Department of Clinical Science, University of Bergen, Bergen, 5021, Norway
  19. 20Dr. E. Martens Research Group of Biological Psychiatry, Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, 5021, Norway
  20. 21Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, 17177, Sweden
  21. 22Department of Genetics, University of North Carolina, Chapel Hill, NC 27599-7264, USA
  22. 23Department of Clinical Epidemiology (formerly Institute of Preventive Medicine), Bispebjerg and Frederiksberg Hospital, The Capital Region, Copenhagen, 2000 Frederiksberg, Denmark
  23. 24Department of Public Health, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
  24. 25Department of Cellular and Molecular Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark
  25. 26Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
  26. 27Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark
  1. Corresponding authors: jmj{at}birc.au.dk, mheide{at}birc.au.dk
  2. Abstract

    Genes in the major histocompatibility complex (MHC, also known as HLA) play a critical role in the immune response and variation within the extended 4-Mb region shows association with major risks of many diseases. Yet, deciphering the underlying causes of these associations is difficult because the MHC is the most polymorphic region of the genome with a complex linkage disequilibrium structure. Here, we reconstruct full MHC haplotypes from de novo assembled trios without relying on a reference genome and perform evolutionary analyses. We report 100 full MHC haplotypes and call a large set of structural variants in the regions for future use in imputation with GWAS data. We also present the first complete analysis of the recombination landscape in the entire region and show how balancing selection at classical genes have linked effects on the frequency of variants throughout the region.

    Footnotes

    • 5 A full list of Consortium members and their affiliations is available at the end of the text.

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.218891.116.

    • Freely available online through the Genome Research Open Access option.

    • Received November 25, 2016.
    • Accepted July 21, 2017.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server