The UK Biobank resource with deep phenotyping and genomic data

Clare Bycroft; Colin Freeman; Desislava Petkova; Gavin Band; Lloyd T Elliott; Kevin Sharp; Allan Motyer; Damjan Vukcevic; Olivier Delaneau; Jared O'Connell; Adrian Cortes; Samantha Welsh; Alan Young; Mark Effingham; Gil McVean; Stephen Leslie; Naomi Allen; Peter Donnelly; Jonathan Marchini

doi:10.1038/s41586-018-0579-z

Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.

Authors

Clare Bycroft¹, Colin Freeman¹, Desislava Petkova^{1,2}, Gavin Band¹, Lloyd T Elliott³, Kevin Sharp³, Allan Motyer⁴, Damjan Vukcevic^{4,5}, Olivier Delaneau^{6,7,8}, Jared O'Connell⁹, Adrian Cortes^{1,10}, Samantha Welsh¹¹, Alan Young¹², Mark Effingham¹¹, Gil McVean^{1,12}, Stephen Leslie^{4,5}, Naomi Allen¹², Peter Donnelly^{1,3}, Jonathan Marchini^{13,14}

Affiliations

¹ Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
² Procter & Gamble, Brussels, Belgium.
³ Department of Statistics, University of Oxford, Oxford, UK.
⁴ Melbourne Integrative Genomics and the Schools of Mathematics and Statistics, and BioSciences, The University of Melbourne, Parkville, Victoria, Australia.
⁵ Murdoch Children's Research Institute, Parkville, Victoria, Australia.
⁶ Department of Genetic Medicine and Development, University of Geneva, Geneva, Switzerland.
⁷ Swiss Institute of Bioinformatics, University of Geneva, Geneva, Switzerland.
⁸ Institute of Genetics and Genomics in Geneva, University of Geneva, Geneva, Switzerland.
⁹ Illumina Ltd, Chesterford Research Park, Little Chesterford, Essex, UK.
¹⁰ Nuffield Department of Clinical Neurosciences, Division of Clinical Neurology, John Radcliffe Hospital, University of Oxford, Oxford, UK.
¹¹ UK Biobank, Adswood, Stockport, Cheshire, UK.
¹² Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.
¹³ Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK. marchini@stats.ox.ac.uk.
¹⁴ Department of Statistics, University of Oxford, Oxford, UK. marchini@stats.ox.ac.uk.

Abstract

The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adult
Aged
Alleles
Biomarkers / blood
Biomarkers / urine
Body Height / genetics
Brain / diagnostic imaging
Cohort Studies
Databases, Factual*
Databases, Genetic
Electronic Health Records
Family
Female
Genome-Wide Association Study
Genomics*
Haplotypes / genetics
Humans
Life Style
Major Histocompatibility Complex / genetics
Male
Middle Aged
Phenotype*
Quality Control
Racial Groups / genetics
United Kingdom

Substances

Biomarkers
