The UK10K project identifies rare variants in health and disease.
UK10K Consortium None., Walter K., Min JL., Huang J., Crooks L., Memari Y., McCarthy S., Perry JRB., Xu C., Futema M., Lawson D., Iotchkova V., Schiffels S., Hendricks AE., Danecek P., Li R., Floyd J., Wain LV., Barroso I., Humphries SE., Hurles ME., Zeggini E., Barrett JC., Plagnol V., Richards JB., Greenwood CMT., Timpson NJ., Durbin R., Soranzo N.
The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.