Repository logo

Genetic associations of protein-coding variants in human disease.

Published version

Change log


Kurki, Mitja I 
Foley, Christopher N 
Mechakra, Asma 
Chen, Chia-Yen 


Genome-wide association studies (GWAS) have identified thousands of genetic variants linked to the risk of human disease. However, GWAS have so far remained largely underpowered in relation to identifying associations in the rare and low-frequency allelic spectrum and have lacked the resolution to trace causal mechanisms to underlying genes1. Here we combined whole-exome sequencing in 392,814 UK Biobank participants with imputed genotypes from 260,405 FinnGen participants (653,219 total individuals) to conduct association meta-analyses for 744 disease endpoints across the protein-coding allelic frequency spectrum, bridging the gap between common and rare variant studies. We identified 975 associations, with more than one-third being previously unreported. We demonstrate population-level relevance for mutations previously ascribed to causing single-gene disorders, map GWAS associations to likely causal genes, explain disease mechanisms, and systematically relate disease associations to levels of 117 biomarkers and clinical-stage drug targets. Combining sequencing and genotyping in two population biobanks enabled us to benefit from increased power to detect and explain disease associations, validate findings through replication and propose medical actionability for rare genetic variants. Our study provides a compendium of protein-coding variant associations for future insights into disease biology and drug discovery.



Gene Frequency, Genetic Predisposition to Disease, Genome-Wide Association Study, Genotype, Humans, Polymorphism, Single Nucleotide, Proteins, Exome Sequencing

Journal Title


Conference Name

Journal ISSN


Volume Title



Springer Science and Business Media LLC