Accurate and robust genomic prediction of celiac disease using statistical learning.
View / Open Files
Authors
Abraham, Gad
Tye-Din, Jason A
Bhalala, Oneil G
Kowalczyk, Adam
Zobel, Justin
Publication Date
2014-02Journal Title
PLoS Genet
ISSN
1553-7390
Publisher
Public Library of Science (PLoS)
Volume
10
Issue
2
Pages
e1004137
Language
eng
Type
Article
Physical Medium
Electronic-eCollection
Metadata
Show full item recordCitation
Abraham, G., Tye-Din, J. A., Bhalala, O. G., Kowalczyk, A., Zobel, J., & Inouye, M. (2014). Accurate and robust genomic prediction of celiac disease using statistical learning.. PLoS Genet, 10 (2), e1004137. https://doi.org/10.1371/journal.pgen.1004137
Abstract
Practical application of genomic-based risk stratification to clinical diagnosis is appealing yet performance varies widely depending on the disease and genomic risk score (GRS) method. Celiac disease (CD), a common immune-mediated illness, is strongly genetically determined and requires specific HLA haplotypes. HLA testing can exclude diagnosis but has low specificity, providing little information suitable for clinical risk stratification. Using six European cohorts, we provide a proof-of-concept that statistical learning approaches which simultaneously model all SNPs can generate robust and highly accurate predictive models of CD based on genome-wide SNP profiles. The high predictive capacity replicated both in cross-validation within each cohort (AUC of 0.87-0.89) and in independent replication across cohorts (AUC of 0.86-0.9), despite differences in ethnicity. The models explained 30-35% of disease variance and up to ∼43% of heritability. The GRS's utility was assessed in different clinically relevant settings. Comparable to HLA typing, the GRS can be used to identify individuals without CD with ≥99.6% negative predictive value however, unlike HLA typing, fine-scale stratification of individuals into categories of higher-risk for CD can identify those that would benefit from more invasive and costly definitive testing. The GRS is flexible and its performance can be adapted to the clinical situation by adjusting the threshold cut-off. Despite explaining a minority of disease heritability, our findings indicate a genomic risk score provides clinically relevant information to improve upon current diagnostic pathways for CD and support further studies evaluating the clinical utility of this approach in CD and other complex diseases.
Keywords
Alleles, Biometry, Celiac Disease, Female, Genetic Predisposition to Disease, Genome, Human, Genomics, HLA Antigens, Haplotypes, Humans, Polymorphism, Single Nucleotide, Risk
Identifiers
External DOI: https://doi.org/10.1371/journal.pgen.1004137
This record's URL: https://www.repository.cam.ac.uk/handle/1810/284874
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk