Repository logo
 

Use of genome sequencing to hunt for cryptic second-hit variants: analysis of 31 cases recruited to the 100 000 Genomes Project.

Published version
Peer-reviewed

Repository DOI


Change log

Abstract

BACKGROUND: Current clinical testing methods used to uncover the genetic basis of rare disease have inherent limitations, which can lead to causative pathogenic variants being missed. Within the rare disease arm of the 100 000 Genomes Project (100kGP), families were recruited under the clinical indication 'single autosomal recessive mutation in rare disease'. These participants presented with strong clinical suspicion for a specific autosomal recessive disorder, but only one suspected pathogenic variant had been identified through standard-of-care testing. Whole genome sequencing (WGS) aimed to identify cryptic 'second-hit' variants. METHODS: To investigate the 31 families with available data that remained unsolved following formal review within the 100kGP, SVRare was used to aggregate structural variants present in <1% of 100kGP participants. Small variants were assessed using population allele frequency data and SpliceAI. Literature searches and publicly available online tools were used for further annotation of pathogenicity. RESULTS: Using these strategies, 8/31 cases were solved, increasing the overall diagnostic yield of this cohort from 10/41 (24.4%) to 18/41 (43.9%). Exemplar cases include a patient with cystic fibrosis harbouring a novel exonic LINE1 insertion in CFTR and a patient with generalised arterial calcification of infancy with complex interlinked duplications involving exons 2-6 of ENPP1. Although ambiguous by short-read WGS, the ENPP1 variant structure was resolved using optical genome mapping and RNA analysis. CONCLUSION: Systematic examination of cryptic variants across a multi-disease cohort successfully identifies additional pathogenic variants. WGS data analysis in autosomal recessive rare disease should consider complex structural and small intronic variants as potentially pathogenic second hits.

Description

Peer reviewed: True


Acknowledgements: We thank the families involved in this study and the clinicians for providing information about the respective first-hit variants. This work was made possible through access to the data and findings generated by the 100 000 Genomes Project.


Funder: NIHR Oxford Biomedical Research Centre; FundRef: http://dx.doi.org/10.13039/501100013373


Funder: Cancer Research UK; FundRef: http://dx.doi.org/10.13039/501100000289

Journal Title

J Med Genet

Conference Name

Journal ISSN

0022-2593
1468-6244

Volume Title

Publisher

BMJ

Rights and licensing

Except where otherwised noted, this item's license is described as https://creativecommons.org/licenses/by/4.0/
Sponsorship
Medical Research Council (MR/W01761X/1)
Wellcome Trust (203141/Z/16/Z)