Repository logo
 

Complex structural variants in Mendelian disorders: identification and breakpoint resolution using short- and long-read genome sequencing.

Published version
Peer-reviewed

Type

Article

Change log

Authors

Sanchis-Juan, Alba 
French, Courtney E 
Gleadall, Nicholas 
Mégy, Karyn 

Abstract

BACKGROUND: Studies have shown that complex structural variants (cxSVs) contribute to human genomic variation and can cause Mendelian disease. We aimed to identify cxSVs relevant to Mendelian disease using short-read whole-genome sequencing (WGS), resolve the precise variant configuration and investigate possible mechanisms of cxSV formation. METHODS: We performed short-read WGS and analysis of breakpoint junctions to identify cxSVs in a cohort of 1324 undiagnosed rare disease patients. Long-read WGS and gene expression analysis were used to resolve one case. RESULTS: We identified three pathogenic cxSVs: a de novo duplication-inversion-inversion-deletion affecting ARID1B, a de novo deletion-inversion-duplication affecting HNRNPU and a homozygous deletion-inversion-deletion affecting CEP78. Additionally, a de novo duplication-inversion-duplication overlapping CDKL5 was resolved by long-read WGS demonstrating the presence of both a disrupted and an intact copy of CDKL5 on the same allele, and gene expression analysis showed both parental alleles of CDKL5 were expressed. Breakpoint analysis in all the cxSVs revealed both microhomology and longer repetitive elements. CONCLUSIONS: Our results corroborate that cxSVs cause Mendelian disease, and we recommend their consideration during clinical investigations. We show that resolution of breakpoints can be critical to interpret pathogenicity and present evidence of replication-based mechanisms in cxSV formation.

Description

Keywords

ARID1B, CDKL5, CEP78, Complex structural variant, Genome sequencing, HNRNPU, Nanopore, Next-generation sequencing, Cell Cycle Proteins, DNA-Binding Proteins, Female, Genetic Predisposition to Disease, Genome, Human, Genomic Structural Variation, Heterogeneous-Nuclear Ribonucleoprotein U, High-Throughput Nucleotide Sequencing, Humans, Male, Mutation, Protein Serine-Threonine Kinases, Sequence Analysis, DNA, Transcription Factors

Journal Title

Genome Med

Conference Name

Journal ISSN

1756-994X
1756-994X

Volume Title

10

Publisher

Springer Science and Business Media LLC
Sponsorship
Cambridge University Hospitals NHS Foundation Trust (CUH) (BRC 2012-2017)