Repository logo
 

Intersect-then-combine approach: improving the performance of somatic variant calling in whole exome sequencing data using multiple aligners and callers.

Published version
Peer-reviewed

Change log

Authors

Sammut, Stephen-John 
De Mattos-Arruda, Leticia 
Rueda, Oscar M 

Abstract

Bioinformatic analysis of genomic sequencing data to identify somatic mutations in cancer samples is far from achieving the required robustness and standardisation. In this study we generated a whole exome sequencing benchmark dataset using the platinum genome sample NA12878 and developed an intersect-then-combine (ITC) approach to increase the accuracy in calling single nucleotide variants (SNVs) and indels in tumour-normal pairs. We evaluated the effect of alignment, base quality recalibration, mutation caller and filtering on sensitivity and false positive rate. The ITC approach increased the sensitivity up to 17.1%, without increasing the false positive rate per megabase (FPR/Mb) and its validity was confirmed in a set of clinical samples.

Description

Keywords

BWA, Filtering, Mutect2, NA12878, Novoalign, Platinum genome, Somatic mutation, Strelka, Variant calling, Whole exome sequencing, Algorithms, Computational Biology, DNA, Neoplasm, Exome, Genome, Human, Humans, INDEL Mutation, Mutation, Neoplasms, Polymorphism, Single Nucleotide, Sensitivity and Specificity, Sequence Analysis, DNA

Journal Title

Genome Med

Conference Name

Journal ISSN

1756-994X
1756-994X

Volume Title

9

Publisher

Springer Science and Business Media LLC
Sponsorship
Wellcome Trust (106566/Z/14/Z)
Cancer Research UK (unknown)
Department of Health (via National Institute for Health Research (NIHR)) (unknown)
Cancer Research UK (60098573)
Cancer Research UK (unknown)
Cancer Research UK (CB4140)
Department of Health (via National Institute for Health Research (NIHR)) (NF-SI-0515-10090)
European Commission (260791)
Cambridge University Hospitals NHS Foundation Trust (CUH) (RG51913)
Cancer Research Uk (None)
European Commission FP7 Network of Excellence (NoE) (260791)
Cambridge University Hospitals NHS Foundation Trust (CUH) (unknown)
Cancer Research Uk (None)
Academy of Medical Sciences (unknown)
Medical Research Council (MR/M008975/1)
Academy of Medical Sciences (ALI 01/08/14)
Pathological Society of Great Britain & Ireland (CDF 2012/01)
European Commission FP7 Collaborative projects (CP) (258967)
Cancer Research UK (C507/A16278)
European Commission (258967)
Cancer Research UK (20544)
Medical Research Council (MR/P012442/1)
European Commission and European Federation of Pharmaceutical Industries and Associations (EFPIA) FP7 Innovative Medicines Initiative (IMI) (115749)
European Commission (242006)
European Research Council (694620)
Cancer Research UK (A24622)
European Commission Horizon 2020 (H2020) Marie Sk?odowska-Curie actions (660060)