Repository logo
 

A new parallel pipeline for DNA methylation analysis of long reads datasets.


Change log

Authors

Olanda, Ricardo 
Pérez, Mariano 
Tárraga, Joaquín 
Dopazo, Joaquín 

Abstract

BACKGROUND: DNA methylation is an important mechanism of epigenetic regulation in development and disease. New generation sequencers allow genome-wide measurements of the methylation status by reading short stretches of the DNA sequence (Methyl-seq). Several software tools for methylation analysis have been proposed over recent years. However, the current trend is that the new sequencers and the ones expected for an upcoming future yield sequences of increasing length, making these software tools inefficient and obsolete. RESULTS: In this paper, we propose a new software based on a strategy for methylation analysis of Methyl-seq sequencing data that requires much shorter execution times while yielding a better level of sensitivity, particularly for datasets composed of long reads. This strategy can be exported to other methylation, DNA and RNA analysis tools. CONCLUSIONS: The developed software tool achieves execution times one order of magnitude shorter than the existing tools, while yielding equal sensitivity for short reads and even better sensitivity for long reads.

Description

Keywords

DNA methylation, High performance computing, Parallel pipeline, DNA Methylation, Databases, Genetic, Epigenesis, Genetic, Gene Expression Regulation, Genome, Human, Humans, Mutation, Sensitivity and Specificity, Sequence Alignment, Sequence Analysis, DNA, Software

Journal Title

BMC Bioinformatics

Conference Name

Journal ISSN

1471-2105
1471-2105

Volume Title

Publisher

Springer Science and Business Media LLC