Repository logo
 

SCoTCH-seq reveals that 5-hydroxymethylcytosine encodes regulatory information across DNA strands

Published version
Peer-reviewed

Repository DOI


Change log

Abstract

In mammalian genomes, cytosine modifications form a layer of regulatory information alongside the genetic code. Decoding this information is crucial to our understanding of biology and disease. Established sequencing methods cannot simultaneously resolve cytosine’s three most common forms—cytosine (C), 5-methylcytosine (mC), and 5-hydroxymethylcytosine (hmC)—across both strands of the DNA double helix. Thus, how epigenetic information is distributed in DNA remains unclear. Here, we present Strand-Coupled Tandem Cytosine Hydroxymethylation and methylation sequencing (SCoTCH-seq): an accurate and quantitative, base-resolution approach to sequence genomes, together with mC and hmC, in both strands of the same DNA fragment. We show that different forms of cytosine combine across the double helix at CpG sites to form discrete information states in the mouse epigenome. These CpG states have distinct genomic distributions—including at promoters, enhancers, and gene bodies—and have different relationships with transcription. We show that while all possible forms of hydroxymethylation occur, hmC is predominantly asymmetric and that different forms of asymmetric hmC are not equivalent. Our findings demonstrate that 5-hydroxymethylcytosine combines with different cytosine variants across the DNA double helix to form distinct states of regulatory information.

Description

Peer reviewed: True


Publication status: Published

Journal Title

Proceedings of the National Academy of Sciences

Conference Name

Journal ISSN

0027-8424
1091-6490

Volume Title

122

Publisher

National Academy of Sciences

Rights and licensing

Except where otherwised noted, this item's license is described as https://creativecommons.org/licenses/by/4.0/
Sponsorship
cambridgeuniversity | Cancer Research UK Cambridge Institute, University of Cambridge (CRUK CI) (SEBINT-2024/100003)
Wellcome Trust (WT) (209441/Z/17/Z)
Leverhulme Trust (ECF-2022-390)
Deutsche Forschungsgemeinschaft (DFG) (458174228)
UKRI | Engineering and Physical Sciences Research Council (EPSRC) (EP/R513180/1)
Leverhulme Trust (ECF-2021-398)