Complete Sequence of a 641-kb Insertion of Mitochondrial DNA in the Arabidopsis thaliana Nuclear Genome.
View / Open Files
Publication Date
2022-05-03Journal Title
Genome Biol Evol
ISSN
1759-6653
Publisher
Oxford University Press (OUP)
Volume
14
Issue
5
Language
eng
Type
Article
This Version
VoR
Metadata
Show full item recordCitation
Fields, P. D., Waneka, G., Naish, M., Schatz, M. C., Henderson, I., & Sloan, D. B. (2022). Complete Sequence of a 641-kb Insertion of Mitochondrial DNA in the Arabidopsis thaliana Nuclear Genome.. Genome Biol Evol, 14 (5) https://doi.org/10.1093/gbe/evac059
Abstract
Intracellular transfers of mitochondrial DNA continue to shape nuclear genomes. Chromosome 2 of the model plant Arabidopsis thaliana contains one of the largest known nuclear insertions of mitochondrial DNA (numts). Estimated at over 600 kb in size, this numt is larger than the entire Arabidopsis mitochondrial genome. The primary Arabidopsis nuclear reference genome contains less than half of the numt because of its structural complexity and repetitiveness. Recent data sets generated with improved long-read sequencing technologies (PacBio HiFi) provide an opportunity to finally determine the accurate sequence and structure of this numt. We performed a de novo assembly using sequencing data from recent initiatives to span the Arabidopsis centromeres, producing a gap-free sequence of the Chromosome 2 numt, which is 641 kb in length and has 99.933% nucleotide sequence identity with the actual mitochondrial genome. The numt assembly is consistent with the repetitive structure previously predicted from fiber-based fluorescent in situ hybridization. Nanopore sequencing data indicate that the numt has high levels of cytosine methylation, helping to explain its biased spectrum of nucleotide sequence divergence and supporting previous inferences that it is transcriptionally inactive. The original numt insertion appears to have involved multiple mitochondrial DNA copies with alternative structures that subsequently underwent an additional duplication event within the nuclear genome. This work provides insights into numt evolution, addresses one of the last unresolved regions of the Arabidopsis reference genome, and represents a resource for distinguishing between highly similar numt and mitochondrial sequences in studies of transcription, epigenetic modifications, and de novo mutations.
Keywords
Cpg Methylation, Nupt, Structural Variants, Tandem Duplications, Numt, Intracellular Gene Transfer, Cell Nucleus, Mitochondria, Arabidopsis, DNA, Mitochondrial, In Situ Hybridization, Fluorescence, Sequence Analysis, DNA, Genome, Mitochondrial
Sponsorship
BBSRC (BB/V003984/1)
Identifiers
35446419, PMC9071559
External DOI: https://doi.org/10.1093/gbe/evac059
This record's URL: https://www.repository.cam.ac.uk/handle/1810/337377
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.