
Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.

Published version
Peer-reviewed

Type

Article

Authors

Doran, Anthony G 
Fiddes, Ian T 
Abrudan, Monica 
Armstrong, Joel 

Abstract

We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.

Keywords

Animals; Animals, Laboratory; Chromosome Mapping; Genetic Loci; Genome; Haplotypes; Mice; Mice, Inbred BALB C; Mice, Inbred C3H; Mice, Inbred C57BL; Mice, Inbred CBA; Mice, Inbred DBA; Mice, Inbred NOD; Mice, Inbred Strains; Molecular Sequence Annotation; Phylogeny; Polymorphism, Single Nucleotide; Species Specificity

Journal Title

Nat Genet

Journal ISSN

1061-4036
1546-1718

Volume Title

50

Publisher

Springer Science and Business Media LLC

Sponsorship

European Commission (282510)
European Research Council (615584)
Cancer Research UK (20412)
Wellcome Trust (202878/Z/16/Z)