A database of battery materials auto-generated using ChemDataExtractor
Publication Date
2020-08-06Journal Title
Scientific Data
Publisher
Nature Publishing Group UK
Volume
7
Issue
1
Language
en
Type
Article
This Version
VoR
Metadata
Show full item recordCitation
Huang, S., & Cole, J. M. (2020). A database of battery materials auto-generated using ChemDataExtractor. Scientific Data, 7 (1)https://doi.org/10.1038/s41597-020-00602-2
Description
Funder: University of Cambridge | Christ's College, University of Cambridge (Christ's College); doi: https://doi.org/10.13039/501100000590
Abstract
Abstract: A database of battery materials is presented which comprises a total of 292,313 data records, with 214,617 unique chemical-property data relations between 17,354 unique chemicals and up to five material properties: capacity, voltage, conductivity, Coulombic efficiency and energy. 117,403 data are multivariate on a property where it is the dependent variable in part of a data series. The database was auto-generated by mining text from 229,061 academic papers using the chemistry-aware natural language processing toolkit, ChemDataExtractor version 1.5, which was modified for the specific domain of batteries. The collected data can be used as a representative overview of battery material information that is contained within text of scientific papers. Public availability of these data will also enable battery materials design and prediction via data-science methods. To the best of our knowledge, this is the first auto-generated database of battery materials extracted from a relatively large number of scientific papers. We also provide a Graphical User Interface (GUI) to aid the use of this database.
Keywords
Data Descriptor, /639/638/675, /639/766/94, /639/301/299/891, data-descriptor
Sponsorship
Royal Academy of Engineering (RCSRF1819\7\10)
Identifiers
s41597-020-00602-2, 602
External DOI: https://doi.org/10.1038/s41597-020-00602-2
This record's URL: https://www.repository.cam.ac.uk/handle/1810/308849
Rights
Licence:
https://creativecommons.org/licenses/by/4.0/