Repository logo
 

A database of battery materials auto-generated using ChemDataExtractor.

Accepted version
Peer-reviewed

Type

Article

Change log

Authors

Huang, Shu 

Abstract

A database of battery materials is presented which comprises a total of 292,313 data records, with 214,617 unique chemical-property data relations between 17,354 unique chemicals and up to five material properties: capacity, voltage, conductivity, Coulombic efficiency and energy. 117,403 data are multivariate on a property where it is the dependent variable in part of a data series. The database was auto-generated by mining text from 229,061 academic papers using the chemistry-aware natural language processing toolkit, ChemDataExtractor version 1.5, which was modified for the specific domain of batteries. The collected data can be used as a representative overview of battery material information that is contained within text of scientific papers. Public availability of these data will also enable battery materials design and prediction via data-science methods. To the best of our knowledge, this is the first auto-generated database of battery materials extracted from a relatively large number of scientific papers. We also provide a Graphical User Interface (GUI) to aid the use of this database.

Description

Keywords

46 Information and Computing Sciences, 40 Engineering, 4016 Materials Engineering, 34 Chemical Sciences, 7 Affordable and Clean Energy

Journal Title

Sci Data

Conference Name

Journal ISSN

2052-4463
2052-4463

Volume Title

7

Publisher

Springer Science and Business Media LLC

Rights

All rights reserved
Sponsorship
Royal Academy of Engineering (RAEng) (RCSRF1819\7\10)
STFC (Unknown)
The BASF/Royal Academy of Engineering Research Chair in Data-Driven Molecular Engineering of Functional Materials is partly supported by the STFC via the ISIS Neutron and Muon Source.