Repository logo
 

Auto-generating databases of Yield Strength and Grain Size using ChemDataExtractor.

Published version
Peer-reviewed

Change log

Authors

Kumar, Pankaj 
Kabra, Saurabh 

Abstract

The emerging field of material-based data science requires information-rich databases to generate useful results which are currently sparse in the stress engineering domain. To this end, this study uses the'materials-aware' text-mining toolkit, ChemDataExtractor, to auto-generate databases of yield-strength and grain-size values by extracting such information from the literature. The precision of the extracted data is 83.0% for yield strength and 78.8% for grain size. The automatically-extracted data were organised into four databases: a Yield Strength, Grain Size, Engineering-Ready Yield Strength and Combined database. For further validation of the databases, the Combined database was used to plot the Hall-Petch relationship for, the alloy, AZ31, and similar results to the literature were found, demonstrating how one can make use of these automatically-extracted datasets.

Description

Keywords

Databases, Factual, Data Mining

Journal Title

Sci Data

Conference Name

Journal ISSN

2052-4463
2052-4463

Volume Title

9

Publisher

Springer Science and Business Media LLC
Sponsorship
Royal Academy of Engineering (RCSRF1819\7\10)
RCUK | Science and Technology Facilities Council (STFC) (Fellowship support from ISIS Neutron and Muon Source)
U.S. Department of Energy (DOE) (DE-AC02-06CH11357)