Testing the reproducibility and robustness of the cancer biology literature by robot.

Roper, Katherine; Abdel-Rehim, A; Hubbard, Sonya; Carpenter, Martin; Rzhetsky, Andrey; Soldatova, Larisa; King, Ross D

Published version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/336906

Repository DOI

https://doi.org/10.17863/CAM.84325

Files

Published version (796.78 KB)

Type

Article

Authors

Roper, Katherine

Abdel-Rehim, A

Hubbard, Sonya

Carpenter, Martin

Rzhetsky, Andrey

Soldatova, Larisa

King, Ross D

Abstract

Scientific results should be not just 'repeatable' (replicable in the same laboratory under identical conditions), but also 'reproducible' (replicable in other laboratories under similar conditions). Results should also, if possible, be 'robust' (replicable under a wide range of conditions). The reproducibility and robustness of only a small fraction of published biomedical results have been tested; furthermore, when reproducibility is tested, it is often not found. This situation is termed 'the reproducibility crisis', and it is one of the most important issues facing biomedicine. This crisis would be solved if it were possible to automate reproducibility testing. Here, we describe the semi-automated testing for reproducibility and robustness of simple statements (propositions) about cancer cell biology automatically extracted from the literature. From 12 260 papers, we automatically extracted statements predicted to describe experimental results regarding a change of gene expression in response to drug treatment in breast cancer; from these, we selected 74 statements of high biomedical interest. To test the reproducibility of these statements, two different teams used the laboratory automation system Eve and two breast cancer cell lines (MCF7 and MDA-MB-231). Statistically significant evidence for repeatability was found for 43 statements, and significant evidence for reproducibility/robustness was found for 22 statements. In two cases, the automation made serendipitous discoveries. The reproduced/robust knowledge provides significant insight into cancer. We conclude that semi-automated reproducibility testing is currently achievable, that it could be scaled up to generate a substantive source of reliable knowledge and that automation has the potential to mitigate the reproducibility crisis.

Keywords

biology, cancer, literature, reproducibility, robustness, testing, Automation, Biology, Breast Neoplasms, Female, Humans, Reproducibility of Results, Robotics

Journal Title

J R Soc Interface

Journal ISSN

1742-5689
1742-5662

Publisher

The Royal Society

Publisher DOI

https://doi.org/10.1098/rsif.2021.0821

Rights

Attribution 4.0 International

Sponsorship

EPSRC (EP/W004801/1)

Collections

Jisc Publications Router