Data intelligence for process performance prediction in biologics manufacturing
Accepted version
Peer-reviewed
Repository URI
Repository DOI
Change log
Authors
Abstract
Despite the availability of large amount of data in bioprocess databases, little has been done for its retrospective analysis for process improvement. Historic bioprocess data is multivariate time-series, and due to its inherent nature, is incompatible with a variety of statistical methods employed in data analysis resulting in the lack of a tailored methodology. We present here an integrative framework of knowledge discovery tailored for handling historical bioprocess datasets. The pipeline successfully predicts process performance at harvest from an early time point, and robustly identifies the most relevant process parameters to model process performance. We present the utility of this pipeline on biologics manufacturing data from upstream bioprocess development for antibody production by mammalian cells. The proposed multi-model system that employs machine learning can predict performance at harvest after two weeks of operation with satisfactory accuracy employing data generated as early as on the sixth day of the culture.
Description
Keywords
Journal Title
Conference Name
Journal ISSN
1873-4375
Volume Title
Publisher
Publisher DOI
Rights
Sponsorship
Biotechnology and Biological Sciences Research Council (BB/K011138/1)