Comparative performance of fully-automated and semi-automated artificial intelligence methods for the detection of clinically significant prostate cancer on MRI: a systematic review.
Moreira Da Silva, Nadia
MetadataShow full item record
Sushentsev, N., Moreira Da Silva, N., Yeung, M., Barrett, T., Sala, E., Roberts, M., & Rundo, L. (2022). Comparative performance of fully-automated and semi-automated artificial intelligence methods for the detection of clinically significant prostate cancer on MRI: a systematic review.. Insights Imaging https://doi.org/10.1186/s13244-022-01199-3
OBJECTIVES: We systematically reviewed the current literature evaluating the ability of fully-automated deep learning (DL) and semi-automated traditional machine learning (TML) MRI-based artificial intelligence (AI) methods to differentiate clinically significant prostate cancer (csPCa) from indolent PCa (iPCa) and benign conditions. METHODS: We performed a computerised bibliographic search of studies indexed in MEDLINE/PubMed, arXiv, medRxiv, and bioRxiv between 1 January 2016 and 31 July 2021. Two reviewers performed the title/abstract and full-text screening. The remaining papers were screened by four reviewers using the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) for DL studies and Radiomics Quality Score (RQS) for TML studies. Papers that fulfilled the pre-defined screening requirements underwent full CLAIM/RQS evaluation alongside the risk of bias assessment using QUADAS-2, both conducted by the same four reviewers. Standard measures of discrimination were extracted for the developed predictive models. RESULTS: 17/28 papers (five DL and twelve TML) passed the quality screening and were subject to a full CLAIM/RQS/QUADAS-2 assessment, which revealed a substantial study heterogeneity that precluded us from performing quantitative analysis as part of this review. The mean RQS of TML papers was 11/36, and a total of five papers had a high risk of bias. AUCs of DL and TML papers with low risk of bias ranged between 0.80-0.89 and 0.75-0.88, respectively. CONCLUSION: We observed comparable performance of the two classes of AI methods and identified a number of common methodological limitations and biases that future studies will need to address to ensure the generalisability of the developed models.
The review is supported by the National Institute of Health Research Cambridge Biomedical Research Centre, Cancer Research UK (Cambridge Imaging Centre grant number C197/A16465), the Engineering and Physical Sciences Research Council Imaging Centre in Cambridge and Manchester, Gates Cambridge Trust, The Mark Foundation for Cancer Research, the Wellcome Trust, CRUK National Cancer Imaging Translational Accelerator (NCITA) and the Cambridge Experimental Cancer Medicine Centre
External DOI: https://doi.org/10.1186/s13244-022-01199-3
This record's URL: https://www.repository.cam.ac.uk/handle/1810/334906
All Rights Reserved
Licence URL: http://www.rioxx.net/licenses/all-rights-reserved