Repository logo
 

SMMILe enables accurate spatial quantification in digital pathology using multiple-instance learning.

Published version
Peer-reviewed

Repository DOI


Change log

Abstract

Spatial quantification is a critical step in most computational pathology tasks, from guiding pathologists to areas of clinical interest to discovering tissue phenotypes behind novel biomarkers. To circumvent the need for manual annotations, modern computational pathology methods have favored multiple-instance learning approaches that can accurately predict whole-slide image labels, albeit at the expense of losing their spatial awareness. Here we prove mathematically that a model using instance-level aggregation could achieve superior spatial quantification without compromising on whole-slide image prediction performance. We then introduce a superpatch-based measurable multiple-instance learning method, SMMILe, and evaluate it across 6 cancer types, 3 highly diverse classification tasks and 8 datasets involving 3,850 whole-slide images. We benchmark SMMILe against nine existing methods using two different encoders-an ImageNet pretrained and a pathology-specific foundation model-and show that in all cases SMMILe matches or exceeds state-of-the-art whole-slide image classification performance while simultaneously achieving outstanding spatial quantification.

Description

Funder: GE | GE Healthcare; doi: https://doi.org/10.13039/100006775


Funder: We acknowledge funding and support from Cancer Research UK and the Cancer Research UK Cambridge Centre [CTRQQR-2021-100012], The Mark Foundation for Cancer Research [RG95043], GE HealthCare, Medical Intelligence Research Institute Project of the First Affiliated Hospital of Xi'an Jiaotong University [HX202440] and the Noncommunicable Chronic Diseases-National Science and Technology Major Project [2024ZD0527700]. Additional support was also provided by the National Institute of Health Research (NIHR) Cambridge Biomedical Research Centre [NIHR203312] and EPSRC Tier-2 capital grant [EP/P020259/1]. M.C.O. was supported by the Joseph Mitchell Cancer Research Fund, the Academy of Medical Sciences [G117526] and NIHR [NIHR206092]. Z.G. was supported by GE HealthCare. H.C. was supported by the University of Cambridge Harding Distinguished Postgraduate Scholars Programme.

Journal Title

Nat Cancer

Conference Name

Journal ISSN

2662-1347
2662-1347

Volume Title

6

Publisher

Springer Nature

Rights and licensing

Except where otherwised noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/
Sponsorship
Department of Health (via National Institute for Health Research (NIHR)) (NIHR206092)
Academy of Medical Sciences (SBF008\1170)
Engineering and Physical Sciences Research Council (EP/P020259/1)
We acknowledge funding and support from Cancer Research UK and the Cancer Research UK Cambridge Centre [CTRQQR-2021-100012], The Mark Foundation for Cancer Research [RG95043], GE HealthCare, and the CRUK National Cancer Imaging Translational Accelerator (NCITA) [A27066]. Additional support was also provided by the National Institute of Health Research (NIHR) Cambridge Biomedical Research Centre [NIHR203312] and EPSRC Tier-2 capital grant [EP/P020259/1]. Calculations were performed in part using the Sulis Tier 2 HPC platform hosted by the Scientific Computing Research Technology Platform at the University of Warwick. Sulis is funded by EPSRC Grant EP/T022108/1 and the HPC Midlands+ consortium. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The results shown here are in whole or part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga.