2020
DOI: 10.1007/978-3-030-59710-8_80
|View full text |Cite
|
Sign up to set email alerts
|

Can You Trust Predictive Uncertainty Under Real Dataset Shifts in Digital Pathology?

Abstract: Deep learning-based algorithms have shown great promise for assisting pathologists in detecting lymph node metastases when evaluated based on their predictive accuracy. However, for clinical adoption, we need to know what happens when the test set dramatically changes from the training distribution. In such settings, we should estimate the uncertainty of the predictions, so we know when to trust the model (and when not to). Here, we i) investigate current popular methods for improving the calibration of predic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 26 publications
(27 citation statements)
references
References 9 publications
2
22
0
Order By: Relevance
“…Clinical models trained on one hospital or region typically degrade in performance in the presence of domain shift [5,21,50,63,73,74,77,84]. In this paper, we evaluated the performance of eight domain generalization methods on their ability to generalize to an unseen test environment for typical clinical datasets.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Clinical models trained on one hospital or region typically degrade in performance in the presence of domain shift [5,21,50,63,73,74,77,84]. In this paper, we evaluated the performance of eight domain generalization methods on their ability to generalize to an unseen test environment for typical clinical datasets.…”
Section: Discussionmentioning
confidence: 99%
“…Prior work has found significant decreases in model performance under the presence of cross-institutional domain shift, in the chest X-ray [21,63,84], MRI [5,50], and pathology [73,74,77] settings. Temporal domain shifts have also been found to reduce performance in clinical machine learning models [53].…”
Section: Introductionmentioning
confidence: 99%
“…First, even though our models show good generalizability on the retrospective cohort ( n = 480 WSIs), we developed them on a limited number of cases. This means that the models might not perform optimally on another study cohort from a different site with a distributional shift in, e.g., preanalytical protocols, staining protocol, or scanner type [ 56 , 57 ]. Future development of our approach should extend the development dataset of both tissue- and cell-level models to be multi-institutional, covering the innate variability of the above-mentioned factors.…”
Section: Discussionmentioning
confidence: 99%
“…In four different unseen domains, BigAug obtains a comparable performance to the two state-of-the-art methods. Finally, in digital pathology and histopahology, the domain shift effect for deep learning has been studied in Thagaard et al (2020); Stacke et al (2019Stacke et al ( , 2020.…”
Section: Samalamentioning
confidence: 99%