Skip to yearly menu bar Skip to main content

Long Oral
Workshop: Trustworthy Machine Learning for Healthcare

Do Tissue Source Sites leave identifiable Signatures in Whole Slide Images beyond staining?

Muhammad Dawood · Piotr Keller · Fayyaz ul Amir Minhas


Why can deep learning predictors trained on Whole Slide Images fail to generalize? It is a common theme in Computational Pathology to see a high performing model developed in a research setting experience a large drop in performance when it is deployed to a new clinical environment. One of the major reasons for this is the batch effect that is introduced during the creation of Whole Slide Images resulting in a domain shift. Computational Pathology pipelines try to reduce this effect via stain normalization techniques. However, in this paper, we provide empirical evidence that stain normalization methods do not result in any significant reduction of the batch effect. This is done via clustering analysis of the dataset as well as training weakly-supervised models to predict source sites. This study aims to open up avenues for further research for effective handling of batch effects for improving trustworthiness and generalization of predictive modelling in the Computational Pathology domain.

Chat is not available.