2022
DOI: 10.1101/2022.01.21.477189
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep learning can predict multi-omic biomarkers from routine pathology images: A systematic large-scale study

Abstract: We assessed the pan-cancer predictability of multi-omic biomarkers from haematoxylin and eosin (H&E)-stained whole slide image (WSI) using deep learning and standard evaluation measures throughout a systematic study. A total of 13,443 deep learning (DL) models predicting 4,481 multi-omic biomarkers across 32 cancer types were trained and validated. The investigated biomarkers included genetic mutations, transcriptomic (mRNA) and proteomic under- and over-expression status, metabolomic pathways, established… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(11 citation statements)
references
References 87 publications
0
11
0
Order By: Relevance
“…Initial studies demonstrated this predictability in lung cancer 3 , breast cancer 4 , and colorectal cancer 5 . Subsequently, several "pan-cancer" studies showed that DL-based prediction of biomarkers is feasible across the whole spectrum of human cancer [6][7][8][9][10] . However, these studies were overwhelmingly performed in a single large cohort without externally validating the results on a large scale.…”
mentioning
confidence: 99%
“…Initial studies demonstrated this predictability in lung cancer 3 , breast cancer 4 , and colorectal cancer 5 . Subsequently, several "pan-cancer" studies showed that DL-based prediction of biomarkers is feasible across the whole spectrum of human cancer [6][7][8][9][10] . However, these studies were overwhelmingly performed in a single large cohort without externally validating the results on a large scale.…”
mentioning
confidence: 99%
“…A broad comparison to the results of Arslan et al is challenging due to the very limited overlap in biomarkers between their work and ours. However, for CDK4, which is the only shared RNA biomarker, they report at AUROC of around 0.72 (averaged across 3 cancer types) [16], whereas we obtain an AUROC of 0.84 pan-cancer; of 0.75 in a stratified analysis, averaged across all cancer types; and of 0.77 when filtering to 4 relevant cancer types (specifically, breast, colorectal, lung, and pancreatic).…”
Section: Discussionmentioning
confidence: 60%
“…A broad comparison to the results of Arslan et al is challenging due to the very limited overlap in biomarkers between their work and ours. However, for CDK4, which is the only shared RNA biomarker, they report at AUROC of around 0.72 (averaged across 3 cancer types) [16],…”
Section: Machine Learning Lessonsmentioning
confidence: 99%
“…Recent studies have shown that machine learning methods can use tumor histology to predict key disease characteristics, including grade [36][37][38] , immune infiltration [39][40][41] , HPV status 42 , mutation status 19,[43][44][45] , expression of individual genes 20,46,47 , and established multi-omic features 41 , but have not attempted to predict de novo learned, complex transcriptional features, such as our CLFs. To assess whether and how each CLF was histologically encoded, we trained a deep learning network to predict binarized CLF status (high or low) from tumor histology images (Fig.…”
Section: Transcriptional Clf Weights Can Be Predicted From Tumor Hist...mentioning
confidence: 99%