2017
DOI: 10.1007/s10278-017-9976-3
|View full text |Cite
|
Sign up to set email alerts
|

Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session

Abstract: At the first annual Conference on Machine Intelligence in Medical Imaging (C-MIMI), held in September 2016, a conference session on medical image data and datasets for machine learning identified multiple issues. The common theme from attendees was that everyone participating in medical image evaluation with machine learning is data starved. There is an urgent need to find better ways to collect, annotate, and reuse medical imaging data. Unique domain issues with medical image datasets require further study, d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
94
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 168 publications
(94 citation statements)
references
References 26 publications
(20 reference statements)
0
94
0
Order By: Relevance
“…Research studies are generally complex and the resulting data are valuable, not only to the principal investigator but to society as a whole [123,124]. Nonetheless, many researchers remain reluctant to share their data with an expert audience [125,126] beyond describing them as part of peer-reviewed publications.…”
Section: Data Sharingmentioning
confidence: 99%
“…Research studies are generally complex and the resulting data are valuable, not only to the principal investigator but to society as a whole [123,124]. Nonetheless, many researchers remain reluctant to share their data with an expert audience [125,126] beyond describing them as part of peer-reviewed publications.…”
Section: Data Sharingmentioning
confidence: 99%
“…Particularly in the medical domain, this might be a strong assumption for a solution, as annotated data contains strong human bias. Although there has been a huge effort in the community to mitigate this drawback by providing datasets such as ChestX-ray14, the has annotations but is far from being a definite expression of ground truth [14]. Therefore, by using supervised learning techniques one allows the labelling error and uncertainty to adversely effect the classification output of our machine learn framework.…”
mentioning
confidence: 99%
“…Deep learning models such as CNN require voluminous data to train the model without overfitting. This is the biggest challenge in the biomedical images [38]. The data that is available is limited, and most of them are raw images without annotations.…”
Section: Data Augmentationmentioning
confidence: 99%