2021
DOI: 10.48550/arxiv.2105.03020
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Structured dataset documentation: a datasheet for CheXpert

Abstract: Billions of X-ray images are taken worldwide each year. Machine learning, and deep learning in particular, has shown potential to help radiologists triage and diagnose images. However, deep learning requires large datasets with reliable labels.The CheXpert dataset was created with the participation of board-certified radiologists, resulting in the strong ground truth needed to train deep learning networks.Following the structured format of Datasheets for Datasets, this paper expands on the original CheXpert pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 6 publications
(23 reference statements)
0
1
0
Order By: Relevance
“…Classifying pediatric cases is a potential issue for models trained on adult x-rays. This is noted, for example, in the chest-radiograph dataset CheXpert's datasheet [27]. In order to have enough examples of incorrectly-classified anomalous input, one anomalous input that was classified correctly with 73% confidence was instead presented as classified incorrectly with 60% confidence.…”
Section: Example Selection and Presentationmentioning
confidence: 99%
“…Classifying pediatric cases is a potential issue for models trained on adult x-rays. This is noted, for example, in the chest-radiograph dataset CheXpert's datasheet [27]. In order to have enough examples of incorrectly-classified anomalous input, one anomalous input that was classified correctly with 73% confidence was instead presented as classified incorrectly with 60% confidence.…”
Section: Example Selection and Presentationmentioning
confidence: 99%