2020
DOI: 10.1016/j.patter.2020.100019
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Modal Data Programming Enables Rapid Medical Machine Learning

Abstract: Labeling training datasets has become a key barrier to building medical machine learning models. One strategy is to generate training labels programmatically, for example by applying natural language processing pipelines to text reports associated with imaging studies. We propose cross-modal data programming, which generalizes this intuitive strategy in a theoretically-grounded way that enables simpler, clinician-driven input, reduces required labeling time, and improves with additional unlabeled data. In this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
30
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 39 publications
(30 citation statements)
references
References 48 publications
0
30
0
Order By: Relevance
“…[Data Modalities] Bridging the Modality Gap. Solutions for similar cross-modal problems (see Section 8) assume that other tasks have already been trained for the target modality [36,45,58], or data of different modalities are directly connected [23]. Examples of direct connections are images paired with captions, 2D projections of 3D point clouds, or clinical notes and lab results.…”
Section: Cross-modal Challengesmentioning
confidence: 99%
See 3 more Smart Citations
“…[Data Modalities] Bridging the Modality Gap. Solutions for similar cross-modal problems (see Section 8) assume that other tasks have already been trained for the target modality [36,45,58], or data of different modalities are directly connected [23]. Examples of direct connections are images paired with captions, 2D projections of 3D point clouds, or clinical notes and lab results.…”
Section: Cross-modal Challengesmentioning
confidence: 99%
“…We refer to this process of adapting models to new data modalities as cross-modal adaptation (a form of transductive transfer learning [46]). Existing work in cross-modal adaptation assumes points across data modalities are easily or directly linked (e.g., captions directly linked to images, or clinical notes to lab results) to leverage zero-shot learning [45,58] or weak supervision [23]. However, in our environment, such direct connections do not often exist, resulting in a modality gap.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Due to the inherent property of Snorkel, it is widely used in various real life problems like surveillance with electronic health records 33 , clinical text classification 34 , web content and event classification 35 . Also, Snorkel is used for improving gene clusteing 36 and medical image training 37 .…”
mentioning
confidence: 99%