2020
DOI: 10.1109/jbhi.2020.2974425
|View full text |Cite
|
Sign up to set email alerts
|

Inaccurate Labels in Weakly-Supervised Deep Learning: Automatic Identification and Correction and Their Impact on Classification Performance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(22 citation statements)
references
References 27 publications
0
21
0
1
Order By: Relevance
“…H. Soleimani et al [ 29 ] employed the segmentation of pectoral muscle to locate cancerous regions in mammograms and used a deep learning algorithm for the classification. A deep learning data-driven-based approach was proposed [ 30 ] for automatic identification of cancerous regions from mammograms, which improved the performance of the classification. D. song et al [ 31 ] applied a deep neural network for breast cancer prognosis prediction from multidimensional data and achieved a specificity of 99%.…”
Section: Literature Reviewmentioning
confidence: 99%
“…H. Soleimani et al [ 29 ] employed the segmentation of pectoral muscle to locate cancerous regions in mammograms and used a deep learning algorithm for the classification. A deep learning data-driven-based approach was proposed [ 30 ] for automatic identification of cancerous regions from mammograms, which improved the performance of the classification. D. song et al [ 31 ] applied a deep neural network for breast cancer prognosis prediction from multidimensional data and achieved a specificity of 99%.…”
Section: Literature Reviewmentioning
confidence: 99%
“…However, since a relatively large amount of unstructured data, be it images, videos, or text responses, needs to be labeled or coded, human involvement is necessary. The coding quality can have a large impact on the accuracy of classifiers, depending on the relative number of miscodings [37,38]. Once again, a distinction must be made between incorrect codings where the label does not fit the data material and just disagreement between raters where a certain margin of interpretation allows varying codings (i.e., border cases).…”
Section: Gold Standard: Human Codingmentioning
confidence: 99%
“…As the name indicates, unsupervised learning based models work only with unlabelled data so no training phase is involved; whereas supervised techniques have the requirement of training over a large dataset often requiring costly data labelling [17]. Unsupervised algorithms most commonly attempt to discover a common pattern associated with the features being processed within the dataset [18].…”
Section: Proposed Approachmentioning
confidence: 99%
“…Homogeneity and Completeness are two critical characteristics of a cluster. V-measure, @ is given in (17). OE signifies the degree of weightage given to each of these two characteristics, and in this case it is '1' (equal weightage).…”
Section: E2c V-measurementioning
confidence: 99%