2021 8th Swiss Conference on Data Science (SDS) 2021
DOI: 10.1109/sds51136.2021.00012
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Un-, Weakly-, and Semi-Supervised Learning Methods for Noisy, Missing and Partial Labels in Industrial Vision Applications

Abstract: When applying deep learning methods in an industrial vision application, they often fall short of the performance shown in a clean and controlled lab environment due to data quality issues. Few would consider the actual labels as a driving factor, yet inaccurate label data can impair model performance significantly. However, being able to mitigate inaccurate or incomplete labels might also be a cost-saver for real-world projects.Here, we survey state-of-the-art deep learning approaches to resolve such missing … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 56 publications
(57 reference statements)
0
3
0
Order By: Relevance
“…Limited amount of training data and noisy labels of public datasets are other factors contributing to low classification accuracies. One possible way to tackle this limitation is to rely on weakly supervised learning methods to improve the COVID-19 classification accuracy with the methodology summarized in [37].…”
Section: Discussionmentioning
confidence: 99%
“…Limited amount of training data and noisy labels of public datasets are other factors contributing to low classification accuracies. One possible way to tackle this limitation is to rely on weakly supervised learning methods to improve the COVID-19 classification accuracy with the methodology summarized in [37].…”
Section: Discussionmentioning
confidence: 99%
“…To the contrary: First, it reduces the validity of performance result comparisons of the different systems if this occurs in the test dataset. Second, it is detrimental for the learning of MER systems, if it occurs in the training dataset (by teaching the model that the same input has ambiguous output, leading to reduced learning [30]). In order to minimize these meaningless variations in the GT of im2latex-100k, we adopted a data-centric approach to develop a new LaTeX normalization procedure.…”
Section: Detrimental Latex Variationsmentioning
confidence: 99%
“…In this paper, we present an approach to identify vertebrae of the spine automatically without the need of excessive labeling of own data (or even no labels at all), thereby heralding a data-centric approach [ 12 ] based on un- or semi-supervised learning [ 13 ]. To this end, our contribution is the development and evaluation of a novel method that requires no labels at all to achieve reliable vertebrae detection and identification and, if given less than 5% of the labels we perform on par with comparable supervised approaches.…”
Section: Introductionmentioning
confidence: 99%