2021
DOI: 10.3390/e23040423
|View full text |Cite
|
Sign up to set email alerts
|

Co-Training for Visual Object Recognition Based on Self-Supervised Models Using a Cross-Entropy Regularization

Abstract: Automatic recognition of visual objects using a deep learning approach has been successfully applied to multiple areas. However, deep learning techniques require a large amount of labeled data, which is usually expensive to obtain. An alternative is to use semi-supervised models, such as co-training, where multiple complementary views are combined using a small amount of labeled data. A simple way to associate views to visual objects is through the application of a degree of rotation or a type of filter. In th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 27 publications
0
5
0
Order By: Relevance
“…First, each base classifier is initialized with labeled data, and then the classifier selects the “most confident” unlabeled data, assigns the predicted category of samples and adds them to the labeled data set. The labeled data set is updated and provided to another classifier, and the iterations are repeated until each classifier does not change ( 113 ).…”
Section: Discussionmentioning
confidence: 99%
“…First, each base classifier is initialized with labeled data, and then the classifier selects the “most confident” unlabeled data, assigns the predicted category of samples and adds them to the labeled data set. The labeled data set is updated and provided to another classifier, and the iterations are repeated until each classifier does not change ( 113 ).…”
Section: Discussionmentioning
confidence: 99%
“…In the latter case, a GAN-based virtual-to-real image translation [ 19 ] is used as pre-processing for the virtual-world images, i.e., before taking them for running the co-training procedure. Very recently, Díaz et al [ 26 ] presented co-training for visual object recognition. In other words, the paper addresses a classification problem, while we address both localization and classification to perform object detection.…”
Section: Related Workmentioning
confidence: 99%
“…In other words, the paper addresses a classification problem, while we address both localization and classification to perform object detection. While the different views proposed in [ 26 ] rely on self-supervision (e.g., forcing image rotations), here, these rely on data multi-modality. In fact, in our previous work [ 17 ], we used mirroring to force different data views, which can be considered as a kind of self-supervision too.…”
Section: Related Workmentioning
confidence: 99%
“…In the latter case, a GAN-based virtual-to-real image translation [37] is used as pre-processing for the virtualworld images, i.e., before taking them for running the co-training procedure. Very recently, Díaz et al [4] presented co-training for visual object recognition. In other words, the paper addresses a classification problem, while we address both localization and classification to perform object detection.…”
Section: Related Workmentioning
confidence: 99%
“…In other words, the paper addresses a classification problem, while we address both localization and classification to perform object detection. While the different views proposed in [4] rely on self-supervision (e.g., forcing image rotations), here, these rely on data multi-modality. In fact, in our previous work [28], we used mirroring to force different data views, which can be considered as a kind of self-supervision too.…”
Section: Related Workmentioning
confidence: 99%