2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020
DOI: 10.1109/cvprw50498.2020.00018
|View full text |Cite
|
Sign up to set email alerts
|

Minimizing Supervision in Multi-label Categorization

Abstract: Multiple categories of objects are present in most images. Treating this as a multi-class classification is not justified. We treat this as a multi-label classification problem. In this paper, we further aim to minimize the supervision required for providing supervision in multi-label classification. Specifically, we investigate an effective class of approaches that associate a weak localization with each category either in terms of the bounding box or segmentation mask. Doing so improves the accuracy of multi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 27 publications
0
1
0
Order By: Relevance
“…These datasets consist of a large vocabulary in an unconstrained environment with thousands of speakers. However, this conventional task needs data to be annotated by the human [17](i.e., transcribe speech into text format). On the other hand, the lip to speech generation task does not require any annotations; hence it has drawn considerable attention as an alternative form of lip reading.…”
Section: Introductionmentioning
confidence: 99%
“…These datasets consist of a large vocabulary in an unconstrained environment with thousands of speakers. However, this conventional task needs data to be annotated by the human [17](i.e., transcribe speech into text format). On the other hand, the lip to speech generation task does not require any annotations; hence it has drawn considerable attention as an alternative form of lip reading.…”
Section: Introductionmentioning
confidence: 99%