2019
DOI: 10.1109/tcsvt.2018.2884173
|View full text |Cite
|
Sign up to set email alerts
|

Learning Object Detectors With Semi-Annotated Weak Labels

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(5 citation statements)
references
References 41 publications
0
5
0
Order By: Relevance
“…Thus the proposed method outperforms the generic few-shot models. Based on our analysis, the TOAN framework can also be extended to other weakly-supervised tasks, such as objection detection [70], [71], localization [72], [73], and segmentation [74], etc., where only the image-level supervision is available. The intra-class and inter-class variances in these tasks can be modeled by the proposed target-oriented matching mechanism and global pair-wise bilinear pooling operation, respectively.…”
Section: Generic Few-shot Learningmentioning
confidence: 99%
“…Thus the proposed method outperforms the generic few-shot models. Based on our analysis, the TOAN framework can also be extended to other weakly-supervised tasks, such as objection detection [70], [71], localization [72], [73], and segmentation [74], etc., where only the image-level supervision is available. The intra-class and inter-class variances in these tasks can be modeled by the proposed target-oriented matching mechanism and global pair-wise bilinear pooling operation, respectively.…”
Section: Generic Few-shot Learningmentioning
confidence: 99%
“…Our first limitation is that the targeted domain for reviewed papers in this survey is comparatively narrow, i.e., surveillance BVD. A broader version can be the complete video analytics domain, covering action and activity recognition [60], video summarization, [61], video retrieval [7], healthcare [62], objects detection and tracking [63,64], etc. The specific focus of our research is to conduct an in-depth review of fuzzy methods applied to the generic video analysis domain, towards deriving a proper taxonomy of the applied fuzzy techniques.…”
Section: Representative Surveys In Fuzzy Logic and Our Surveymentioning
confidence: 99%
“…In order to accurately locate different keypoints of the targets, most deep models take multi-scale or high-resolution information into account [21], [26], [28], whilst looking at contextual information. In general, contextual information is referred to as regions surrounding the targets, and it has been proved effective in pose estimation [26], [34], object detection [37], [38], co-saliency detection [39]. However, these models ignore the difference between different keypoints to some extent, which are significant to mouse pose estimation due to relatively weak spatial correlation caused by highly deformable mouse body.…”
Section: A Structured Context Mixermentioning
confidence: 99%