Procedings of the British Machine Vision Conference 2017 2017
DOI: 10.5244/c.31.69
|View full text |Cite
|
Sign up to set email alerts
|

Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization

Abstract: Most existing methods for pedestrian attribute recognition in video surveillance can be formulated as a multi-label image classification methodology, while attribute localization is usually disregarded due to the low image qualities and large variations of camera viewpoints and human poses. In this paper, we propose a weakly-supervised learning based approaching to implementing multi-attribute classification and localization simultaneously, without the need of bounding box annotations of attributes. Firstly, a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
27
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(27 citation statements)
references
References 13 publications
0
27
0
Order By: Relevance
“…In order to verify the performance of the proposed method on the PETA and RAP datasets, we compared it with the two traditional pedestrian attribute recognition representative methods; ikSVM [2,17] and ELF [18], and another four deep learning-based methods, including ACN [5], DeepMAR [19], WPAL [8] and multi-task CNN [6].…”
Section: Comparisons With State-of-the-art Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to verify the performance of the proposed method on the PETA and RAP datasets, we compared it with the two traditional pedestrian attribute recognition representative methods; ikSVM [2,17] and ELF [18], and another four deep learning-based methods, including ACN [5], DeepMAR [19], WPAL [8] and multi-task CNN [6].…”
Section: Comparisons With State-of-the-art Methodsmentioning
confidence: 99%
“…In contrast to previous methods of attributes recognition using the whole pedestrian image, WPAL [8] first determines the location of different attributes by the weakly supervised method, obtaining the attribute detection result, and then predicts the attributes in the detected results.…”
Section: Comparisons With State-of-the-art Methodsmentioning
confidence: 99%
“…The contribution of each attribute determines the weight of the corresponding CNN. Zhou et al [21] first extracted mid-level features from detection layers using GoogLeNet. They localized the pedestrian attributes by fusing and clustering the activation maps of the detection layers.…”
Section: Related Workmentioning
confidence: 99%
“…Dividing the human body part into 15 parts, [14] train a separate CNN for each part where the weight of a CNN is based on the contribution of a particular attribute. [15] makes use of the GoogLeNet to extract mid-level features from detection layers. Activation maps of these detected layers are fused, and clustering is performed to localize pedestrian attributes.…”
Section: Related Workmentioning
confidence: 99%