2016
DOI: 10.48550/arxiv.1611.05603
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization

Abstract: State-of-the-art methods treat pedestrian attribute recognition as a multi-label image classification problem. The location information of person attributes is usually eliminated or simply encoded in the rigid splitting of whole body in previous work. In this paper, we formulate the task in a weakly-supervised attribute localization framework. Based on GoogLeNet, firstly, a set of mid-level attribute features are discovered by novelly designed detection layers, where a max-pooling based weakly-supervised objec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(18 citation statements)
references
References 18 publications
0
18
0
Order By: Relevance
“…Refer to (Li et al 2016a) for their definitions and explanations. We compare our approach with 14 existing counterparts, including HPNet (Liu et al 2017), JRL (Wang et al 2017), VeSPA (Sarfraz et al 2017), WPAL (Yu et al 2016), GAM (Fabbri, Calderara, and Cucchiara 2017), GRL (Zhao et al 2018), LGNet (Liu et al 2018), PGDM (Li et al 2018), VSGR , RCRA (Zhao et al 2019), I 2 ANet (Ji et al 2019), JLPLS (Tan et al 2019), CoCNN , and DCL (Wang et al 2019), as shown in Table 2. The samples in the RAP dataset are collected from real world surveillance scenarios, and compared to the ones in WIDER-Attribute, there are less distractions.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Refer to (Li et al 2016a) for their definitions and explanations. We compare our approach with 14 existing counterparts, including HPNet (Liu et al 2017), JRL (Wang et al 2017), VeSPA (Sarfraz et al 2017), WPAL (Yu et al 2016), GAM (Fabbri, Calderara, and Cucchiara 2017), GRL (Zhao et al 2018), LGNet (Liu et al 2018), PGDM (Li et al 2018), VSGR , RCRA (Zhao et al 2019), I 2 ANet (Ji et al 2019), JLPLS (Tan et al 2019), CoCNN , and DCL (Wang et al 2019), as shown in Table 2. The samples in the RAP dataset are collected from real world surveillance scenarios, and compared to the ones in WIDER-Attribute, there are less distractions.…”
Section: Resultsmentioning
confidence: 99%
“…When the total number of concerned attributes increases, the influence of the class imbalance problem can no longer be neglected. We thus also employ the weighted BCE-loss (Yu et al 2016) as:…”
Section: Training Schemementioning
confidence: 99%
See 1 more Smart Citation
“…Note that body parts are related to semantic attributes which are often specific to different body parts. A number of attributes based re-id models [43,36,51,11] have been proposed. They use attributes to provide additional supervision for learning identity-sensitive features.…”
Section: Related Workmentioning
confidence: 99%
“…As discussed in [24], [25], [26], SPD matrix transformation networks are capable of achieving the better performance than the original SPD matrix. Inspired by [37] and [21], we add a learnable layer to make the network more flexible and more adaptive to the specific task. Based on the SPD matrix generated by the kernel aggregation layer, we expect to transform the existing SPD representation to be a more discriminative, suitable and desirable matrix.…”
Section: Spd Matrix Transformation Layermentioning
confidence: 99%