AANet: Attribute Attention Network for Person Re-Identifications

Tay, Chiat-Pin; Roy, Sharmili; Yap, Kim–Hui

doi:10.1109/cvpr.2019.00730

Cited by 315 publications

(155 citation statements)

References 24 publications

Supporting

Mentioning

155

Contrasting

Order By: Relevance

“…We compare the proposed method with 33 recent published works including (1) global feature based methods which aims to learn the global feature from the feature map directly, including PAN [74], DMML [7], DCDS [1], VCFL [30], MVPM [41], LRDNN [79], RB [35], LITM [63], IANet [23], Sphere [14], BNNeck [32], OSNet [78], AANet [46], DG-Net [72], BDB [12], Circle [42], SFT [31], (2) part based methods including PCB+RPP [43], Local [57], HPM [16], CASN [71], AutoReID [34], MGN [49], BHP [20] and Pyramidal [68] which utilize the semantic parts or horizontal stripes to extract part-level feature, and (3) attention based methods including MHAN [3], CAMA [58], SONA [53], CAR [80], SCAL [6], ABD-Net [8], DAAF [10] and RGA [65]. These methods are categorized into 3 types based on different backbones: the ones which employ ResNet-50 directly, the ones which modify ResNet-50 by introducing additional branches, attention subnets or dilated convolution, and the others which don't use ResNet-50.…”

Section: Comparison Resultsmentioning

confidence: 99%

“…Holistic Features Based Methods Given a backbone C-NN such as ResNet-50 [21] or other network architectures [2,51,71,78], this type of methods learns discriminative holistic features from the feature map directly. Specifically, they aim to learn the features by improving loss functions [9,14,22,31,41,42,50,55,63], improving the training techniques [1,4,12,24,32,35,37,54], adding additional network modules [23,23,51,62], using extra semantic annotations [30,46,47,79] or generating more training samples [17,33,72,76,77]. Besides, more recent studies [3,6,8,10,27,28,38,46,48,53,58,61,64,…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Discriminative Spatial Feature Learning for Person Re-Identification

Peng

Huang

Wang

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

View full text Add to dashboard Cite

Person re-identification (ReID) aims to match detected pedestrian images from multiple non-overlapping cameras. Most existing methods employ a backbone CNN to extract a vectorized feature representation by performing some global pooling operations (such as global average pooling and global max pooling) on the 3D feature map (i.e., the output of the backbone CNN). Although simple and effective in some situations, the global pooling operation only focuses on the statistical properties and ignores the spatial distribution of the feature map. Hence, it can not distinguish two feature maps when they have similar response values located in totally different positions. To handle this challenge, a novel method is proposed to learn the discriminative spatial features. Firstly, a self-constrained spatial transformer network (SC-STN) is introduced to handle the misalignments caused by detection errors. Then, based on the prior knowledge that the spatial structure of a pedestrian often keeps robust in vertical orientation of images, a novel vertical convolution network (VCN) is proposed to extract the spatial feature in vertical. Extensive experimental evaluations on several benchmarks demonstrate that the proposed method achieves state-of-theart performances by introducing only a few parameters to the backbone.

show abstract

Section: Comparison Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Discriminative Spatial Feature Learning for Person Re-Identification

Peng

Huang

Wang

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

View full text Add to dashboard Cite

show abstract

“…Semantic attributes [46,25,7] have been exploited as feature representations for person reidentification tasks. Previous work [47,6,20,42,58] leverages the attribute labels provided by original dataset to generate attribute-aware feature representation. Different from previous work, our latent part branch can attend to important visual cues without relying on detailed supervision signals from the limited predefined attributes.…”

Section: Related Workmentioning

confidence: 99%

Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification

Guo

Yuan

Huang

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

173

View full text Add to dashboard Cite

Person re-identification is a challenging task due to various complex factors. Recent studies have attempted to integrate human parsing results or externally defined attributes to help capture human parts or important object regions. On the other hand, there still exist many useful contextual cues that do not fall into the scope of predefined human parts or attributes. In this paper, we address the missed contextual cues by exploiting both the accurate human parts and the coarse non-human parts. In our implementation, we apply a human parsing model to extract the binary human part masks and a self-attention mechanism to capture the soft latent (non-human) part masks. We verify the effectiveness of our approach with new state-of-the-art performances on three challenging benchmarks: Market-1501, DukeMTMC-reID and CUHK03. Our implementation is available at https://github.com/ggjy/P2Net.pytorch.

show abstract

“…Most re-ID methods are in a supervised manner, in which sufficient labeled images are given. Recently, with the developing of deep learning approaches [36,35,34], methods with convolutional neural networks have dominated the re-ID community [12,26,45,46,25,16]. Specifically, methods proposed to learn discriminative features from parts of pedestrian images achieve impressive performance [24,8,23].…”

Section: Supervised Person Re-identificationmentioning

confidence: 99%

“…Given a query image, person re-identification (re-ID) aims to match the person across multiple non-overlapped cameras. In the last few years, person re-ID has drawn increasing research attention [12,45,46,25,24,23], due to its wide range of applications such as finding people of interest (e.g., lost kids or criminals) and person tracking. However, most of the proposed methods are of supervised manner, which requires intensive manual labeling and is not applicable to real-world applications.…”

Section: Introductionmentioning

confidence: 99%

Unsupervised Person Re-identification via Cross-Camera Similarity Exploration

Lin

Yan

et al. 2020

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

Person re-identification (re-ID) is an important topic in computer vision. This paper studies the unsupervised setting of re-ID, which does not require any labeled information and thus is freely deployed to new scenarios. There are very few studies under this setting, and one of the best approach till now used iterative clustering and classification, so that unlabeled images are clustered into pseudo classes for a classifier to get trained, and the updated features are used for clustering and so on. This approach suffers two problems, namely, the difficulty of determining the number of clusters, and the hard quantization loss in clustering. In this paper, we follow the iterative training mechanism but discard clustering, since it incurs loss from hard quantization, yet its only product, image-level similarity, can be easily replaced by pairwise computation and a softened classification task. With these improvements, our approach becomes more elegant and is more robust to hyperparameter changes. Experiments on two image-based and video-based datasets demonstrate state-of-the-art performance under the unsupervised re-ID setting.

show abstract

AANet: Attribute Attention Network for Person Re-Identifications

Cited by 315 publications

References 24 publications

Discriminative Spatial Feature Learning for Person Re-Identification

Discriminative Spatial Feature Learning for Person Re-Identification

Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification

Unsupervised Person Re-identification via Cross-Camera Similarity Exploration

Contact Info

Product

Resources

About