ABD-Net: Attentive but Diverse Person Re-Identification

Chen, Tianlong; Ding, Shaojin; Xie, Jingyi; Yuan, Ye; Chen, Wuyang; Yang, Yang; Ren, Zhou; Wang, Zhangyang

doi:10.1109/iccv.2019.00844

Cited by 443 publications

(257 citation statements)

References 61 publications

Supporting

Mentioning

239

Contrasting

Order By: Relevance

“…Specifically, they aim to learn the features by improving loss functions [9,14,22,31,41,42,50,55,63], improving the training techniques [1,4,12,24,32,35,37,54], adding additional network modules [23,23,51,62], using extra semantic annotations [30,46,47,79] or generating more training samples [17,33,72,76,77]. Besides, more recent studies [3,6,8,10,27,28,38,46,48,53,58,61,64,65,67,80] integrate attention mechanisms into deep models to enhance the feature representation. To obtain the holistic features, most of these methods utilize global average pooling (GAP), global max pooling (GMP) or both of them on each channel of the feature map.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Discriminative Spatial Feature Learning for Person Re-Identification

Peng

Huang

Wang

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

View full text Add to dashboard Cite

Person re-identification (ReID) aims to match detected pedestrian images from multiple non-overlapping cameras. Most existing methods employ a backbone CNN to extract a vectorized feature representation by performing some global pooling operations (such as global average pooling and global max pooling) on the 3D feature map (i.e., the output of the backbone CNN). Although simple and effective in some situations, the global pooling operation only focuses on the statistical properties and ignores the spatial distribution of the feature map. Hence, it can not distinguish two feature maps when they have similar response values located in totally different positions. To handle this challenge, a novel method is proposed to learn the discriminative spatial features. Firstly, a self-constrained spatial transformer network (SC-STN) is introduced to handle the misalignments caused by detection errors. Then, based on the prior knowledge that the spatial structure of a pedestrian often keeps robust in vertical orientation of images, a novel vertical convolution network (VCN) is proposed to extract the spatial feature in vertical. Extensive experimental evaluations on several benchmarks demonstrate that the proposed method achieves state-of-theart performances by introducing only a few parameters to the backbone.

show abstract

Section: Related Workmentioning

confidence: 99%

“…In the spatial feature branch, a SC-STN is firstly employed to refine the feature map, and then a VCN is introduced to extract the spatial feature. For both branches, the label-smoothed cross-entropy loss [8,32,53] and the ranked list loss [50] are utilized to make the features discriminative.…”

Section: Related Workmentioning

confidence: 99%

Discriminative Spatial Feature Learning for Person Re-Identification

Peng

Huang

Wang

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

View full text Add to dashboard Cite

show abstract

“…The performance of our proposed person Re-ID method is compared with the state-of-the-art methods on both Market-1501 and DukeMTMC-reID datasets. The employed comparison methods include AlignedReID [45], IDE (ID-discriminative embedding) [39], SVDNet (singular vector decomposition net) [46], TriNet (triplet net) [30], Pyramid [47], AWTL (adaptive weighted triplet loss) [48], ABD-Net (Attentive but Diverse Net) [49], DSA-reID (Densely Semantically Aligned reID) [50], the baseline method in [51], and the baseline method together with triplet loss [51].…”

Section: A Person Re-id Performancementioning

confidence: 99%

“…In practical industry applications that require simple but effective solutions, our proposed method could be one of the promising counterbalanced solutions to real-world person ReID tasks. [39] 79.5% 59.5% --SVDNet [46] 82.3% 62.1% 76.7% 56.8% TriNet [30] 84.9% 69.1% --Pyramid [47] 92.8% 82.1% --AWTL [48] 89.5% 75.7% 79.8% 63.4% ABD-Net [49] 95.6% 88.3% 89.0% 78.6% DSA-reID [50] 95.7% 87.6% 86.2% 74.3% Baseline [51] 93 When comparing the performance of our proposed ADCSLL and ADCSLL + triplet loss, the results show that ADCSLL together with triplet loss achieves better performance than ADCSLL alone. The rank-1 accuracies when using ADCSLL + triplet loss on Market-1501 and DukeMTMC-reID are 95.0% and 88.6%, respectively, 0.2% and 1.1% higher than the numbers when using ADCSLL alone.…”

Section: A Person Re-id Performancementioning

confidence: 99%

Person Re-Identification Using Additive Distance Constraint With Similar Labels Loss

Lei

Tang³

et al. 2020

IEEE Access

View full text Add to dashboard Cite

Despite the promising progress made in recent years, person re-identification (Re-ID) remains a challenging task due to the intra-class variations. Most of the current studies used the traditional Softmax loss for solutions, but its discriminative capability encounters a bottleneck. Therefore, how to improve person Re-ID performance is still a challenging task. To address this problem, we proposed a novel loss function, namely additive distance constraint with similar labels loss (ADCSLL). Specifically, we reformulated the Softmax loss by adding a distance constraint to the ground truth label, based on which similar labels were introduced to enhance the learned features to be much more stable and centralized. Experimental evaluations were conducted on two popular datasets (Market-1501 and DukeMTMC-reID) to examine the effectiveness of our proposed method. The results showed that our proposed ADCSLL was more discriminative than most of the other compared state-of-the-art methods. The rank-1 accuracy and the mAP on Market-1501 were 95.0% and 87.0%, respectively. The numbers were 88.6% and 77.2% on DukeMTMC-reID, respectively.

show abstract

“…The deep learning model automatically decides through backpropagation what features to be extracted. Various deep neural models reported in the literature [6,32,33] can re-identify the individual in the presence of extreme distortion. Various pre-defined models that are AluxNet, Caffenet, Googlenet, VGG networks, ResNet, and SVDnet have also been used as feature extraction criteria for the person Re-ID.…”

Section: Introductionmentioning

confidence: 99%

Person Re-Identification by Discriminative Local Features of Overlapping Stripes

2020

View full text Add to dashboard Cite

The human visual system can recognize a person based on his physical appearance, even if extreme spatio-temporal variations exist. However, the surveillance system deployed so far fails to re-identify the individual when it travels through the non-overlapping camera’s field-of-view. Person re-identification (Re-ID) is the task of associating individuals across disjoint camera views. In this paper, we propose a robust feature extraction model named Discriminative Local Features of Overlapping Stripes (DLFOS) that can associate corresponding actual individuals in the disjoint visual surveillance system. The proposed DLFOS model accumulates the discriminative features from the local patch of each overlapping strip of the pedestrian appearance. The concatenation of histogram of oriented gradients, Gaussian of color, and the magnitude operator of CJLBP bring robustness in the final feature vector. The experimental results show that our proposed feature extraction model achieves rank@1 matching rate of 47.18% on VIPeR, 64.4% on CAVIAR4REID, and 62.68% on Market1501, outperforming the recently reported models from the literature and validating the advantage of the proposed model.

show abstract

ABD-Net: Attentive but Diverse Person Re-Identification

Cited by 443 publications

References 61 publications

Discriminative Spatial Feature Learning for Person Re-Identification

Discriminative Spatial Feature Learning for Person Re-Identification

Person Re-Identification Using Additive Distance Constraint With Similar Labels Loss

Person Re-Identification by Discriminative Local Features of Overlapping Stripes

Contact Info

Product

Resources

About