2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
DOI: 10.1109/cvpr.2018.00225

Multi-level Factorisation Net for Person Re-identification

Abstract: Key to effective person re-identification (Re-ID) is modelling discriminative and view-invariant factors of person appearance at both high and low semantic levels. Recently developed deep Re-ID models either learn a holistic single semantic level feature representation and/or require laborious human annotation of these factors as attributes. We propose Multi-Level Factorisation Net (MLFN), a novel network architecture that factorises the visual appearance of a person into latent discriminative factors at multi…

Cited by 493 publications (315 citation statements)
References 49 publications (140 reference statements)
“…We apply binary human masks (1 for non-human pixels and 0 for human pixels) to remove the influence of pixels predicted as human parts, which is called Latent w/o HP. (2) Only use human-part information within the latent part branch. We also apply binary human masks (1 for human pixels and 0 for non-human pixels) to remove the influence of pixels predicted as non-human parts, which is called Latent w/o NHP.…”
Section: Ablation Study
confidence: 99%
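Both ablations reduce to an element-wise product between the input and a binary mask. Below is a minimal PyTorch sketch of that idea (not the cited paper's code; the tensor shapes and function names are illustrative assumptions):

```python
import torch

def latent_wo_hp(images: torch.Tensor, human_mask: torch.Tensor) -> torch.Tensor:
    """Latent w/o HP: suppress pixels predicted as human parts.

    images:     (B, 3, H, W) input batch
    human_mask: (B, 1, H, W), 1 for human pixels, 0 for non-human pixels
    """
    # Keep only non-human pixels by inverting the human-part mask.
    return images * (1.0 - human_mask)

def latent_wo_nhp(images: torch.Tensor, human_mask: torch.Tensor) -> torch.Tensor:
    """Latent w/o NHP: suppress pixels predicted as non-human parts."""
    # Keep only human pixels.
    return images * human_mask
```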
“…The details are as follows: (1) We prepare each mini-batch by randomly sampling 16 classes (identities) and 4 images per class. (2) We set the weight ratio to 1:1 on all three datasets. (3) Given a mini-batch of 64 samples, we construct a triplet for each sample by choosing the hardest positive sample and the hardest negative sample, measured by their Euclidean distances.…”
Section: Triplet Loss
confidence: 99%
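This mini-batch design is the standard PK sampling plus batch-hard mining scheme. A minimal PyTorch sketch, assuming raw embedding vectors and an illustrative margin value (the margin is not stated in the quoted excerpt):

```python
import torch

def batch_hard_triplet_loss(embeddings: torch.Tensor,
                            labels: torch.Tensor,
                            margin: float = 0.3) -> torch.Tensor:
    """Batch-hard triplet loss for a 16-identity x 4-image mini-batch.

    embeddings: (64, D) feature vectors; labels: (64,) identity ids.
    """
    # Pairwise Euclidean distances, shape (64, 64).
    dist = torch.cdist(embeddings, embeddings, p=2)
    same_id = labels.unsqueeze(0) == labels.unsqueeze(1)  # (64, 64) bool

    # Hardest positive: the farthest sample sharing the anchor's identity.
    pos_dist = dist.clone()
    pos_dist[~same_id] = 0.0
    hardest_pos = pos_dist.max(dim=1).values

    # Hardest negative: the closest sample with a different identity.
    neg_dist = dist.clone()
    neg_dist[same_id] = float('inf')
    hardest_neg = neg_dist.min(dim=1).values

    return torch.relu(hardest_pos - hardest_neg + margin).mean()
```

With 16 identities and 4 images each, every anchor has 3 positives and 60 negatives to mine from, which is what makes the per-anchor hardest-pair selection meaningful.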
“…
| Model | Market-1501 rank-1 | Market-1501 mAP | DukeMTMC-reID rank-1 | DukeMTMC-reID mAP |
|---|---|---|---|---|
| SVDNet | 82.3 | 62.1 | 76.7 | 56.8 |
| PAN | 82.8 | 63.4 | 71.6 | 51.5 |
| MultiScale (Chen et al., 2017) | 88.9 | 73.1 | 79.2 | 60.6 |
| MLFN (Chang et al., 2018) | 90.0 | 74.3 | 81.0 | 62.8 |
| HA-CNN (Li et al., 2018) | 91.2 | 75.7 | 80.5 | 63.8 |
| Mancs (Wang et al., 2018a) | 93.1 | 82.3 | 84.9 | 71.8 |
| Attention-Driven (Yang et al., 2019) | 94.9 | 86.4 | 86.0 | 74.5 |
| PCB+RPP (Sun et al., 2018) | 93.8 | 81.6 | 83.3 | 69.2 |
| HPM (Fu et al., 2018) | 94.2 | 82.7 | 86.6 | 74.3 |
| MGN (Wang et al., 2018b) | 95.7 | 86.9 | 88.7 | 78.4 |
| VMRFANet (Ours) | 95.5 | 88.1 | 88.9 | 80.0 |

Table 3: Comparison of results on CUHK03-labeled (CUHK03-L) and CUHK03-detected (CUHK03-D) with the new protocol (Zhong et al., 2017a). The best results are in bold, while underlined numbers denote the second best.…”
Section: Market1501
confidence: 99%
“…
| Model | CUHK03-L rank-1 | CUHK03-L mAP | CUHK03-D rank-1 | CUHK03-D mAP |
|---|---|---|---|---|
| SVDNet | 40.9 | 37.8 | 41.5 | 37.3 |
| MLFN (Chang et al., 2018) | 54.7 | 49.2 | 52.8 | 47.8 |
| HA-CNN (Li et al., 2018) | 44.4 | 41.0 | 41.7 | 38.6 |
| PCB+RPP (Sun et al., 2018) | – | – | 63.7 | 57.5 |
| MGN (Wang et al., 2018b) | 68.0 | 67.4 | 68.0 | 66.0 |
| MRFANet (Ours) | 81.1 | 78.8 | 78.9 | 75.3 |
…”
Section: Market1501
confidence: 99%
“…Supervised person re-id: Most existing person re-id models are created by supervised learning methods on a separate set of cross-camera identity-labelled training data (Wang et al., 2014b, 2016b; Zhao et al., 2017; Chen et al., 2017; Li et al., 2017; Chen et al., 2018b; Li et al., 2018b; Song et al., 2018; Chang et al., 2018; Sun et al., 2018; Shen et al., 2018a; Wei et al., 2018; Hou et al., 2019; Zheng et al., 2019; Zhang et al., 2019; Quan et al., 2019; Zhou et al., 2019). Relying on the strong supervision of cross-camera identity-labelled training data, they have achieved a remarkable performance boost.…”
Section: Related Work
confidence: 99%