Multi-Region bilinear convolutional neural networks for person re-identification

Ustinova, Evgeniya; Ganin, Yaroslav; Lempitsky, Victor

doi:10.1109/avss.2017.8078460

Cited by 121 publications

(81 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Part-based algorithms: By performing bilinear pooling in a more local way, an embedding can be learned, in which each pooling is confined to a predefined region [25]. Inspired by attention models, in [16,14,21], the attention-based deep neural networks are proposed to capture multiple attentions and select multi-scale attentive features.…”

Section: Related Workmentioning

confidence: 99%

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training

Zheng

Deng

Sun

et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

345

204

View full text Add to dashboard Cite

Most existing Re-IDentification (Re-ID) methods are highly dependent on precise bounding boxes that enable images to be aligned with each other. However, due to the challenging practical scenarios, current detection models often produce inaccurate bounding boxes, which inevitably degenerate the performance of existing Re-ID algorithms. In this paper, we propose a novel coarse-to-fine pyramid model to relax the need of bounding boxes, which not only incorporates local and global information, but also integrates the gradual cues between them. The pyramid model is able to match at different scales and then search for the correct image of the same identity, even when the image pairs are not aligned. In addition, in order to learn discriminative identity representation, we explore a dynamic training scheme to seamlessly unify two losses and extract appropriate shared information between them. Experimental results clearly demonstrate that the proposed method achieves the state-of-the-art results on three datasets. Especially, our approach exceeds the current best method by 9.5% on the most challenging CUHK03 dataset.

show abstract

Section: Related Workmentioning

confidence: 99%

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training

Zheng

Deng

Sun

et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

345

204

View full text Add to dashboard Cite

show abstract

“…This strategy has been widely adopted in fine-grained recogni- tion [53][54][55] and shows promising performance. For person re-identification, Ustinova et al [56] adopted a bilinear pooling to aggregate two different appearance maps; this method does not generate part-aligned representations and leads to poor performance. Our approach uses a bilinear pooling to aggregate appearance and part maps to compute part-aligned representations.…”

Section: Related Workmentioning

confidence: 99%

Part-Aligned Bilinear Representations for Person Re-identification

Suh

Wang

Tang

et al. 2018

Lecture Notes in Computer Science

464

247

View full text Add to dashboard Cite

We propose a novel network that learns a part-aligned representation for person re-identification. It handles the body part misalignment problem, that is, body parts are misaligned across human detections due to pose/viewpoint change and unreliable detection. Our model consists of a two-stream network (one stream for appearance map extraction and the other one for body part map extraction) and a bilinear-pooling layer that generates and spatially pools a partaligned map. Each local feature of the part-aligned map is obtained by a bilinear mapping of the corresponding local appearance and body part descriptors. Our new representation leads to a robust image matching similarity, which is equivalent to an aggregation of the local similarities of the corresponding body parts combined with the weighted appearance similarity. This part-aligned representation reduces the part misalignment problem significantly. Our approach is also advantageous over other pose-guided representations (e.g., extracting representations over the bounding box of each body part) by learning part descriptors optimal for person re-identification. For training the network, our approach does not require any part annotation on the person re-identification dataset. Instead, we simply initialize the part sub-stream using a pre-trained sub-network of an existing pose estimation network, and train the whole network to minimize the re-identification loss. We validate the effectiveness of our approach by demonstrating its superiority over the state-of-the-art methods on the standard benchmark datasets, including Market-1501, CUHK03, CUHK01 and DukeMTMC, and standard video dataset MARS.

show abstract

“…Wu et al [79] improved performance based on Ahmed's idea by using a deeper architecture and a new optimization method. Other deep network structures such as [69] and [65] have been designed which also effectively solved the ReID problem on older ReID datasets. Qui et al [54] attempted to perform facial ReID by using domain adaptation methods to reconcile different facial poses; however, their experiments were performed on the Multi-PIE [15] dataset, in which face images have controlled poses and illuminations.…”

Section: B Face Re-identificationmentioning

confidence: 99%

On Low-Resolution Face Recognition in the Wild: Comparisons and New Techniques

Prieto

Mery

et al. 2019

IEEE Trans.Inform.Forensic Secur.

138

View full text Add to dashboard Cite

Although face recognition systems have achieved impressive performance in recent years, the low-resolution face recognition task remains challenging, especially when the lowresolution faces are captured under non-ideal conditions, as is common in surveillance-based applications. Faces captured in such conditions are often contaminated by blur, non-uniform lighting, and non-frontal face pose. In this paper, we analyze face recognition techniques using data captured under lowquality conditions in the wild. We provide a comprehensive analysis of experimental results for two of the most important applications in real surveillance applications, and demonstrate practical approaches to handle both cases that show promising performance. The following three contributions are made: (i) we conduct experiments to evaluate super-resolution methods for low-resolution face recognition; (ii) we study face re-identification on various public face datasets including real surveillance and low-resolution subsets of large-scale datasets, present a baseline result for several deep learning based approaches, and improve them by introducing a Generative Adversarial Network (GAN) pre-training approach and fully convolutional architecture; and (iii) we explore low-resolution face identification by employing a state-of-the-art supervised discriminative learning approach. Evaluations are conducted on challenging portions of the SCface and UCCSface datasets.

show abstract

Multi-Region bilinear convolutional neural networks for person re-identification

Cited by 121 publications

References 28 publications

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training

Part-Aligned Bilinear Representations for Person Re-identification

On Low-Resolution Face Recognition in the Wild: Comparisons and New Techniques

Contact Info

Product

Resources

About