Visible-Infrared Person Re-Identification: A Comprehensive Survey and a New Setting

Zheng, Huantao; Zhong, Xian; Huang, Wenxin; Jiang, Kui; Liu, Wenxuan; Wang, Zheng

doi:10.3390/electronics11030454

Cited by 13 publications

(14 citation statements)

References 86 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In light of the challenges posed by low-light conditions and their impact on the adequate capture of comprehensive person appearance information using a single visible light camera, the integration of visible light and infrared images has emerged as a trend in person re-identification research. Zheng et al [22] conducted a comprehensive and meticulous study, offering an in-depth survey of prevailing methodologies concerning the fusion of visible and infrared light data. They embarked on a detailed examination of various facets related to image fusion, encompassing crucial aspects such as data structure, encountered challenges, and performance evaluation metrics.…”

Section: Person Re-identificationmentioning

confidence: 99%

A person re‐identification method for sports event scenes incorporating textual information mining

Wang,

Zhu,

Wan

et al. 2024

IET Image Processing

View full text Add to dashboard Cite

Person re‐identification represents a pivotal sub‐problem in image retrieval, boasting broad application prospects in fields such as intelligent security and video surveillance. However, most existing person re‐identification methods predominantly focus solely on visual features pertaining to the person targets, thereby disregarding some supporting information closely related to the scene context. In the context of athlete re‐identification during sports event scenes, the athlete bib number is fully considered, an important clue that can provide different athletes' identities, and the traditional visual features of the person and high‐level semantic information of the bib number text are fused. A multi‐source information mutual gain mechanism is designed to improve the accuracy of the person re‐identification task. In the existing only publicly available marathon bib number dataset RBNR, the recognition accuracy of this method is significantly superior to that of the existing person re‐identification method. In addition, this paper constructs and publishes an athlete re‐identification dataset (HNNU‐ReID8000) for mainstream sports events, and the mean average precision (mAP) value of this method reaches 96.1% on this dataset, significantly ahead of existing state‐of‐the‐art person re‐identification methods. The code and the HNNU‐ReID8000 dataset will be released at https://github.com/yanbin‐zhu/zyb_person‐reid.

show abstract

Section: Person Re-identificationmentioning

confidence: 99%

A person re‐identification method for sports event scenes incorporating textual information mining

Wang,

Zhu,

Wan

et al. 2024

IET Image Processing

View full text Add to dashboard Cite

show abstract

“…Single-modal person Re-ID matches probe samples with gallery samples, and all samples are taken from the same modality (i.e., RGB–RGB or IR–IR matching) [ 65 ]. Unlike single-modal Re-ID, cross-modal Re-ID aims to match the probe sample taken from one modality against a gallery set from another modality, such as RGB–IR, RGB–depth, or sketch–RGB images [ 13 , 52 , 66 ], as shown in Figure 10 .…”

Section: Cross-modal Person Re-identificationmentioning

confidence: 99%

“…RGB–IR-based Person Re-ID is the most widely studied cross-modal setting over all the other alternatives, thanks to the introduction of the SYSU-MM01 dataset [ 45 ], which initiates the path of an RGB–IR-based cross-modal Re-ID scenario. Following the survey paper in [ 13 ], state-of-the-art Re-ID approaches using RGB–IR-based cross-modal methods can be divided into two categories: non-generative- [ 67 , 68 , 69 , 70 , 71 , 72 , 73 , 74 , 75 , 76 , 77 , 78 , 79 , 80 , 81 ] and generative-based approaches. The former one relies on traditional feature representation [ 67 , 68 , 69 , 70 , 71 , 72 , 73 , 74 , 75 , 76 , 77 , 78 , 79 , 80 , 81 ] and metric learning approaches to maximize the similarities between two images with the same identity and minimize the similarities between two images with different identities, while the latter one depends on the unification of images from different modalities to minimize the data distribution gap between two different modalities.…”

Section: Cross-modal Person Re-identificationmentioning

confidence: 99%

See 1 more Smart Citation

Person Re-Identification with RGB–D and RGB–IR Sensors: A Comprehensive Survey

Uddin

Bhuiyan

Bappee

et al. 2023

Sensors

View full text Add to dashboard Cite

Learning about appearance embedding is of great importance for a variety of different computer-vision applications, which has prompted a surge in person re-identification (Re-ID) papers. The aim of these papers has been to identify an individual over a set of non-overlapping cameras. Despite recent advances in RGB–RGB Re-ID approaches with deep-learning architectures, the approach fails to consistently work well when there are low resolutions in dark conditions. The introduction of different sensors (i.e., RGB–D and infrared (IR)) enables the capture of appearances even in dark conditions. Recently, a lot of research has been dedicated to addressing the issue of finding appearance embedding in dark conditions using different advanced camera sensors. In this paper, we give a comprehensive overview of existing Re-ID approaches that utilize the additional information from different sensor-based methods to address the constraints faced by RGB camera-based person Re-ID systems. Although there are a number of survey papers that consider either the RGB–RGB or Visible-IR scenarios, there are none that consider both RGB–D and RGB–IR. In this paper, we present a detailed taxonomy of the existing approaches along with the existing RGB–D and RGB–IR person Re-ID datasets. Then, we summarize the performance of state-of-the-art methods on several representative RGB–D and RGB–IR datasets. Finally, future directions and current issues are considered for improving the different sensor-based person Re-ID systems.

show abstract

“…Similar Works considered body reconstruction models and distance metric learning [8]. In the last decade, the growth of AI-based approaches has captured the Pe-reID problem and has subsequently proved its potential with exceptionally good recognition accuracies across datasets [9]. However, the results obtained were nowhere close to the requirements of a real-time deployment pipeline.…”

Section: Introductionmentioning

confidence: 99%

Learning Global Average Attention Pooling (GAAP) on Resnet50 Backbone for Person Re-identification Problem

Kanchimani¹,

Maloji²,

Kishore³

2022

IJACSA

View full text Add to dashboard Cite

Person re-identification has been an extremely challenging task in computer vision which has been seen as a success with deep learning approaches. Despite successful models, there are gaps in the form of unbalanced labels, poor resolution, uncertain bounding box annotations, occlusions, and unlabelled datasets. Previous methods applied deep learning approaches based on feature representation, metric learning, and ranking optimization. In this work, we propose Global Average Attention Pooling (GAAP) on Resnet50 applied on four benchmark Re-ID datasets for classification tasks. We also perform an extensive evaluation on the proposed Attention module with different deep learning pipelines as backbone architecture. The four benchmark person Re-ID datasets used is Market-1501, RAiD, Partial-iLIDS, and RPIfield. We computed cumulative matching characteristics (CMC) and mean Average Precision (mAP) as the performance evaluation parameters of the proposed against the state of the art. The results obtained have shown that the added attention layer has improved the overall recognition precision over the baselines.

show abstract

Visible-Infrared Person Re-Identification: A Comprehensive Survey and a New Setting

Cited by 13 publications

References 86 publications

A person re‐identification method for sports event scenes incorporating textual information mining

A person re‐identification method for sports event scenes incorporating textual information mining

Person Re-Identification with RGB–D and RGB–IR Sensors: A Comprehensive Survey

Learning Global Average Attention Pooling (GAAP) on Resnet50 Backbone for Person Re-identification Problem

Contact Info

Product

Resources

About