Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413991
|View full text |Cite
|
Sign up to set email alerts
|

Bottom-Up Foreground-Aware Feature Fusion for Person Search

Abstract: The key to efficient person search is jointly localizing pedestrians and learning discriminative representation for person re-identification (re-ID). Some recently developed task-joint models are built with separate detection and re-ID branches on top of shared region feature extraction networks, where the large receptive field of neurons leads to background information redundancy for the following re-ID task. Our diagnostic analysis indicates the task-joint model suffers from considerable performance drop whe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 35 publications
0
2
0
Order By: Relevance
“…After years of development, the Faster R-CNN [ 16 ] based target detection algorithm can now achieve high performance while also slowing down detection. Some algorithms for pedestrian retrieval [ 17 ] with deeply integrated networks [ 18 ] have achieved reasonable performance. For the one-step approach, they designed a multi-tasking framework [ 19 ] based on Faster R-CNN, establishing a regional proposal network (RPN) to generate region proposal [ 20 ] and then input it into subsequent parallel detection and re-ID branches.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…After years of development, the Faster R-CNN [ 16 ] based target detection algorithm can now achieve high performance while also slowing down detection. Some algorithms for pedestrian retrieval [ 17 ] with deeply integrated networks [ 18 ] have achieved reasonable performance. For the one-step approach, they designed a multi-tasking framework [ 19 ] based on Faster R-CNN, establishing a regional proposal network (RPN) to generate region proposal [ 20 ] and then input it into subsequent parallel detection and re-ID branches.…”
Section: Introductionmentioning
confidence: 99%
“…The first part does not contain residual blocks, which are mainly used for the calculation of convolution, regularization, activation function, and maximum pooling of the input, and the second, third, fourth, and fifth parts of the structure contain residual blocks that do not change the size of the residual blocks but are only used to change the dimensionality of the residual blocks. ResNet-50 is a residual network with lower complexity, more stable performance [ 18 ], and faster convergence compared to VGG 16, and it is suitable for many projects with more accurate results in image classification, target detection, and natural language processing. First, we replaced the general convolution in ResNet-50 with inception convolution in Seq-Net, dynamically enhancing the receptive field of feature diagrams [ 20 , 21 ] without increasing computation or degrading feature diagram resolution.…”
Section: Introductionmentioning
confidence: 99%