2018
DOI: 10.1109/tip.2018.2818018
|View full text |Cite
|
Sign up to set email alerts
|

Too Far to See? Not Really!—Pedestrian Detection With Scale-Aware Localization Policy

Abstract: A major bottleneck of pedestrian detection lies on the sharp performance deterioration in the presence of small-size pedestrians that are relatively far from the camera. Motivated by the observation that pedestrians of disparate spatial scales exhibit distinct visual appearances, we propose in this paper an active pedestrian detector that explicitly operates over multiple-layer neuronal representations of the input still image. More specifically, convolutional neural nets, such as ResNet and faster R-CNNs, are… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
46
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 80 publications
(46 citation statements)
references
References 51 publications
0
46
0
Order By: Relevance
“…In the recent few years, CNN based methods have achieved great success on object detection and pedestrian detection [53], [4], [31], [59], [1], [2], [3]. At first, CNN [26], [45] is simply acted as the feature extractor for pedestrian detection, which is fed to the shallow classifier.…”
Section: A a Review Of Pedestrian Detectionmentioning
confidence: 99%
“…In the recent few years, CNN based methods have achieved great success on object detection and pedestrian detection [53], [4], [31], [59], [1], [2], [3]. At first, CNN [26], [45] is simply acted as the feature extractor for pedestrian detection, which is fed to the shallow classifier.…”
Section: A a Review Of Pedestrian Detectionmentioning
confidence: 99%
“…Brazil et al [29] implemented a cascaded-phase design with a backbone network of VGG16 for progressive region proposal and pedestrian detection other than independent detection procedures with ROI systems. Zhang et al [30] and Song et al [31] employed ResNet with 50 layers for pedestrian feature extraction and obtained state-of-the-art results. Li et al [32] utilized deep feature (feature maps from ResNet-50) pyramids for multi-resolution feature extraction and realized a competitive accuracy and real-time pedestrian detection on Geforce GTX GPU.…”
Section: Deep Learning-based Pedestrian Detectionmentioning
confidence: 99%
“…Recently, an active detection model (ADM) [24] based on multi-layer feature representations, executes sequences of coordinate transformation actions on a set of initial bounding-box proposals to deliver accurate prediction of pedestrian locations, and achieve a more balanced detection performance for different scale pedestrian instances on the Caltech benchmark. However, the aboved boundingbox based methods inevitably incorporates a large proportion of uncertain background pixels (false positive) to the human pattern, while impels the instances to be predicted as false negatives.…”
Section: Multi-scale Object Detectionmentioning
confidence: 99%
“…It has been revealed in [4,24] that large-scale pedestrian instances typically exhibit dramatically different visual characteristics and internal features from the small-scale ones. For the network, pedestrian instances of different scales should have different responses at distinct feature representation layers.…”
Section: Multi-scale Representationmentioning
confidence: 99%