2022
DOI: 10.1109/access.2022.3150988
|View full text |Cite
|
Sign up to set email alerts
|

Occlusion Handling and Multi-Scale Pedestrian Detection Based on Deep Learning: A Review

Abstract: Pedestrian detection is an important branch of computer vision, and it has important applications in the fields of autonomous driving, artificial intelligence and video surveillance. With the rapid development of deep learning and the proposal of large-scale datasets, pedestrian detection has reached a new stage and achieves better performance. However, the performance of state-of-the-art methods is far behind the expectation, especially when occlusion and scale variance exist. Therefore, a lot of works focuse… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(11 citation statements)
references
References 115 publications
0
6
0
Order By: Relevance
“…Spinning radar focuses on CNNs to encode the representation. Similar to the occlusion issue in computer vision, most models try to generate a mask or weight function to deal with the noisy and potentially faulty radar data [145].…”
Section: Discussion and Future Researchmentioning
confidence: 99%
See 1 more Smart Citation
“…Spinning radar focuses on CNNs to encode the representation. Similar to the occlusion issue in computer vision, most models try to generate a mask or weight function to deal with the noisy and potentially faulty radar data [145].…”
Section: Discussion and Future Researchmentioning
confidence: 99%
“…Although state-of-the-art deep learning is used, the number of papers published for radar odometry and localization is far below the number of pages published for object detection and segmentation for radars [5,9]. Similar to [107], the results from segmentation papers [145][146][147] can be used to mask out irrelevant detections and areas from scans that do not contribute to the odometry or localization of the system. In a complete pipeline, segmentation, classification, odometry, and ego-localization should be performed in an end-to-end approach, where the results from one module are integrated, e.g.…”
Section: Discussion and Future Researchmentioning
confidence: 99%
“…The proposed technique was used to compare the social force model of [58], sparse representation of [55], and mixture of dynamic textures method of [40], [59], [60]. The performance is assessed using standard performance evaluation metrics.…”
Section: Suspicious Activity Recognitionmentioning
confidence: 99%
“…In addition, when the detection confidence score is less than the threshold (ID = 1 at t1) or the worker is in unusual behavior/posture variation [11] (ID = 2 at t3), the bounding box may be missed. Previous studies' successes were owing to the development of more complicated networks or having extensively re-trained on new datasets; however, their effectiveness in addressing the challenges of complex scenarios was limited [12] by having to manually label new datasets [13], re-train entire neural networks [14,15], or switch to offline features [16]. Regardless of the chosen approaches, they primarily relied on the convolutional neural network (CNN) [10] outputs.…”
Section: Introductionmentioning
confidence: 99%