2021
DOI: 10.1016/j.eswa.2021.115071
|View full text |Cite
|
Sign up to set email alerts
|

A cross-modal fusion based approach with scale-aware deep representation for RGB-D crowd counting and density estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(7 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…The idea of this group's methods, such as [39,61,64,67] in its most simplistic form is to obtain a density map from an image and then integrate it in order to get the estimation of people in the image. Contrary to the previous approaches, these also consider the spatial information.…”
Section: Density Based Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…The idea of this group's methods, such as [39,61,64,67] in its most simplistic form is to obtain a density map from an image and then integrate it in order to get the estimation of people in the image. Contrary to the previous approaches, these also consider the spatial information.…”
Section: Density Based Approachesmentioning
confidence: 99%
“…In recent surveys [48,11] authors classify CNN-based approaches into four categories, based on the property of the networks: Basic CNNs include networks with basic CNN layers and represent initial deep learning approaches for crowd counting [14,35,56,58,67], scale-aware models that leverage multi-column or multi-resolution architectures to achieve scale robustness [3,25,37,68], context-aware models that incorporate global and local contextual information to improve performance [45,46], and multi-task frameworks that combine crowd counting with tasks such as crowd velocity estimation, etc. [2,47,66,70] Based on the inference methodology, they also classify them into patch-based, where models are trained using patches from the image and the inference is done using sliding window approach [2,3,14,25,35,37,56,58,66,70], and whole image-based [45,46,47,68,60].…”
Section: Density Based Approachesmentioning
confidence: 99%
“…Since there are few RGBD crowd counting datasets currently, there is not much work to complete crowd counting based on RGBD images [54,55]. In these studies, depth information usually provides prior knowledge of head position for RGB image segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…There are currently methods that utilize adaptive Gaussian kernels to generate high-quality density maps [ 18 , 19 ]. High-quality density maps train more robust regression networks, providing prior knowledge for crowd detection that is closer to the actual distribution of crowds [ 20 ]. One of the reasons that previous detection methods cannot detect small heads is due to the lack of scale perceptron or the limitation of its own structure.…”
Section: Introductionmentioning
confidence: 99%