2022
DOI: 10.1109/tpami.2021.3124956
|View full text |Cite
|
Sign up to set email alerts
|

Locating and Counting Heads in Crowds With a Depth Prior

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(7 citation statements)
references
References 79 publications
0
7
0
Order By: Relevance
“…Since there are few RGBD crowd counting datasets currently, there is not much work to complete crowd counting based on RGBD images [54,55]. In these studies, depth information usually provides prior knowledge of head position for RGB image segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Since there are few RGBD crowd counting datasets currently, there is not much work to complete crowd counting based on RGBD images [54,55]. In these studies, depth information usually provides prior knowledge of head position for RGB image segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Then, in order to create the dual attention map, we aggregated the FoV attention map and depth attention map as given in (7). After that, to enhanced the saliency estimation we aggregated this dual attention map with hypothetical gaze distribution G (object channel) to create hypo dual attention map as in (8), where ⊗ denotes the element wise product.…”
Section: ) Depth Range Selectormentioning
confidence: 99%
“…Several solutions have introduced for customer behaviour analyzis in retail using the rapid developments in computer vision technology. For instance, counting the number of people and detecting the hot spots in retail [6] and public [7], and tracking shoppers' emotion [5] are such applications. However, the existing solutions only captures coarse touch-points of a shopper's journey and vulnerable to unconstrained environment settings.…”
Section: Introductionmentioning
confidence: 99%
“…Song et al [53] use the Kinect sensor to obtain depth images and propose a depth region suggestion network for counting. Lian et al [54], [55] propose a strategy to generate crowd depth-adaptive density maps, and a real-world RGB-D crowd counting dataset called Shang-haiTechRGBD and a synthetic dataset ShanghaiTechRGBDsyn are collected. Yang et al [56] propose a Bidirectional Cross-modal Attention (BCA) mechanism to focus on crowded regions in images though depth information.…”
Section: B Multi-modal Crowd Countingmentioning
confidence: 99%