2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00545
|View full text |Cite
|
Sign up to set email alerts
|

DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation

Abstract: In real-world crowd counting applications, the crowd densities vary greatly in spatial and temporal domains. A detection based counting method will estimate crowds accurately in low density scenes, while its reliability in congested areas is downgraded. A regression based approach, on the other hand, captures the general density information in crowded regions. Without knowing the location of each person, it tends to overestimate the count in low density areas. Thus, exclusively using either one of them is not … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
251
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
4
2

Relationship

1
9

Authors

Journals

citations
Cited by 352 publications
(253 citation statements)
references
References 42 publications
2
251
0
Order By: Relevance
“…Two examples are shown in Figure 1. Similar to [3,19,20], we observe that dense-crowd regions are usually underestimated, while sparse-crowd regions are overestimated. Such phenomenon is due to two main factors.…”
Section: Introductionsupporting
confidence: 60%
“…Two examples are shown in Figure 1. Similar to [3,19,20], we observe that dense-crowd regions are usually underestimated, while sparse-crowd regions are overestimated. Such phenomenon is due to two main factors.…”
Section: Introductionsupporting
confidence: 60%
“…The influence of Cmax to SS-DCNet on the ShanghaiTech Part_A dataset [53]. The numbers in the brackets denote quantiles of the training set, for example, 22 (95%) means the 95% quantile is 22. 'VGG16 Encoder' is the classification baseline without S-DC. 'One-Linear' and 'Two-Linear' are defined in Section 6.1.1.…”
Section: How Many Times To Divide?mentioning
confidence: 99%
“…Sam et al [50] introduce a switching structure, which uses a classifier to assign input image patches to best column structures. Recently, Liu et al [32] propose a multi-column network to simultaneously estimate crowd density by detection and regression models. Ranjan et al [44] employ a two-column structure to iterative train their model with different resolution images.…”
Section: Cnn-based Methodsmentioning
confidence: 99%