2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00381
|View full text |Cite
|
Sign up to set email alerts
|

Divide and Grow: Capturing Huge Diversity in Crowd Images with Incrementally Growing CNN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
143
0
2

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 206 publications
(145 citation statements)
references
References 14 publications
0
143
0
2
Order By: Relevance
“…A Spatial Softmax function is applied at the End of Upsampler, which constrains the sum of upsampling weights in each 2 × 2 adjacent regions to be 1 and ensures consistent local count values in the same image area after upsampling. The final output channel is 1 for R-Counter and class num for C-Counter…”
Section: Classification-based Counter (C-counter)mentioning
confidence: 99%
“…A Spatial Softmax function is applied at the End of Upsampler, which constrains the sum of upsampling weights in each 2 × 2 adjacent regions to be 1 and ensures consistent local count values in the same image area after upsampling. The final output channel is 1 for R-Counter and class num for C-Counter…”
Section: Classification-based Counter (C-counter)mentioning
confidence: 99%
“…While the these methods build techniques that are robust to scale variations, more recent methods have focused on other aspects such as progressively increasing the capacity of the network based on dataset [3], use of adversarial loss to reduce blurry effects in the predicted output maps [49,56], learning generalizable features via deep negative correlation based learning [51], leveraging unlabeled data for counting by introducing a learning to rank framework [34], cascaded feature fusion [43] and scale-based feature aggregation [7], weakly-supervised learning for crowd counting [58]. Recently, Idrees et al [19] created a new large-scale high-density crowd dataset with approximately 1.25 million head annotations and a new localization task for crowded images.…”
Section: Related Workmentioning
confidence: 99%
“…In addition to multi-column networks, there are a lot of methods to improve scale invariance of feature learning by 1) studying on the fusion of multi-scale features [35,57,62,63], 2) studying on multiblob based scale aggregation networks [7,64], 3) designing scaleinvariant convolutional or pooling layers [21,30,33,56,62], and 4) studying on automated scale adaptive networks [48,49,66]. On the other hand, a lot of studies devote to using perspective maps [52], geometric constraints [34,68], and region-of-interest [33] to further improve the counting accuracy.…”
Section: Cnn-based Methodsmentioning
confidence: 99%