2020 IEEE Winter Conference on Applications of Computer Vision (WACV) 2020
DOI: 10.1109/wacv45572.2020.9093386
|View full text |Cite
|
Sign up to set email alerts
|

Going Beyond the Regression Paradigm with Accurate Dot Prediction for Dense Crowds

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 22 publications
0
9
0
Order By: Relevance
“…In [1], bounding boxes around point annotations were initialized using their distance to the nearest neighbour head and while training a CNN to output box annotations, updated bounding boxes around the heads to obtain the most suitable size in the anchor box set for the corresponding head. In [20], two scales of the input image feature maps (one-fourth and one-eighth) were used depending on the sparsity of the crowd to output a point map. Crowd counting was also cast as a classification task of dot prediction with point supervision, dropping the prevalent density regression.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In [1], bounding boxes around point annotations were initialized using their distance to the nearest neighbour head and while training a CNN to output box annotations, updated bounding boxes around the heads to obtain the most suitable size in the anchor box set for the corresponding head. In [20], two scales of the input image feature maps (one-fourth and one-eighth) were used depending on the sparsity of the crowd to output a point map. Crowd counting was also cast as a classification task of dot prediction with point supervision, dropping the prevalent density regression.…”
Section: Related Workmentioning
confidence: 99%
“…A threshold was then applied on this map to generate the final accurate dot predictions. Therefore in [20], the fusion module was not learnable and the count highly depended on the accuracy of the threshold value which might be different across different images.…”
Section: Related Workmentioning
confidence: 99%
“…Liu et al [12] propose a curriculum learning strategy to generate pseudo box-level labels from point annotations. Sam et al [35] train a pixel-wise binary classifier to detect people instead of regressing local crowd density, which proposes a novel multi-scale architecture incorporating top-down feedback to address the scale variation. Furthermore, they propose the upgraded version, LSC-CNN [13], which is a tailor-made dense object detection method that predicts position, size for each head simultaneously only with point-level annotations.…”
Section: Crowd Localizationmentioning
confidence: 99%
“…Recently, crowd localization methods have been developed for dense crowds, like [2,17,18], showing that those methods can localize crowds as well as obtain a comparable counting result. The localization process is to produce the keypoints [2,19], blobs [17], or bounding boxes [18,20,21] on each head (see Fig. 1 (a)), and 978-1-6654-4989-2/21/$31.00 ©2021 IEEE Fig.…”
Section: (B) Congested Scene Recognition Networkmentioning
confidence: 99%