2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00319
|View full text |Cite
|
Sign up to set email alerts
|

Towards A Universal Model for Cross-Dataset Crowd Counting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 45 publications
(15 citation statements)
references
References 44 publications
0
13
0
Order By: Relevance
“…Lin et al [45] further improve the loss function based on Sinkhorn distance. More improvements such as incorporating perspective information [41], [46], auxiliary task [34], [47], cross-datasets training [48], [49] and neural architecture search [50] further promote the counting performance. However, as revealed in [51], [52], designing powerful deep architectures remains an active topic in crowd counting.…”
Section: A Crowd Countingmentioning
confidence: 99%
“…Lin et al [45] further improve the loss function based on Sinkhorn distance. More improvements such as incorporating perspective information [41], [46], auxiliary task [34], [47], cross-datasets training [48], [49] and neural architecture search [50] further promote the counting performance. However, as revealed in [51], [52], designing powerful deep architectures remains an active topic in crowd counting.…”
Section: A Crowd Countingmentioning
confidence: 99%
“…Object counting strives to estimate the total number of objects dispersing across still images [35] or dynamic video sequences [25]. It has increasingly drawn attention from computer vision community, thanks to its wide spread societal applications, e.g., social distance monitoring [28], traffic surveillance [48], counting in agriculture [24] and metropolis crowd management [26].…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, typical counting methods [20,21,41,50] utilize the Convolution Neural Network (CNN) as backbone and regress density map to predict the total crowd count. However, due to the wide viewing angle of cameras and the 2D perspective projection, large-scale variations often exist in crowd images.…”
Section: Introductionmentioning
confidence: 99%