2023
DOI: 10.1007/s41095-022-0313-5
|View full text |Cite
|
Sign up to set email alerts
|

DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting

Abstract: Crowd counting provides an important foundation for public security and urban management. Due to the existence of small targets and large density variations in crowd images, crowd counting is a challenging task. Mainstream methods usually apply convolution neural networks (CNNs) to regress a density map, which requires annotations of individual persons and counts. Weakly-supervised methods can avoid detailed labeling and only require counts as annotations of images, but existing methods fail to achieve satisfa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 53 publications
0
2
0
Order By: Relevance
“…These results have undergone rigorous scrutiny across the test, train, and validation domains. Importantly, our proposed model, in contrast to its multi-tasked counterparts such as CMTL [78], DSSI-Net [79], SANet [60], CG-DRCN [80], MBTTBF [36], SFCN [46], DTCC [81], MCNN [24], CSRNet [33], CAN [25] operates as a single-task model. This distinction alleviates the necessity of generating density maps, resulting in a reduction of computational complexity.…”
Section: Performance Generalization and Comparisonmentioning
confidence: 99%
“…These results have undergone rigorous scrutiny across the test, train, and validation domains. Importantly, our proposed model, in contrast to its multi-tasked counterparts such as CMTL [78], DSSI-Net [79], SANet [60], CG-DRCN [80], MBTTBF [36], SFCN [46], DTCC [81], MCNN [24], CSRNet [33], CAN [25] operates as a single-task model. This distinction alleviates the necessity of generating density maps, resulting in a reduction of computational complexity.…”
Section: Performance Generalization and Comparisonmentioning
confidence: 99%
“…For example, the benchmarking ShanghaiTech A dataset [56] includes 482 images with average 1000+ pedestrians for each image. Therefore, weak/semi-supervised approaches [10,29,30,59] are proposed to address this issue. The strategy of weak-supervised solution is adapting small-sample approach such as the Transformer [10].…”
Section: Weak/semi-supervised Solutionsmentioning
confidence: 99%