DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting

Zhang, Miao; Zhang, Yong; Yuan, Peng; Peng, Haocheng; Yin, Baocai

doi:10.1007/s41095-022-0313-5

Cited by 4 publications

(2 citation statements)

References 53 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These results have undergone rigorous scrutiny across the test, train, and validation domains. Importantly, our proposed model, in contrast to its multi-tasked counterparts such as CMTL [78], DSSI-Net [79], SANet [60], CG-DRCN [80], MBTTBF [36], SFCN [46], DTCC [81], MCNN [24], CSRNet [33], CAN [25] operates as a single-task model. This distinction alleviates the necessity of generating density maps, resulting in a reduction of computational complexity.…”

Section: Performance Generalization and Comparisonmentioning

confidence: 99%

ED-DKCNN: Advancing Supervised Crowd Counting in Complex Environments

Tomar,

Kumar,

Nijhawan

et al. 2024

Preprint

View full text Add to dashboard Cite

The increasing urban population has led to challenges in managing crowd dynamics, especially preventing tragic incidents like stampedes. Real-time, accurate crowd counting faces obstacles such as background clutter and perspective variations. The study accepts these challenges of supervised crowd counting by examining the effectiveness of convolutional arrangements in improving accuracy by using an encoder-decoder dynamic convolutional neural network * (ED-DKCNN). The combined segmented, edge-oriented data and texture-rich features are the input for the model capable of precise crowd counting, achievable even in complex scenarios with occlusions and dense crowds. It explores low and high-level crowd features, addresses occlusion and uneven crowd distribution, and utilizes deep mining and dense complementarity for optimized people counting without density map estimation. The proposed framework harnesses intra-and inter-depth information representation through a non-increasing-order kernel arrangement, achieving state-of-the-art accuracy in people counting compared to existing methods across various datasets. The extensive experiments over free and surveillance category datasets through multiple evaluation criteria firmly * https://link.springer.com/article/10. establish the proposed ED-DKCNN model as a state-of-the-art performer in this domain. Moreover, the proposed model significantly advances crowd-counting methodologies, offering potential applications in multi-modal data integration, real-time scenarios, privacy concerns, edge computing, cross-domain situations, and human behavior.

show abstract

Section: Performance Generalization and Comparisonmentioning

confidence: 99%

ED-DKCNN: Advancing Supervised Crowd Counting in Complex Environments

Tomar,

Kumar,

Nijhawan

et al. 2024

Preprint

View full text Add to dashboard Cite

show abstract

“…For example, the benchmarking ShanghaiTech A dataset [56] includes 482 images with average 1000+ pedestrians for each image. Therefore, weak/semi-supervised approaches [10,29,30,59] are proposed to address this issue. The strategy of weak-supervised solution is adapting small-sample approach such as the Transformer [10].…”

Section: Weak/semi-supervised Solutionsmentioning

confidence: 99%

A Survey on Regression-Based Crowd Counting Techniques

Hao,

Du,

Mao

et al. 2023

ITC

View full text Add to dashboard Cite

Traditional detect and count strategy can’t well handle the extremely crowded footage in computer vision-based counting task. In recent years, deep learning approaches have been widely explored to tackle this challenge. By regressing visual features to density map, the total crowd number can be predicted while avoids the detection of their actual positions. Efforts of improving performance distribute at various phases of the detecting pipeline, such as feature extraction and eliminating deviation of regressed density map etc. In this article, we conduct a thorough review on the most representative and state-of-the-art techniques. The efforts are systematically categorized into three topics: the evolving of front-end network, the handling of unbalanced density map prediction, and the selection of loss function. After the evaluation of most significant techniques, innovations of the state-of-the-art are inspected in detail to analyze specific reasons to achieve high performances. As conclusion, possible directions of enhancement are discussed to provide insights of future research.

show abstract

K-fold matching model for crowd behavioral anomaly detection from discontinuous inputs

Ye,

Wang,

2024

Expert Systems with Applications

View full text Add to dashboard Cite

DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting

Cited by 4 publications

References 53 publications

ED-DKCNN: Advancing Supervised Crowd Counting in Complex Environments

ED-DKCNN: Advancing Supervised Crowd Counting in Complex Environments

A Survey on Regression-Based Crowd Counting Techniques

K-fold matching model for crowd behavioral anomaly detection from discontinuous inputs

Contact Info

Product

Resources

About