2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00629
|View full text |Cite
|
Sign up to set email alerts
|

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks

Abstract: Crowd counting has recently attracted increasing interest in computer vision but remains a challenging problem. In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps. The major contributions are four-fold. First, we develop a new trellis architecture that incorporates multiple decoding paths to hierarchically aggregate features at different encoding stages, which improves the representative capability of convolu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
175
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 321 publications
(176 citation statements)
references
References 54 publications
0
175
0
Order By: Relevance
“…Most recently, several methods have focused on incorporating additional cues such as segmentation and semantic priors [61,75], attention [31,54,58], perspective [50], context information respectively [33], multiple-views [70] and multi-scale features [20] into the network. Wang et al [63] introduced a new synthetic dataset and proposed a SSIM based CycleGAN [78] to adapt the synthetic datasets to real world dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Most recently, several methods have focused on incorporating additional cues such as segmentation and semantic priors [61,75], attention [31,54,58], perspective [50], context information respectively [33], multiple-views [70] and multi-scale features [20] into the network. Wang et al [63] introduced a new synthetic dataset and proposed a SSIM based CycleGAN [78] to adapt the synthetic datasets to real world dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Recent approaches like [22,47,48,51,62] have aimed at incorporating various forms of related information like attention [22], semantic priors [51], segmentation [62], inverse attention [48], and hierarchical attention [47] respectively into the network. Other techniques such as [12,23,40,60] leverage features from different layers of the network using different techniques like trellis style encoder decoder [12], explicitly considering perspective [40], context information [23], and multiple views [60]. Crowd Datasets.…”
Section: Related Workmentioning
confidence: 99%
“…Method MAE MSE Idrees et al [10] 315.0 508.0 Zhang et al [59] 277.0 426.0 CMTL et al [43] 252.0 514.0 Switching-CNN [38] 228.0 445.0 Idrees et al [11] 132.0 191.0 Jian et al [12] 113.0 188.0 CG-DRCN (proposed) 112.2 176.3 column network (MCNN) [61], cascaded multi-task learning for crowd counting (CMTL) [43], Switching-CNN [38], CSR-Net [20] and SANet [4] 2 . Furthermore, we also evaluate the proposed method (CG-DRCN) and demonstrate its effectiveness over the other methods.…”
Section: Jhu-crowd Datasetmentioning
confidence: 99%
See 2 more Smart Citations