2020
DOI: 10.1016/j.image.2020.115976
|View full text |Cite
|
Sign up to set email alerts
|

Saliency detection in human crowd images of different density levels using attention mechanism

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 26 publications
0
2
0
Order By: Relevance
“…Chen et al [ 16 ] argued that ResNet is essentially DenseNet with shared connections, and ResNet implicitly has residual path reuse features, and dense connections that can be mined for new features, based on this joint perspective dual-channel network (DPN) were proposed, by sharing same features of network through dual-path structure; high accuracy is achieved with flexibility, small models, less computation, and low resources. Nguyen et al [ 26 ] proposed MSDENSE-DAT to predict regions in crowd scenes that attract human attention, designed two-branch homography network based on DenseNet201 to extract multiscale features, extracted as many features as possible from original size and half-size images, and cascaded two-branch features along channel axis, and also designed self-attention blocks to emphasize interfeature correlation; MSDENSE-DAT extracts the best features with low density in the crowd.…”
Section: Development Of Densenetmentioning
confidence: 99%
“…Chen et al [ 16 ] argued that ResNet is essentially DenseNet with shared connections, and ResNet implicitly has residual path reuse features, and dense connections that can be mined for new features, based on this joint perspective dual-channel network (DPN) were proposed, by sharing same features of network through dual-path structure; high accuracy is achieved with flexibility, small models, less computation, and low resources. Nguyen et al [ 26 ] proposed MSDENSE-DAT to predict regions in crowd scenes that attract human attention, designed two-branch homography network based on DenseNet201 to extract multiscale features, extracted as many features as possible from original size and half-size images, and cascaded two-branch features along channel axis, and also designed self-attention blocks to emphasize interfeature correlation; MSDENSE-DAT extracts the best features with low density in the crowd.…”
Section: Development Of Densenetmentioning
confidence: 99%
“…e first step of the method is also to collect the raw image dataset, and then, the collected raw images are detected and identified using an existing network model of convolutional neural networks with high accuracy, where the dataset is trained using the existing VOC2012 dataset. e convolutional neural network algorithm is used to frame out the pedestrians in the image and calculate their confidence level, the confidence interval of this system is set to be greater than 50%, but most of the framed pedestrian targets in the image have a confidence level of less than 50%, so there is severe occlusion between these targets and they are unidentifiable, but in reality, the targets are present, so the accuracy of this solution is not high [34][35][36][37][38][39][40]. However, when combined with the actual situation and the observation of multiple images, it is easy to see that there is a minimal occlusion in the pedestrian head region, so this paper uses head recognition to detect pedestrians.…”
Section: Classification Of Density Of People and System Flowmentioning
confidence: 99%