2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.620
|View full text |Cite
|
Sign up to set email alerts
|

End-to-End Saliency Mapping via Probability Distribution Prediction

Abstract: Most saliency estimation methods aim to explicitly model low-level conspicuity cues such as edges or blobs and may additionally incorporate top-down cues using face or text detection. Data-driven methods for training saliency models using eye-fixation data are increasingly popular, particularly with the introduction of large-scale datasets and deep architectures. However, current methods in this latter paradigm use loss functions designed for classification or regression tasks whereas saliency estimation is ev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
100
0
2

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 133 publications
(106 citation statements)
references
References 35 publications
(57 reference statements)
0
100
0
2
Order By: Relevance
“…As for loss function, most of the existing DCNN-based saliency models directly use the typical pixel-wise classification or regression loss functions whereas saliency prediction is evaluated on the whole saliency maps. In [27], Jetley et al propose to use loss functions based on statistical distances with softmax normalization for training saliency models. Their results demonstrate the improvement by considering saliency maps as probability distributions.…”
Section: A Deep Learning-based Visual Saliency Predictionmentioning
confidence: 99%
See 1 more Smart Citation
“…As for loss function, most of the existing DCNN-based saliency models directly use the typical pixel-wise classification or regression loss functions whereas saliency prediction is evaluated on the whole saliency maps. In [27], Jetley et al propose to use loss functions based on statistical distances with softmax normalization for training saliency models. Their results demonstrate the improvement by considering saliency maps as probability distributions.…”
Section: A Deep Learning-based Visual Saliency Predictionmentioning
confidence: 99%
“…In order to convert the predicted saliency map and its corresponding ground-truth into probability distributions, a normalization method should be applied first. Here, we improve the existing method [27] by replacing their softmax normalization with a simple linear regularization.…”
Section: Loss Functionmentioning
confidence: 99%
“…Visual Saliency: The early CNN-based approaches for saliency were based on the adaptation of pretrained CNN models for visual recognition tasks [39,58]. Later, in [45] both shallow and deep CNN were trained end-to-end for saliency prediction while [28,29] trained the networks by optimizing common saliency evaluation metrics. In [44] the authors employed end-to-end Generative Adversarial Networks (GAN), while [62] has utilized multi-level saliency information from different layer through skip connections.…”
Section: Related Workmentioning
confidence: 99%
“…Many CNN architectures have been proposed in the area of image classification [25,35,12,13,49], where a deep CNN model is able to achieve a higher accuracy for classification 1 . However, CNN models used in state-ofthe-art saliency applications are relatively shallow, such as VGGNet-16 [27,41,8,16,26,37,31] or ResNet-50 [29,9]. In the work of [14], the deeper model, GoogleNet, did not achieve better performance due to the limited training set.…”
Section: Introductionmentioning
confidence: 99%