2010 IEEE International Conference on Image Processing 2010
DOI: 10.1109/icip.2010.5651381
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-temporal combination of saliency maps and eye-tracking assessment of different strategies

Abstract: The modeling of the human visual attention into a computational attention model leads to the split of visual features into several independent channels. Then, a difficult problem arises to combine these maps, having different dynamic ranges or distribution. When several maps are considered, such process is mandatory in order to compute a single measure of interest for each location, regardless of which features contributed to the salience. Several strategies of cue combination are proposed in this paper for th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 29 publications
(12 citation statements)
references
References 6 publications
0
12
0
Order By: Relevance
“…Due to these limitations, these models do not perform well on new and streaming real-world data. Some of these models have been compared against human eye tracking data [3,7,[9][10]. As expected, there is low correlation between human fixation and models.…”
Section: Introductionmentioning
confidence: 92%
“…Due to these limitations, these models do not perform well on new and streaming real-world data. Some of these models have been compared against human eye tracking data [3,7,[9][10]. As expected, there is low correlation between human fixation and models.…”
Section: Introductionmentioning
confidence: 92%
“…Different fusion strategies can be utilized [12], such as additive fusion, multiplicative fusion and maximum fusion. In our DCVS codec, we simply adopt the maximum fusion to obtain the spatiotemporal saliency (i.e., S(f SI ) = max(S s (f SI ), S t (f SI ))) because this fusion method can adequately detect most of the salient regions in the SI frame and circumvent the difficulty of determining the fusing weights for spatial and temporal saliency.…”
Section: B Perceptually-aware Bcs For a Non-key Framementioning
confidence: 99%
“…Dealing with different sources of information, their correct combination is a great challenge [7]. In this work we propose to fuse several conspicuity maps considering the human visual characteristics.…”
Section: Neural Network For the Combination Of Featuresmentioning
confidence: 99%