2010
DOI: 10.1007/s11263-010-0354-6
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

Abstract: In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulusdriven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multiscale wavelet decomposition and unbiased feature competition; while a task-related component simulates the highlev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
74
0
1

Year Published

2011
2011
2018
2018

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 132 publications
(75 citation statements)
references
References 27 publications
0
74
0
1
Order By: Relevance
“…5) from left to right and top to bottom. They can be categorized into: orientation pop-out (3,9,21,25,38,43,51,54), texture pop-out (6,12,14,24,36,39,47), curvature popout (35,48), size pop-out (8,10,17,30,52), grouping (2,13,26,28,34), color pop-out (1,4,16,19,20,27,29,31,32,33,41,44,50,53), intensity pop-out (11,18,37,42), search asymmetry (5;15, 22;46, 40;49), and other complex search arrays (7,23). In some patterns, targets are embedded in noise (e.g., speckle noise: 11, 20, 31 and orientation noise: 19, 41).…”
Section: B Stimulimentioning
confidence: 99%
“…5) from left to right and top to bottom. They can be categorized into: orientation pop-out (3,9,21,25,38,43,51,54), texture pop-out (6,12,14,24,36,39,47), curvature popout (35,48), size pop-out (8,10,17,30,52), grouping (2,13,26,28,34), color pop-out (1,4,16,19,20,27,29,31,32,33,41,44,50,53), intensity pop-out (11,18,37,42), search asymmetry (5;15, 22;46, 40;49), and other complex search arrays (7,23). In some patterns, targets are embedded in noise (e.g., speckle noise: 11, 20, 31 and orientation noise: 19, 41).…”
Section: B Stimulimentioning
confidence: 99%
“…The knowledge-based models have a potential to apply various machine learning techniques. For instance, Li et al [123] introduced multi-task learning to simulate the conjunction search (cf. Sect.…”
Section: Discussionmentioning
confidence: 99%
“…Vig et al [9] used 3D spatio-temporal volumes from video for spatiotemporal saliency modeling. Li et al [10] proposed a multi-tasking Bayesian approach for combining bottomup and top-down saliency components. Kimura et al [11] learned a Dynamic Bayesian Network (DBN) to predict the likelihood of locations where humans typically focus on a video scene.…”
Section: A Bottom-up (Bu) Modelsmentioning
confidence: 99%