2019
DOI: 10.1109/tcsvt.2018.2859773
|View full text |Cite
|
Sign up to set email alerts
|

Weakly Supervised Salient Object Detection With Spatiotemporal Cascade Neural Networks

Abstract: Recently, deep learning techniques have substantially boosted the performance of salient object detection in still images. However, the salient object detection in videos by using traditional handcrafted features or deep learning features is not fully investigated, probably due to the lack of sufficient manually labeled video data for saliency modeling, especially for the data-driven deep learning. This paper proposes a novel weakly supervised approach to salient object detection in a video, which can learn a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
40
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 87 publications
(40 citation statements)
references
References 56 publications
(80 reference statements)
0
40
0
Order By: Relevance
“…According to the common concepts used in deep learning methods, firstly, the global framework for each method is described in 2.1, then the deep network in each method is analyzed in 2.2, and finally an overview of the categorization of methods is shown at a functional level in 2.3. As a matter of convenience, the describled methods, are denoted as SCOMd [12], NRF [13], DHSNet [14], OSVOS [15], NLDF [16], LMP [17], SFCN [18], SegFlow [19], LVO [20], WSS [21], SCNN [22], DSS [23], SPD [24], AFNet [25] and CPD [26].…”
Section: Classification Of the State-of-the-art Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…According to the common concepts used in deep learning methods, firstly, the global framework for each method is described in 2.1, then the deep network in each method is analyzed in 2.2, and finally an overview of the categorization of methods is shown at a functional level in 2.3. As a matter of convenience, the describled methods, are denoted as SCOMd [12], NRF [13], DHSNet [14], OSVOS [15], NLDF [16], LMP [17], SFCN [18], SegFlow [19], LVO [20], WSS [21], SCNN [22], DSS [23], SPD [24], AFNet [25] and CPD [26].…”
Section: Classification Of the State-of-the-art Methodsmentioning
confidence: 99%
“…Moving objects attract large attention and thus can be regarded as salient objects in videos. As in the methods[12,22,18], we also use the 30 test videos with the provided ground truth. The DAVIS 2016 dataset[27] is a popular video dataset for video foreground segmentation.It is divided into two splits: the training (30 sequences) part used for training only and the validation (20 sequences) part for the inference.…”
mentioning
confidence: 99%
“…Different from the cascade structure in [48], we propose a two-stream deep network to estimate salient regions. The specific framework are shown in Figure 2, which consists of four components, two network streams without shared parameters, refinement of motion information and attentive module.…”
Section: The Proposed Approachmentioning
confidence: 99%
“…In the temporal stream, we treat the RGB frames and corresponding motion priors as the network input. The temporal prior is generated by [48]. Specifically, the superpixels of the optical flow map are firstly obtained by using [10].…”
Section: The Proposed Approachmentioning
confidence: 99%
See 1 more Smart Citation