2021
DOI: 10.1016/j.patcog.2020.107630
|View full text |Cite
|
Sign up to set email alerts
|

Context-aware network for RGB-D salient object detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 69 publications
(96 reference statements)
0
3
0
Order By: Relevance
“…With the flourish in this research direction, other inspiring techniques are also recently employed into the RGB-D SOD task, such as discrepant cross-modality interaction [65], triplet transformer embedding network [66], pure transformer network [67], neural architecture search [68], mutual information minimization [69], specificity-preserving architecture [70], hierarchical cross-modal distillation [21], cross-modal edgeguidance [71], LSTM-based context-aware modules [72]. A relatively complete survey on RGB-D SOD can be found in [22].…”
Section: General Rgb-d Sod Methodsmentioning
confidence: 99%
“…With the flourish in this research direction, other inspiring techniques are also recently employed into the RGB-D SOD task, such as discrepant cross-modality interaction [65], triplet transformer embedding network [66], pure transformer network [67], neural architecture search [68], mutual information minimization [69], specificity-preserving architecture [70], hierarchical cross-modal distillation [21], cross-modal edgeguidance [71], LSTM-based context-aware modules [72]. A relatively complete survey on RGB-D SOD can be found in [22].…”
Section: General Rgb-d Sod Methodsmentioning
confidence: 99%
“…In [34], a saliency detection model is proposed based on the spatial position prior of attractive objects and sparse background features. Some approaches use neural networks and deep learning techniques such as convolution neural networks [35,36,37], sparse deep learning networks [38] Boltzmann machine [39], and ensemble deep neural network [40] for modeling bottom-up attention. In [41], a multiple convolution layers model is proposed to predict eye fixation which uses the end-to-end encoder-decoder network.…”
Section: Related Workmentioning
confidence: 99%
“…This overcame the interference of occlusion and dense crowds partly. Some researchers [ 24 , 25 ] used the symmetric dual-stream network to extract the RGB feature and the depth feature of the image simultaneously. However, it is difficult to acquire the high-quality RGB image feature and depth image feature simultaneously with the symmetric dual-stream network.…”
Section: Introductionmentioning
confidence: 99%