2022
DOI: 10.1016/j.imavis.2022.104549
|View full text |Cite
|
Sign up to set email alerts
|

SiaTrans: Siamese transformer network for RGB-D salient object detection with depth image classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…The CNN model possesses translation invariance and locality, which have been proven beneficial for extracting local spatial information. In addition, RGB data are generally more informative than depth data [48,49]. Therefore, we argue that it is unnecessary to use a large Transformer-based complex network like PVTv2 to process depth data.…”
Section: Encoder Of Depth Channelmentioning
confidence: 99%
“…The CNN model possesses translation invariance and locality, which have been proven beneficial for extracting local spatial information. In addition, RGB data are generally more informative than depth data [48,49]. Therefore, we argue that it is unnecessary to use a large Transformer-based complex network like PVTv2 to process depth data.…”
Section: Encoder Of Depth Channelmentioning
confidence: 99%
“…Recently, Jia et al [43] presented an all-in-one salient object detection model, which can process RGB SOD tasks, RGB-T SOD tasks, and RGB-D SOD tasks by using one model. Besides, in some other computer vision fields, modality unified frameworks have also aroused the interest of researchers and made impressive progress [44], [45].…”
Section: Modality Unified Sodmentioning
confidence: 99%
“…Recently, Jia et al [27] proposed an all-in-one SOD model, namely AiOSOD. This model can detect salient objects from three types of data (RGB, RGB-D, and RGB-T) by using one model with the same weight parameters.…”
Section: B Am Sodmentioning
confidence: 99%
“…Identifying salient areas in an image can facilitate subsequent advanced visual tasks, enhancing efficiency and resource management and improving performance (Gupta et al, 2020). Thus, SOD can help filter irrelevant backgrounds, and SOD plays a significant pre-processing role in computer vision applications, providing important basic processing for these applications, e.g., segmentation (Donoser et al, 2009;Qin et al, 2014;Noh et al, 2015;Fu et al, 2017;Shelhamer et al, 2017), classification (Borji and Itti, 2011;Joseph et al, 2019;Akila et al, 2021;Liu et al, 2021;Jia et al, 2022;Ma and Yang, 2023), tracking (Frintrop and Kessel, 2009;Su et al, 2014;Ma et al, 2017;Lee and Kim, 2018;Chen et al, 2019), etc.…”
Section: Introductionmentioning
confidence: 99%