2020
DOI: 10.1109/tie.2019.2956418
|View full text |Cite
|
Sign up to set email alerts
|

Salient Object Detection by Spatiotemporal and Semantic Features in Real-Time Video Processing Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
4
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 64 publications
0
4
0
Order By: Relevance
“…In context embedding object detect networks, backbone features are attached to tree parallel branches with dilation sizes of 3, 6 and 12 to form the context embedding module and to incorporate surrounding information. Fang [21] fused the semantic object feature extraction module (Conv2dNet), the spatiotemporal feature extraction module (Conv3DNet) and the saliency feature-sharing module to generate the final saliency map for real-time video processing. Wang [22] combined dual-branch feature extraction and gradually refined the cross-fusion module in the network for camouflaged object detection.…”
Section: Related Workmentioning
confidence: 99%
“…In context embedding object detect networks, backbone features are attached to tree parallel branches with dilation sizes of 3, 6 and 12 to form the context embedding module and to incorporate surrounding information. Fang [21] fused the semantic object feature extraction module (Conv2dNet), the spatiotemporal feature extraction module (Conv3DNet) and the saliency feature-sharing module to generate the final saliency map for real-time video processing. Wang [22] combined dual-branch feature extraction and gradually refined the cross-fusion module in the network for camouflaged object detection.…”
Section: Related Workmentioning
confidence: 99%
“…High-efficiency video coding (HEVC) [1] is the latest video coding standard that was published by ISO/IEC MPEG, and ITU-T VCEG formed the Joint Collaborative Team on Video Coding (JCT-VC) in 2013, which has a high efficiency to compress video. HEVC is adapted to the transmission and storage from small-scale multimedia networks to large scale TV distributors and thus has been widely used in daily life [2][3][4][5]. Video contains an enormous amount of information including private, sensitive and copyright items [6][7][8], which would be easily leaked in an unreliable public channel and the insecurity of the cloud service.…”
Section: Introductionmentioning
confidence: 99%
“…individual RGB/color images [25]- [30] or sequences [31]- [35]. As depth cameras, such as Kinect and RealSense, become more and more popular, SOD from RGB-D inputs ("D" refers to depth) is emerging as an attractive research topic.…”
Section: Introductionmentioning
confidence: 99%