2022
DOI: 10.1109/tcsvt.2021.3127562
|View full text |Cite
|
Sign up to set email alerts
|

Bilateral Temporal Re-Aggregation for Weakly-Supervised Video Object Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 64 publications
0
2
0
Order By: Relevance
“…Therefore, the phenomenon of inconsistent segmentation results inside the object is easy to occur. Motivated by knowledge distillation methods [26]- [28], [60], [61], which are able to increase the accuracy of multi tasks via transferring the specific knowledge from each single task or increase the accuracy of small student network via transferring context information from large teacher model, a consistent constraint module is proposed to enrich the long-range dependency among pixels within non-key frame by distilling it from the feature obtained from key frame segmentation. It could promote semantic consistency of nonkey frame, and do not add any computing burden at the same time.…”
Section: B Local Attention Based Modulementioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, the phenomenon of inconsistent segmentation results inside the object is easy to occur. Motivated by knowledge distillation methods [26]- [28], [60], [61], which are able to increase the accuracy of multi tasks via transferring the specific knowledge from each single task or increase the accuracy of small student network via transferring context information from large teacher model, a consistent constraint module is proposed to enrich the long-range dependency among pixels within non-key frame by distilling it from the feature obtained from key frame segmentation. It could promote semantic consistency of nonkey frame, and do not add any computing burden at the same time.…”
Section: B Local Attention Based Modulementioning
confidence: 99%
“…It could promote semantic consistency of nonkey frame, and do not add any computing burden at the same time. Moreover, with the aim of transferring context knowledge, instead of aligning feature maps directly [61], we use the pair-wise similarity among pixels as knowledge.…”
Section: B Local Attention Based Modulementioning
confidence: 99%
“…Zhao et al [67] proposed the first weakly supervised video salient object detection model based on "fixation guided scribble annotations". And some methods used weakly-supervised approaches to video object segmentation by fusing information between different frames [68]- [70]. In contrast, Zhou et al [71] relied only on the current frame image and the corresponding optical flow data to achieve the zero-shot video object segmentation.…”
Section: B Weakly Supervised Salient Object Detectionmentioning
confidence: 99%
“…With the recent success of deep learning, deep metric learning (DML) methods have demonstrated strong ability in various tasks (Ge et al 2021;Liu et al 2021;Peng et al 2021;Lin et al 2021;Wang et al 2020b), such as semantic search (Huang et al 2020;Li et al 2021b;Min et al 2020a,b) and face recognition (Li et al 2021a). Most existing approaches (Roth, Brattoli, and Ommer 2019;Wu et al 2017) take as input a sample (e.g., an image or a document), use a trained neural network as an encoder and represent this sample with the output embedding.…”
Section: Introductionmentioning
confidence: 99%