Learning Quality-aware Dynamic Memory for Video Object Segmentation

Liu, Yong; Rong, Yu; Yin, Fang‐Fang; Zhao, Xianfeng; Wang, Zhao; Xia, Weihao; Yang, Yujiu

doi:10.48550/arxiv.2207.07922

Cited by 1 publication

(1 citation statement)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The Space-Time Memory Network [35] memorizes intermediate frames with segmentation masks as references and performs pixellevel matching between them with the current frame to segment target objects in a bottom-up manner, which has been proved effective and has served as the current mainstream framework. Some works [40,23,5,15,59,41,51,6,62,46,25,27] further develop STM and have achieved excellent performance.…”

Section: Introductionmentioning

confidence: 99%

Global Spectral Filter Memory Network for Video Object Segmentation

Liu¹,

Rong²,

Wang³

et al. 2022

Preprint

View full text Add to dashboard Cite

This paper studies semi-supervised video object segmentation through boosting intra-frame interaction. Recent memory networkbased methods focus on exploiting inter-frame temporal reference while paying little attention to intra-frame spatial dependency. Specifically, these segmentation model tends to be susceptible to interference from unrelated nontarget objects in a certain frame. To this end, we propose Global Spectral Filter Memory network (GSFM), which improves intraframe interaction through learning long-term spatial dependencies in the spectral domain. The key components of GSFM is 2D (inverse) discrete Fourier transform for spatial information mixing. Besides, we empirically find low frequency feature should be enhanced in encoder (backbone) while high frequency for decoder (segmentation head). We attribute this to semantic information extracting role for encoder and fine-grained details highlighting role for decoder. Thus, Low (High) Frequency Module is proposed to fit this circumstance. Extensive experiments on the popular DAVIS and YouTube-VOS benchmarks demonstrate that GSFM noticeably outperforms the baseline method and achieves state-of-theart performance. Besides, extensive analysis shows that the proposed modules are reasonable and of great generalization ability. Our source code is available at https://github.com/workforai/GSFM.

show abstract