2016
DOI: 10.48550/arxiv.1611.05198
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

One-Shot Video Object Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(11 citation statements)
references
References 0 publications
0
11
0
Order By: Relevance
“…So we use 5 video set AC, IU, JM, MS, VK in i2iDatabase dataset (results shown in Figure 2) and three measures used in DAVIS database to evaluate our system in the following terms: region similarity J (with respect to intersection of union -IoU), contour accuracy F and temporal stability T . Although our method focus on the case with no manual input, we compare our result with the state-of-the-art methods in both unsupervised (FST [28]) and semi-supervised techniques (BVS [31] and OSVOS [29]), the latter of which takes ground-truth of first frame as initial mask.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…So we use 5 video set AC, IU, JM, MS, VK in i2iDatabase dataset (results shown in Figure 2) and three measures used in DAVIS database to evaluate our system in the following terms: region similarity J (with respect to intersection of union -IoU), contour accuracy F and temporal stability T . Although our method focus on the case with no manual input, we compare our result with the state-of-the-art methods in both unsupervised (FST [28]) and semi-supervised techniques (BVS [31] and OSVOS [29]), the latter of which takes ground-truth of first frame as initial mask.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…For monocular video, algorithms are normally difficult to define the region of foreground by only color and motion information without human interaction. Therefore, most researches [4,5,31,29] have adapted the approach of providing manually the mask of key frames to facilitate segmentation. Based on this approach, video segmentation systems [1,2,3,6] which require gradually adding the user's input to correct the result during segmentation processing are built, and they could achieve considerable segmenting accuracy under human interaction.…”
Section: Introductionmentioning
confidence: 99%
“…Our approach outputs per-frame instance segmentation using a convnet architecture, inspired by works from other domains like [6,40,49]. A concurrent work [5] also exploits convnets for video object segmentation. Differently from our approach their segmentation is not guided, which might result in performance decay over time.…”
Section: Global Propagationmentioning
confidence: 99%
“…e-mail: mennatul@ualberta.ca. 2 Mahmoud Gamal is with Cairo University, Egypt. 3 Mohamed El-Hoseiny is with Facebook AI Research.…”
Section: Introductionmentioning
confidence: 99%
“…(1) Abundance of the different poses of the object. (2) The existence of different instances/classes within the same category. (3) Different challenges introduced by cluttered backgrounds, different rigid and non-rigid transformations, occlusions and illumination changes.…”
Section: Introductionmentioning
confidence: 99%