2017
DOI: 10.48550/arxiv.1709.06031
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Video Object Segmentation Without Temporal Information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(15 citation statements)
references
References 0 publications
0
15
0
Order By: Relevance
“…Without these enhancements, our performance is still higher. This result demonstrates the ro-J Mean F Mean Overall Validation Set OSVOS [3] 56.6 63.9 60.3 PReMVOS [27] 73.9 81.7 77.8 OSVOS s [28] 64.7 71.3 68.0 OSMN [48] 52.5 57.1 54.8 VideoMatch [17] 56.5 68.2 62.4 RGMP [45] 64.8 68.6 66.7 A-Game [20] 67.2 72.7 70.0 FAVOS [7] 54 bustness and generalization of our approach on a complex dataset.…”
Section: Compare With the State-of-the-art Methodsmentioning
confidence: 63%
“…Without these enhancements, our performance is still higher. This result demonstrates the ro-J Mean F Mean Overall Validation Set OSVOS [3] 56.6 63.9 60.3 PReMVOS [27] 73.9 81.7 77.8 OSVOS s [28] 64.7 71.3 68.0 OSMN [48] 52.5 57.1 54.8 VideoMatch [17] 56.5 68.2 62.4 RGMP [45] 64.8 68.6 66.7 A-Game [20] 67.2 72.7 70.0 FAVOS [7] 54 bustness and generalization of our approach on a complex dataset.…”
Section: Compare With the State-of-the-art Methodsmentioning
confidence: 63%
“…Video object segmentation is gaining rapid development in computer vision. Most solutions are fundamentally supervised, as they rely on heavily pretrained models with human-labeled annotations [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16]. Although manual annotation is extremely costly, there are very few genuine unsupervised methods [17], [18], [19], [20], [21].…”
Section: Scientific Contextmentioning
confidence: 99%
“…Given the manual foreground/background annotations for the first frame in a video clip, semi-supervised VOS methods segment the foreground object along the remaining frames. Deep learning based methods have achieved excellent performance [53,8,25,61,58,60], and static image segmentation [5,44,38,22,23] is utilized to perform video object segmentation without any temporal information. MaskTrack [44] considers the output of the previous frame as a guidance in the next frame to refine the mask.…”
Section: Semi-supervised Video Object Segmentationmentioning
confidence: 99%
“…MaskTrack [44] considers the output of the previous frame as a guidance in the next frame to refine the mask. OSVOS [5] processes each frame independently by finetuning on the first frame, and OSVOS-S [38] further transfers instance-level semantic information learned on ImageNet [12] to produce more accurate results. OnAVOS [53] proposes online finetuning with the predicted frames to further optimize the inference network.…”
Section: Semi-supervised Video Object Segmentationmentioning
confidence: 99%