2012
DOI: 10.1007/978-3-642-33863-2_20
|View full text |Cite
|
Sign up to set email alerts
|

Weakly Supervised Learning of Object Segmentations from Web-Scale Video

Abstract: Abstract. We propose to learn pixel-level segmentations of objects from weakly labeled (tagged) internet videos. Specifically, given a large collection of raw YouTube content, along with potentially noisy tags, our goal is to automatically generate spatiotemporal masks for each object, such as "dog", without employing any pre-trained object detectors. We formulate this problem as learning weakly supervised classifiers for a set of independent spatio-temporal segments. The object seeds obtained using segment-le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
63
0
1

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 48 publications
(64 citation statements)
references
References 23 publications
0
63
0
1
Order By: Relevance
“…Methods for this "weakly supervised" setting attempt to learn an object model from ambiguously labeled exemplars [15,23,28,30]. This is very different from the propagation problem we tackle; our method gets only one video at a time and cannot benefit from cross-video appearance sharing.…”
Section: Related Workmentioning
confidence: 99%
“…Methods for this "weakly supervised" setting attempt to learn an object model from ambiguously labeled exemplars [15,23,28,30]. This is very different from the propagation problem we tackle; our method gets only one video at a time and cannot benefit from cross-video appearance sharing.…”
Section: Related Workmentioning
confidence: 99%
“…The second scenario, which we term inductive segment annotation (ISA), is studied in [11]. In this setting, a segment classifier is trained using a large quantity of weakly labeled segments from both positively-and negatively-tagged videos.…”
Section: Spatiotemporal Segmentationmentioning
confidence: 99%
“…We make publicly available segment-level annotations for a subset of the Prest et al dataset [20] and show convincing results. We also show state-of-the-art results on Hartmann et al's more difficult, large-scale object segmentation dataset [11]. …”
mentioning
confidence: 95%
See 1 more Smart Citation
“…Weakly supervised video segmentation methods [10][11][12] are proposed to curtail the need of pixel-level labeled training video data. Our proposed method is inspired by this line of research.…”
Section: Related Workmentioning
confidence: 99%