Weakly Supervised Learning of Object Segmentations from Web-Scale Video

Hartmann, Glenn D.; Grundmann, Matthias; Hoffman, Judy; Tsai, David; Kwatra, Vivek; Madani, Omid; Vijayanarasimhan, Sudheendra; Essa, Irfan; Rehg, James M.; Sukthankar, Rahul

doi:10.1007/978-3-642-33863-2_20

Cited by 48 publications

(64 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Methods for this "weakly supervised" setting attempt to learn an object model from ambiguously labeled exemplars [15,23,28,30]. This is very different from the propagation problem we tackle; our method gets only one video at a time and cannot benefit from cross-video appearance sharing.…”

Section: Related Workmentioning

confidence: 99%

Supervoxel-Consistent Foreground Propagation in Video

Jain

Grauman

2014

Lecture Notes in Computer Science

169

159

View full text Add to dashboard Cite

Abstract. A major challenge in video segmentation is that the foreground object may move quickly in the scene at the same time its appearance and shape evolves over time. While pairwise potentials used in graph-based algorithms help smooth labels between neighboring (super)pixels in space and time, they offer only a myopic view of consistency and can be misled by inter-frame optical flow errors. We propose a higher order supervoxel label consistency potential for semi-supervised foreground segmentation. Given an initial frame with manual annotation for the foreground object, our approach propagates the foreground region through time, leveraging bottom-up supervoxels to guide its estimates towards long-range coherent regions. We validate our approach on three challenging datasets and achieve state-of-the-art results.

show abstract

Section: Related Workmentioning

confidence: 99%

Supervoxel-Consistent Foreground Propagation in Video

Jain

Grauman

2014

Lecture Notes in Computer Science

169

159

View full text Add to dashboard Cite

show abstract

“…The second scenario, which we term inductive segment annotation (ISA), is studied in [11]. In this setting, a segment classifier is trained using a large quantity of weakly labeled segments from both positively-and negatively-tagged videos.…”

Section: Spatiotemporal Segmentationmentioning

confidence: 99%

“…We make publicly available segment-level annotations for a subset of the Prest et al dataset [20] and show convincing results. We also show state-of-the-art results on Hartmann et al's more difficult, large-scale object segmentation dataset [11]. …”

mentioning

confidence: 95%

“…However, since tags are not spatially or temporally localized within the video, such videos cannot be directly exploited for training traditional supervised recognition systems. This has stimulated significant recent interest in methods that learn localized concepts under weak supervision [11,16,20,25]. In this paper, we examine the problem of generating pixel-level concept annotations for weakly labeled video.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Discriminative Segment Annotation in Weakly Labeled Video

Tang

Sukthankar

Yagnik

et al. 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition

Self Cite

120

157

View full text Add to dashboard Cite

This paper tackles the problem of segment annotation in complex Internet videos. Given a weakly labeled video, we automatically generate spatiotemporal masks for each of the concepts with which it is labeled. This is a particularly relevant problem in the video domain, as large numbers of Internet videos are now available, tagged with the visual concepts that they contain. Given such weakly labeled videos, we focus on the problem of spatiotemporal segment classification. We propose a straightforward algorithm, CRANE, that utilizes large amounts of weakly labeled video to rank spatiotemporal segments by the likelihood that they correspond to a given visual concept. We make publicly available segment-level annotations for a subset of the Prest et al. dataset [20] and show convincing results. We also show state-of-the-art results on Hartmann et al.'s more difficult, large-scale object segmentation dataset [11].

show abstract

“…Weakly supervised video segmentation methods [10][11][12] are proposed to curtail the need of pixel-level labeled training video data. Our proposed method is inspired by this line of research.…”

Section: Related Workmentioning

confidence: 99%

Efficient Object Localization and Segmentation in Weakly Labeled Videos

Rochan

Yang

2014

Advances in Visual Computing

View full text Add to dashboard Cite

Abstract. In this paper, we tackle the problem of efficiently segmenting objects in weakly labeled videos. Internet videos (e.g., YouTube) are often associated with a semantic tag describing the main object within the video. However, this tag does not provide any spatial or temporal information about the object within the video. So these videos are weakly labeled. We propose a novel and efficient approach to localize the object of interest within the video and perform pixel-level segmentation. Given a video with an object tag, our proposed method automatically localizes the object and segments it from the background in each frame of the video. Our method combines object appearance modeling and temporal consistency among frames in a principled framework. Our method does not require user inputs or object detectors, so it can be potentially applied to videos of any object categories. We evaluate our method on a dataset consisting of more than 100 video shots of 10 different object categories. Our experimental results show that our method outperforms other baseline approaches.

show abstract

Weakly Supervised Learning of Object Segmentations from Web-Scale Video

Cited by 48 publications

References 23 publications

Supervoxel-Consistent Foreground Propagation in Video

Supervoxel-Consistent Foreground Propagation in Video

Discriminative Segment Annotation in Weakly Labeled Video

Efficient Object Localization and Segmentation in Weakly Labeled Videos

Contact Info

Product

Resources

About