Automatic Attribute Discovery and Characterization from Noisy Web Data

Berg, Tamara L.; Berg, Alexander C.; Shih, Jonathan

doi:10.1007/978-3-642-15549-9_48

Cited by 324 publications

(282 citation statements)

References 23 publications

Supporting

Mentioning

276

Contrasting

Unclassified

Order By: Relevance

“…Compared to an existing Shoes attribute dataset [4] with relative attributes [21], UT-Zap50K is about 3.5× larger, offers meta-data and 10× more comparative labels, and most importantly, specifically targets fine-grained tasks.…”

Section: Fine-grained Attribute Zappos Datasetmentioning

confidence: 99%

Fine-Grained Visual Comparisons with Local Learning

Grauman

2014

2014 IEEE Conference on Computer Vision and Pattern Recognition

407

312

View full text Add to dashboard Cite

Given two images, we want to predict which exhibits a particular visual attribute more than the other-even when the two images are quite similar. Existing relative attribute methods rely on global ranking functions; yet rarely will the visual cues relevant to a comparison be constant for all data, nor will humans' perception of the attribute necessarily permit a global ordering. To address these issues, we propose a local learning approach for fine-grained visual comparisons. Given a novel pair of images, we learn a local ranking model on the fly, using only analogous training comparisons. We show how to identify these analogous pairs using learned metrics. With results on three challenging datasets-including a large newly curated dataset for fine-grained comparisons-our method outperforms stateof-the-art methods for relative attribute prediction.

show abstract

Section: Fine-grained Attribute Zappos Datasetmentioning

confidence: 99%

Fine-Grained Visual Comparisons with Local Learning

Grauman

2014

2014 IEEE Conference on Computer Vision and Pattern Recognition

407

312

View full text Add to dashboard Cite

show abstract

“…As a result, there are many great datasets that cover object [9,7,31,25,20], attribute [16,1,10], material [26], and scene categories [32,33]. Here, our goal is to create an extensive dataset for characterizing state variation that occurs within image classes.…”

Section: States and Transformations Datasetmentioning

confidence: 99%

“…Unlike this work, we investigate discrete, nameable transformations, like crinkling, rather than working in a hard-to-interpret parameter space. Photo collections have also been mined for storylines [15] as well as spatial and temporal trends [18], and systems have been proposed for more general knowledge discovery from big visual data [21], [1], [3]. Our paper differs from all this work in that we focus on physical state transformations, and in addition to discovering states we also study state pairs that define a transformation.…”

Section: Introductionmentioning

confidence: 99%

Discovering states and transformations in image collections

Isola

Lim

Adelson

2015

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

181

161

View full text Add to dashboard Cite

Objects in visual scenes come in a rich variety of transformed states. A few classes of transformation have been heavily studied in computer vision: mostly simple, parametric changes in color and geometry. However, transformations in the physical world occur in many more flavors, and they come with semantic meaning: e.g., bending, folding, aging, etc. The transformations an object can undergo tell us about its physical and functional properties. In this paper, we introduce a dataset of objects, scenes, and materials, each of which is found in a variety of transformed states. Given a novel collection of images, we show how to explain the collection in terms of the states and transformations it depicts. Our system works by generalizing across object classes: states and transformations learned on one set of objects are used to interpret the image collection for an entirely new object class.

show abstract

“…We validate with three public datasets: Shoes [1], with the attributes from [8] (14,658 images and 10 attributes); outdoor Scenes (2,688 images and 6 attributes); and PubFig celebrity Faces [10] (772 images and 11 attributes). We concatenate GIST and color features for Shoes and Faces, and GIST alone for Scenes.…”

Section: Methodsmentioning

confidence: 99%

Attribute Pivots for Guiding Relevance Feedback in Image Search

Kovashka

Grauman

2013

2013 IEEE International Conference on Computer Vision

View full text Add to dashboard Cite

In interactive image search, a user iteratively refines his results by giving feedback on exemplar images. Active selection methods aim to elicit useful feedback, but traditional approaches suffer from expensive selection criteria and cannot predict informativeness reliably due to the imprecision of relevance feedback. To address these drawbacks, we propose to actively select "pivot" exemplars for which feedback in the form of a visual comparison will most reduce the system's uncertainty. For example, the system might ask, "Is your target image more or less crowded than this image?" Our approach relies on a series of binary search trees in relative attribute space, together with a selection function that predicts the information gain were the user to compare his envisioned target to the next node deeper in a given attribute's tree. It makes interactive search more efficient than existing strategies-both in terms of the system's selection time as well as the user's feedback effort.

show abstract

Automatic Attribute Discovery and Characterization from Noisy Web Data

Cited by 324 publications

References 23 publications

Fine-Grained Visual Comparisons with Local Learning

Fine-Grained Visual Comparisons with Local Learning

Discovering states and transformations in image collections

Attribute Pivots for Guiding Relevance Feedback in Image Search

Contact Info

Product

Resources

About