2014 IEEE Conference on Computer Vision and Pattern Recognition 2014
DOI: 10.1109/cvpr.2014.211
|View full text |Cite
|
Sign up to set email alerts
|

Decorrelating Semantic Visual Attributes by Resisting the Urge to Share

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
165
0
1

Year Published

2014
2014
2021
2021

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 133 publications
(167 citation statements)
references
References 14 publications
1
165
0
1
Order By: Relevance
“…[23] proposed a model DKRL, combining the existing model TransE (originally used for KG completion) [3] and CNN (or BOW), for KGE in zero-shot scenario. In computer vision, [24], [25] train a recognition model for zero-shot object recognition by specifying the category's attributes. [26] proposes a label-embedding model for attribute-based zero-shot classification.…”
Section: Related Workmentioning
confidence: 99%
“…[23] proposed a model DKRL, combining the existing model TransE (originally used for KG completion) [3] and CNN (or BOW), for KGE in zero-shot scenario. In computer vision, [24], [25] train a recognition model for zero-shot object recognition by specifying the category's attributes. [26] proposes a label-embedding model for attribute-based zero-shot classification.…”
Section: Related Workmentioning
confidence: 99%
“…During the test, a prediction can be made by Maximum-a-Posteriori criteria over all of the outputs of the binary classifiers. The main drawback of such framework is the correlation problem that reported in [10]. Besides, the human-defined attribute list can be unrealistic and noisy and need to be selected [9,7,16,18].…”
Section: Related Workmentioning
confidence: 99%
“…To further prove that our part-aware method somehow decorrelates the attributes, we evaluate against the state-of-the-art attribute decorrelation method introduced in [18], where they use semantic groups to encourage in-group feature sharing and between-group competition for features through a lasso multi-task learning framework. We compare with two variants of their method (i) similar to [18], when holistic image-wide features divided into 6 regular grids are used (Weakly-Supervised (WS)-Decor), and (ii) when ground-truth part annotations are supplied to extract part-level features (StronglySupervised (SS)-Decor). We also compare performance of strongly-supervised DPM against the original weaklysupervised DPM [11] which works without strong part annotations at training (Weakly-Supervised (WS)-DPM).…”
Section: Baselines (Attribute Detection)mentioning
confidence: 99%
“…Moreover, such attributes may provide a route to bridge the sketch/photo modality gap, as they are domain invariant if reliably detected (e.g., a high-heel shoe is 'high-heel' regardless if depicted in a photo or sketch). However, they suffer from being hard to predict due to spurious correlations [18]. In this paper we bring together attribute and part-centric modeling to decorrelate and better predict attributes, as well as provide two complementary views of the data to enhance matching.…”
Section: Introductionmentioning
confidence: 99%