2018
DOI: 10.1016/j.cag.2017.12.006
|View full text |Cite
|
Sign up to set email alerts
|

Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression

Abstract: We propose and evaluate several deep network architectures for measuring the similarity between sketches and photographs, within the context of the sketch based image retrieval (SBIR) task. We study the ability of our networks to generalize across diverse object categories from limited training data, and explore in detail strategies for weight sharing, pre-processing, data augmentation and dimensionality reduction. In addition to a detailed comparative study of network configurations, we contribute by describi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
68
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 61 publications
(76 citation statements)
references
References 28 publications
1
68
0
Order By: Relevance
“…Our embedding (LS) outperforms all ablations and baselines, with vector query alone contributing significant margin over raster. The addition of fc layers to create cross-modal embedding (-R) slightly improves (importantly, does not degrade) the intermediate raster embedding R available via [6]. The method significantly outperforms recent triplet SBIR approaches [28,5].…”
Section: Evaluating Cross-modal Searchmentioning
confidence: 94%
See 3 more Smart Citations
“…Our embedding (LS) outperforms all ablations and baselines, with vector query alone contributing significant margin over raster. The addition of fc layers to create cross-modal embedding (-R) slightly improves (importantly, does not degrade) the intermediate raster embedding R available via [6]. The method significantly outperforms recent triplet SBIR approaches [28,5].…”
Section: Evaluating Cross-modal Searchmentioning
confidence: 94%
“…LiveSketch accepts a query sketch Q in vector graphics form (as a variable length sequence of strokes), and searches a large (∼ 10 8 ) dataset of raster images I = {I 1 , ..., I N }. Our two-stream network architecture ( ; the image branch of [6]. Query sketches are encoded via SQ(.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This was achieved using a statistical dependency measure to pair unlabeled data during training and supervised with corresponding training pairs. Using a multi-phase training approach (Bui et al, 2018) pretrained a classifier for each domain in a supervised manner and then used a second training phase to learn a transformation between the learned embeddings for cross-domain image retrieval.…”
Section: Semi-supervised Learningmentioning
confidence: 99%