Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression

Bui, Tien D.; Ribeiro, Leonardo Sampaio Ferraz; Ponti, Moacir Antonelli; Collomosse, John

doi:10.1016/j.cag.2017.12.006

Cited by 61 publications

(76 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our embedding (LS) outperforms all ablations and baselines, with vector query alone contributing significant margin over raster. The addition of fc layers to create cross-modal embedding (-R) slightly improves (importantly, does not degrade) the intermediate raster embedding R available via [6]. The method significantly outperforms recent triplet SBIR approaches [28,5].…”

Section: Evaluating Cross-modal Searchmentioning

confidence: 94%

“…LiveSketch accepts a query sketch Q in vector graphics form (as a variable length sequence of strokes), and searches a large (∼ 10 8 ) dataset of raster images I = {I 1 , ..., I N }. Our two-stream network architecture ( ; the image branch of [6]. Query sketches are encoded via SQ(.…”

Section: Methodsmentioning

confidence: 99%

“…1). A core novelty of our approach lies within the method by which visual suggestions are generated, exploiting the reversibility of deep neural networks (DNNs) that are commonly used to encode image features to create the search index in visual search systems [26,13,7,6]. By identifying clusters of likely target intents for the user's search, we reverse the DNN encoder to explain how such clusters could be gen- Figure 1.…”

Section: Introductionmentioning

confidence: 99%

“…Average time to retrieve 20 class-and instance-level search targets (18 participants, 3 per method). Comparing LiveSketch (LS) interactive method with ablations (-NI) non-interactive/one-shot; (-R) raster substitutes vector query; (-I) intermediate structure embedding, and with the three baselines[6,28,5]. Times in seconds; parentheses total the averaged missed queries.…”

mentioning

confidence: 99%

See 3 more Smart Citations

LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

Collomosse

Bui

Jin

2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Self Cite

View full text Add to dashboard Cite

LiveSketch is a novel algorithm for searching large image collections using hand-sketched queries. LiveSketch tackles the inherent ambiguity of sketch search by creating visual suggestions that augment the query as it is drawn, making query specification an iterative rather than one-shot process that helps disambiguate users' search intent. Our technical contributions are: a triplet convnet architecture that incorporates an RNN based variational autoencoder to search for images using vector (stroke-based) queries; real-time clustering to identify likely search intents (and so, targets within the search embedding); and the use of backpropagation from those targets to perturb the input stroke sequence, so suggesting alterations to the query in order to guide the search. We show improvements in accuracy and time-to-task over contemporary baselines using a 67M image corpus.

show abstract

Section: Evaluating Cross-modal Searchmentioning

confidence: 94%

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

mentioning

confidence: 99%

See 2 more Smart Citations

LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

Collomosse

Bui

Jin

2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Self Cite

View full text Add to dashboard Cite

show abstract

“…This was achieved using a statistical dependency measure to pair unlabeled data during training and supervised with corresponding training pairs. Using a multi-phase training approach (Bui et al, 2018) pretrained a classifier for each domain in a supervised manner and then used a second training phase to learn a transformation between the learned embeddings for cross-domain image retrieval.…”

Section: Semi-supervised Learningmentioning

confidence: 99%

A Semi-Supervised Approach to Sar-Optical Image Matching

Hughes

Schmitt

2019

ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.

View full text Add to dashboard Cite

<p><strong>Abstract.</strong> Matching synthetic aperture radar (SAR) and optical remote sensing imagery is a key first step towards exploiting the complementary nature of these data in data fusion frameworks. While numerous signal-based approaches to matching have been proposed, they often fail to perform well in multi-sensor situations. In recent years deep learning has become the go-to approach for solving image matching in computer vision applications, and has also been adapted to the case of SAR-optical image matching. However, the hitherto proposed techniques still fail to match SAR and optical imagery in a generalizable manner. These limitations are largely due to the complexities in creating large-scale datasets of corresponding SAR and optical image patches. In this paper we frame the matching problem within semi-supervised learning, and use this as a proxy for investigating the effects of data scarcity on matching. In doing so we make an initial contribution towards the use of semi-supervised learning for matching SAR and optical imagery. We further gain insight into the non-complementary nature of commonly used supervised and unsupervised loss functions, as well as dataset size requirements for semi-supervised matching.</p>

show abstract

Deep Manifold Alignment for Mid-Grain Sketch Based Image Retrieval

Bui

Ribeiro

Ponti

et al. 2019

Computer Vision – ACCV 2018

Self Cite

View full text Add to dashboard Cite

We present an algorithm for visually searching image collections using free-hand sketched queries. Prior sketch based image retrieval (SBIR) algorithms adopt either a category-level or fine-grain (instancelevel) definition of cross-domain similarity-returning images that match the sketched object class (category-level SBIR), or a specific instance of that object (fine-grain SBIR). In this paper we take the middle-ground; proposing an SBIR algorithm that returns images sharing both the object category and key visual characteristics of the sketched query without assuming photo-approximate sketches from the user. We describe a deeply learned cross-domain embedding in which 'mid-grain' sketch-image similarity may be measured, reporting on the efficacy of unsupervised and semi-supervised manifold alignment techniques to encourage better intracategory (mid-grain) discrimination within that embedding. We propose a new mid-grain sketch-image dataset (MidGrain65c) and demonstrate not only mid-grain discrimination, but also improved category-level discrimination using our approach.

show abstract

Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression

Cited by 61 publications

References 28 publications

LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

A Semi-Supervised Approach to Sar-Optical Image Matching

Deep Manifold Alignment for Mid-Grain Sketch Based Image Retrieval

Contact Info

Product

Resources

About