2022
DOI: 10.48550/arxiv.2201.06570
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR

Abstract: The efficacy of zero-shot sketch-based image retrieval (ZS-SBIR) models is governed by two challenges. The immense distributions-gap between the sketches and the images requires a proper domain alignment. Moreover, the fine-grained nature of the task and the high intra-class variance of many categories necessitates a class-wise discriminative mapping among the sketch, image, and the semantic spaces. Under this premise, we propose BDA-SketRet, a novel ZS-SBIR framework performing a bi-level domain adaptation fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…Competitors. We compare our method with several baselines, including ZSIH [50], CC-DG [40], DOODLE [16], SEM-PCYC [19], SAKE [34], SketchGCN [67], StyleGuide [20], PDFD [13], DSN [57], BDA-SketRet [8], SBTKNet [55], Sketch3T [44], TVT [54] and ViT-Ret/ViT-Vis [18] adapted by us. ViT-Ret means replacing the class token in ViT with a retrieval token used for matching; while ViT-Vis uses the visual tokens for matching.…”
Section: Category-level Zs-sbirmentioning
confidence: 99%
See 1 more Smart Citation
“…Competitors. We compare our method with several baselines, including ZSIH [50], CC-DG [40], DOODLE [16], SEM-PCYC [19], SAKE [34], SketchGCN [67], StyleGuide [20], PDFD [13], DSN [57], BDA-SketRet [8], SBTKNet [55], Sketch3T [44], TVT [54] and ViT-Ret/ViT-Vis [18] adapted by us. ViT-Ret means replacing the class token in ViT with a retrieval token used for matching; while ViT-Vis uses the visual tokens for matching.…”
Section: Category-level Zs-sbirmentioning
confidence: 99%
“…Zero-shot sketch-based image retrieval (ZS-SBIR) is a central problem to sketch understanding [8,16,19,20,28,34,50,54,55,57,60,67]. The zero-shot setting is largely driven by the prevailing data scarcity problem of human sketches [19,28,58] -they are much harder to acquire compared with photos.…”
Section: Introductionmentioning
confidence: 99%
“…IV. RESULTS AND DISCUSSIONS We compare the performance of the proposed model with some of the existing state-of-the-art frameworks of [9]- [12], [14], [17], [18], [21], [34]. We also lay down the performances of some of the notable works in SBIR that uses the same datasets to show how the proposed model which solves a more challenging ZS-SBIR problem achieves comparable performance.…”
Section: Sketchy-ext Tu Berlin-extmentioning
confidence: 99%
“…On the other hand, some of the notable discriminative models involve [10], [16]- [18], to name a few. The Doo-dle2search [10] uses a triplet architecture and uses gradient reversal layers to enforce learning domain agnostic features from image and sketches.…”
Section: Introductionmentioning
confidence: 99%