Proceedings of the 2023 ACM International Conference on Multimedia Retrieval 2023
DOI: 10.1145/3591106.3592266
|View full text |Cite
|
Sign up to set email alerts
|

Improving Image Encoders for General-Purpose Nearest Neighbor Search and Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 41 publications
0
1
0
Order By: Relevance
“…For example, vibro [99] employs a CLIP ViT-L [15], [86] network that has been pre-trained on the LAION-2B dataset and fine-tuned in publicly available image datasets [98]. This enables the system to extract feature vectors useful for content-based image retrieval [97].…”
Section: Query By Examplementioning
confidence: 99%
“…For example, vibro [99] employs a CLIP ViT-L [15], [86] network that has been pre-trained on the LAION-2B dataset and fine-tuned in publicly available image datasets [98]. This enables the system to extract feature vectors useful for content-based image retrieval [97].…”
Section: Query By Examplementioning
confidence: 99%