2022
DOI: 10.1007/978-3-030-98355-0_55

V-FIRST: A Flexible Interactive Retrieval System for Video at VBS 2022

Cited by 7 publications
(7 citation statements)
References 16 publications
“…For instance, vibro [99] employs the OpenCLIP ViT-L/14 [14], [60] trained on LAION-2B [101] to produce joint text-visual embeddings. VideoCLIP [82] and v-FIRST [110] uses the visual transformer CLIP ViT-L@336 [15], [60], [86] trained on the LAION-2B dataset. In VideoCLIP, the integration of Milvus [113] vector database facilitates seamless matching between embeddings.…”
Section: Model System (mentioning)
confidence: 99%
“…In VideoCLIP, the integration of Milvus [113] vector database facilitates seamless matching between embeddings. v-FIRST [59] presents a revised version of their previous interactive video retrieval system [110], which supports querying by textual descriptions and visual examples. The joint text-visual feature space is the basis for many of v-FIRST's functionalities, such as optimized vector search, fast neighbor search, and compression of similar video segments.…”
Section: Model System (mentioning)
confidence: 99%
“…For additional detail about any system, please see the corresponding publication referenced beside the system name in the overview table.
[42] KR 249 25 2 ✓ ✓ ✓ ✓ ✓ ✓
AVSEEKER [41] IE 207 25 2
[75] VN 200 26 2 ✓ ✓ ✓ ✓ ✓ ✓
VideoFall [60] IE 197 25 2 ✓ ✓ ✓ ✓ ✓ ✓ ✓
VERGE [6] GR 176 24 2 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
vitrivr [28] CH 175 21 2 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
VNUHCM [54] VN 161 22 2 ✓ ✓ ✓ ✓
VIREO [55] SG 158 16 2
[34] VN 146 22 2 ✓ ✓ ✓ ✓ ✓
vitrivr-VR [71] CH 137 20 2 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
diveXplore [43] AT 75 14 2 ✓ ✓ ✓ ✓ ✓
Exquisitor [40] DK 72 14 1…”
Section: Related Work Used By Participating Systems (mentioning)
confidence: 99%
“…V-FIRST [75] simply allows the user to input two separate queries, then uses a weighted sum of the two queries to generate ordered pairs of images in a video and return them for the user to browse.…”
Section: Temporal Querying (mentioning)
confidence: 99%
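
The temporal querying statement above describes scoring ordered pairs of images in a video by a weighted sum of two queries. A minimal sketch of that idea follows, assuming each frame and each query is already a vector in a shared embedding space and that similarity is cosine similarity. The function name `rank_temporal_pairs`, the `weight` and `top_k` parameters, and the use of NumPy are illustrative assumptions, not details taken from V-FIRST.

```python
import numpy as np


def rank_temporal_pairs(frames, q1, q2, weight=0.5, top_k=3):
    """Rank ordered frame pairs (i before j) of one video against two queries.

    Hypothetical helper: `frames` is an (n, d) array of frame embeddings in
    temporal order; `q1` and `q2` are (d,) query embeddings.
    """
    # Normalise so that dot products are cosine similarities (an assumption).
    frames = frames / np.linalg.norm(frames, axis=1, keepdims=True)
    q1 = q1 / np.linalg.norm(q1)
    q2 = q2 / np.linalg.norm(q2)

    s1 = frames @ q1  # similarity of each frame to the first query
    s2 = frames @ q2  # similarity of each frame to the second query

    # pair_scores[i, j] = weight * s1[i] + (1 - weight) * s2[j]
    pair_scores = weight * s1[:, None] + (1.0 - weight) * s2[None, :]

    # Keep only pairs where frame i comes strictly before frame j.
    i_idx, j_idx = np.triu_indices(len(frames), k=1)
    scores = pair_scores[i_idx, j_idx]

    order = np.argsort(-scores)[:top_k]
    return [(int(i_idx[o]), int(j_idx[o]), float(scores[o])) for o in order]


# Toy example: three orthogonal "frames"; the first query matches frame 0
# and the second query matches frame 2.
toy_frames = np.eye(3)
ranked = rank_temporal_pairs(toy_frames, toy_frames[0], toy_frames[2])
```

In this toy case the best pair is frame 0 followed by frame 2. Swapping the two queries yields no well-matching pair, because the ordering constraint excludes frame 2 appearing before frame 0.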
“…We propose a new idea for query expansion with the assistance of external search engine to find unknown/unfamiliar concepts (see Section 3.5). We also provide a simple sketch-based retrieval [23] so that users can quickly sketch out the scene of interest.…”
Section: System Overview (mentioning)
confidence: 99%