Proceedings of the CHI Conference on Human Factors in Computing Systems 2024
DOI: 10.1145/3613904.3642839
|View full text |Cite
|
Sign up to set email alerts
|

Making Short-Form Videos Accessible with Hierarchical Video Summaries

Tess Van Daele,
Akhil Iyer,
Yuning Zhang
et al.

Abstract: Figure 1: ShortScribe makes short-form videos accessible with hierarchical video descriptions. ShortScribe extracts video data by identifying key frames then applying automatic speech recognition (ASR), automated description (BLIP-2), and optical character recognition (OCR). A large language model (GPT-4) then generates multiple descriptions. TikTok

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 55 publications
0
0
0
Order By: Relevance