Anais Do X Symposium on Knowledge Discovery, Mining and Learning (KDMiLe 2022) 2022
DOI: 10.5753/kdmile.2022.227792
|View full text |Cite
|
Sign up to set email alerts
|

Successful Youtube video identification using multimodal deep learning

Abstract: Text from titles and audio transcriptions, image thumbnails, number of likes, dislikes, and views are examples of available data in a YouTube video. Despite the variability, most standard Deep Learning models use only one type of data. Moreover, the simultaneous use of multiple data sources for such problems is still rare. To shed light on these problems, we empirically evaluate eight different multimodal fusion operations using embeddings extracted from image thumbnails and video titles of YouTube videos usin… Show more

Help me understand this report

This publication either has no citations yet, or we are still processing them

Set email alert for when this publication receives citations?

See others like this or search for similar articles