Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks

Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita

doi:10.1109/tmm.2016.2644872

Cited by 44 publications

(31 citation statements)

References 37 publications

(54 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In particular, our clustering algorithm relies on the minimization of variances inside each scene. For further details, the reader is encouraged to read the paper in which the technique was proposed [5].…”

Section: Textual Concept Featuresmentioning

confidence: 99%

“…Using a scene detection algorithm that we have recently proposed in literature [5], and thanks to the application of Speech-to-Text techniques, it has been possible to automatically annotate a set of 500 educational broadcast videos taken from the large Rai Scuola archive 2 . Also, we developed a browsing and retrieval interface on top of a commercial ECMS, namely eXo Platform, from which the results of the automatic annotation can be browsed and manually refined.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Video Library System Using Scene Detection and Automatic Tagging

Baraldi

Grana

Cucchiara

2017

Communications in Computer and Information Science

Self Cite

View full text Add to dashboard Cite

We present a novel video browsing and retrieval system for edited videos, in which videos are automatically decomposed into meaningful and storytelling parts (i.e. scenes) and tagged according to their transcript. The system relies on a Triplet Deep Neural Network which exploits multimodal features, and has been implemented as a set of extensions to the eXo Platform Enterprise Content Management System (ECMS). This set of extensions enable the interactive visualization of a video, its automatic and semi-automatic annotation, as well as a keyword-based search inside the video collection. The platform also allows a natural integration with third-party add-ons, so that automatic annotations can be exploited outside the proposed platform

show abstract

Section: Textual Concept Featuresmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

A Video Library System Using Scene Detection and Automatic Tagging

Baraldi

Grana

Cucchiara

2017

Communications in Computer and Information Science

Self Cite

View full text Add to dashboard Cite

show abstract

“…Atualmente, as pessoas podem ter acesso a conteúdos usando diferentes tipos de dispositivos e meios (notebooks, celulares, Personal Digital Assistants, WiFi, 3G e 4G, entre outros). Toda essa evolução criou ambientes heterogêneos [Bouyakoub e Belkhir, 2008;Baraldi et al, 2017;Pouyanfar et al, 2018], surgindo com isso desafios no tratamento dos dados, já que, geralmente, quando um dispositivo acessa um conteúdo multimídia para o qual não foi projetado, a experiência do usuário é insatisfatória.…”

Section: Introductionunclassified

Recuperação de Informação Multimídia em Big Data Utilizando OpenCV Python

Goularte¹,

Trojahn²,

Kishi³

2019

Minicursos Do XXV Simpósio Brasileiro De Sistemas Multimídia E Web

View full text Add to dashboard Cite

The popularization of systems, applications and devices to produce, view and share multimedia, saw the need to treat a large volume of data arise. In related areas (such as Multimedia Big Data, Data Science and Multimedia Information Retrieval) a key step is commonly referred as Multimedia Indexing or Multimedia Big Data Analysis, where the aim is to represent multimedia content into smaller, more manageable units, allowing the extraction of data features and information essential to the proper performance of the associated services. This mini-course discusses current tools and techniques for indexing, extracting and processing of multimodal multimedia content. The techniques are exemplified in Python OpenCV over different content (like images, audio, text and video), leading to the interest of services like Netflix, Google and YouTube on this subject, attracting the interest of researchers and developers. ResumoA popularização de sistemas, aplicativos e dispositivos para produzir, exibir e compartilhar conteúdo multimídia fez surgir a necessidade de tratar um grande volume de dados. Nas áreas relacionadas (como Multimedia Big Data, Ciência de Dados e Recuperação de Informação Multimídia) um pré-requisito chave é comumente conhecido como Indexação Multimídia (ou Análise Multimídia em Big Data), onde o objetivo é representar o conteúdo em unidades menores e mais gerenciáveis, permitindo a extração de features dos dados e informações essenciais para o bom funcionamento dos serviços associados. Este minicurso aborda ferramentas e técnicas atuais para indexação, extração e processamento de conteúdo multimídia multimodal. As técnicas XXV Simpósio Brasileiro de Sistemas Multimídia e Web: Minicursos Listagem 5.1. Exemplo de código Python utilizando OpenCV. Arquivo exemplo1.py XXV Simpósio Brasileiro de Sistemas Multimídia e Web: Minicursos

show abstract

“…The Gamification process consists in the application of game-design elements and principles in non-game contexts [16]: it uses the game mechanics to improve skills and knowledge of a subject, also enhancing its engagement and excitement while performing a task that usually does not provides them. Referring to the Csíkszentmihályi [12,19] and Chen [8] studies the sense of fun is strictly connected with the Flow theory characterized by the constant steady and balance between the challenge offered to gamers and the skills developed while facing them: in [5,11,13] are studies about video semantic recognition while the evaluation of affective states and moods are in [3].…”

Section: Introductionmentioning

confidence: 99%

Annote: A Serious Game for Medical Students to Approach Lesion Skin Images of a Digital Library

Balducci

2017

Communications in Computer and Information Science

View full text Add to dashboard Cite

Nowadays it is claimed that one method to learn how to execute a task is to present it as a gaming activity: in this way a teacher can offer a safe and controlled environment for learners also arousing excitement and engagement. In this work we present the design of the serious game 'Annote', to exploit a medical digital library with the aim to help dermatologists to teach students how to approach the examination of skin lesion images to prevent melanomas.

show abstract

Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks

Cited by 44 publications

References 37 publications

A Video Library System Using Scene Detection and Automatic Tagging

A Video Library System Using Scene Detection and Automatic Tagging

Recuperação de Informação Multimídia em Big Data Utilizando OpenCV Python

Annote: A Serious Game for Medical Students to Approach Lesion Skin Images of a Digital Library

Contact Info

Product

Resources

About