Proceedings of the First Workshop on Storytelling 2018
DOI: 10.18653/v1/w18-1503
|View full text |Cite
|
Sign up to set email alerts
|

A Pipeline for Creative Visual Storytelling

Abstract: Computational visual storytelling produces a textual description of events and interpretations depicted in a sequence of images. These texts are made possible by advances and crossdisciplinary approaches in natural language processing, generation, and computer vision. We define a computational creative visual storytelling as one with the ability to alter the telling of a story along three aspects: to speak about different environments, to produce variations based on narrative goals, and to adapt the narrative … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(26 citation statements)
references
References 12 publications
0
24
0
Order By: Relevance
“…Moving from objects to actions, several tasks have been proposed to mimic more realistic settings where a higher degree of integration between modalities is required. One is visual storytelling (Huang et al, 2016;Gonzalez-Rico and Pineda, 2018;Lukin et al, 2018), where models have to understand the action depicted in each photo and their relations to generate a story. Similar abilities are required in the task of generating non-grounded, human-like questions about an image (Mostafazadeh et al, 2016;Jain et al, 2017), and in that of asking discriminative questions over pairs of similar scenes .…”
Section: Related Workmentioning
confidence: 99%
“…Moving from objects to actions, several tasks have been proposed to mimic more realistic settings where a higher degree of integration between modalities is required. One is visual storytelling (Huang et al, 2016;Gonzalez-Rico and Pineda, 2018;Lukin et al, 2018), where models have to understand the action depicted in each photo and their relations to generate a story. Similar abilities are required in the task of generating non-grounded, human-like questions about an image (Mostafazadeh et al, 2016;Jain et al, 2017), and in that of asking discriminative questions over pairs of similar scenes .…”
Section: Related Workmentioning
confidence: 99%
“…First, all were publicly available and well-documented, ensuring easy replicability. Other existing visual storytelling models (Huang et al, 2016;Yu et al, 2017;Hsu et al, 2018;Lukin et al, 2018) would have required reimplementation. Doing so introduces the possibility of unintentionally crippling performance (e.g., when setting required but unreported parameters), which we wished to avoid.…”
Section: Methodsmentioning
confidence: 99%
“…First, all were publicly available and well-documented, ensuring easy replicability. Other existing visual storytelling models Yu et al, 2017;Hsu et al, 2018;Lukin et al, 2018) would have required reimplementation. Doing so introduces the possibility of unintentionally crippling performance (e.g., when setting required but unreported parameters), which we wished to avoid.…”
Section: Methodsmentioning
confidence: 99%