Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.390
|View full text |Cite
|
Sign up to set email alerts
|

Plot and Rework: Modeling Storylines for Visual Storytelling

Abstract: Writing a coherent and engaging story is not easy. Creative writers use their knowledge and worldview to put disjointed elements together to form a coherent storyline, and work and rework iteratively toward perfection. Automated visual storytelling (VIST) models, however, make poor use of external knowledge and iterative generation when attempting to create stories. This paper introduces PR-VIST, a framework that represents the input image sequence as a story graph in which it finds the best path to form a sto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 34 publications
(35 reference statements)
0
8
0
Order By: Relevance
“…To our knowledge, none of these approaches make use of plan-based decoding. Hsu et al (2021) construct a graph representing the image sequence (based on training data and external resources) and identify the highest scoring path as the best storyline encapsulated therein. The storyline can be viewed as a form of planning, however, on the encoder side.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…To our knowledge, none of these approaches make use of plan-based decoding. Hsu et al (2021) construct a graph representing the image sequence (based on training data and external resources) and identify the highest scoring path as the best storyline encapsulated therein. The storyline can be viewed as a form of planning, however, on the encoder side.…”
Section: Related Workmentioning
confidence: 99%
“…KG-Story (Hsu et al, 2020) predicts a set of words representative of the image sequence, enriches them using external knowledge graphs, and generates stories based on the enriched word set. PR-VIST (Hsu et al, 2021) is a state-of-the-art model which constructs a graph representing the relations between elements in the image sequence, identifies the best storyline captured therein, and proceeds to generate a story based on it. The process of constructing the story graph can be viewed as a form of planning.…”
Section: Comparison Systemsmentioning
confidence: 99%
See 3 more Smart Citations