Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2021
DOI: 10.18653/v1/2021.naacl-main.343
|View full text |Cite
|
Sign up to set email alerts
|

Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation

Abstract: Human Written Story: jenny liked fresh fish. she decided to go fishing to catch her own. she brought her worms and pole and a chair. she sat there all day but didn't catch anything. she packed it up and went home disappointed. Sentence Manipulation: jenny liked fresh fish. she decided to go fishing to catch her own. she wrote songs every single day. she sat there all day but didn't catch anything. she packed it up and went home disappointed. Keyword Manipulation: jenny liked fresh fish. she decided to go fishi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…Therefore our model trained on human-written stories can hardly evaluate story coherence. To enable our model to evaluate story considering coherence issues, we further train our model (Ours (N)) with negative stories that are generated by the methods in the previous works (Guan and Huang, 2020;Ghazarian et al, 2021). We change the margin ranking loss as follow:…”
Section: Task 1: Preference Score Prediction (Ranking)mentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore our model trained on human-written stories can hardly evaluate story coherence. To enable our model to evaluate story considering coherence issues, we further train our model (Ours (N)) with negative stories that are generated by the methods in the previous works (Guan and Huang, 2020;Ghazarian et al, 2021). We change the margin ranking loss as follow:…”
Section: Task 1: Preference Score Prediction (Ranking)mentioning
confidence: 99%
“…COH 200 . We use the same human collected data in the previous work (Ghazarian et al, 2021) 8 , which focused on recognizing coherence issues in the machine-generated stories (e.g., repeat plots, conflict logic).…”
Section: Correlation With Human Judgmentsmentioning
confidence: 99%
See 1 more Smart Citation
“…Synthetic datasets Synthetic dataset construction has been shown to improve robustness of evaluation models (Gupta et al, 2021;Ghazarian et al, 2021) and improve the complexity of test sets (Sakaguchi et al, 2021;Feng et al, 2021). Synthetic claims have been explored in fact-checking to create adversarial and hard test sets.…”
Section: Consistency In Dialoguementioning
confidence: 99%
“…Furthermore, detection of sentences with event boundaries can also be useful when generating engaging stories with a good amount of surprises. (Yao et al, 2019;Rashkin et al, 2020;Ghazarian et al, 2021).…”
Section: Introductionmentioning
confidence: 99%