Proceedings of the 2nd Workshop on New Frontiers in Summarization 2019
DOI: 10.18653/v1/d19-5409
|View full text |Cite
|
Sign up to set email alerts
|

SAMSum Corpus: A Human-annotated Dialogue Dataset for Abstractive Summarization

Abstract: This paper introduces the SAMSum Corpus, a new dataset with abstractive dialogue summaries. We investigate the challenges it poses for automated summarization by testing several models and comparing their results with those obtained on a corpus of news articles. We show that model-generated summaries of dialogues achieve higher ROUGE scores than the model-generated summaries of news -in contrast with human evaluators' judgement. This suggests that a challenging task of abstractive dialogue summarization requir… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
278
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 257 publications
(366 citation statements)
references
References 15 publications
1
278
0
1
Order By: Relevance
“…Global View and Discrete View In addition to the aforementioned two structured views, conversations can also be naturally viewed from a relatively coarse perspective, i.e., a global view that concatenates all utterances into one giant block (Gliwa et al, 2019), and a discrete view that separates each utterance into a distinct block Gliwa et al, 2019).…”
Section: Conversation View Extractionmentioning
confidence: 99%
See 4 more Smart Citations
“…Global View and Discrete View In addition to the aforementioned two structured views, conversations can also be naturally viewed from a relatively coarse perspective, i.e., a global view that concatenates all utterances into one giant block (Gliwa et al, 2019), and a discrete view that separates each utterance into a distinct block Gliwa et al, 2019).…”
Section: Conversation View Extractionmentioning
confidence: 99%
“…We evaluate our model on a large-scale dialogue summary dataset SAMSum (Gliwa et al, 2019) that has 14732 dialogues with human-written summaries. The data statistics are shown in Table 3.…”
Section: Dataset and Baselinesmentioning
confidence: 99%
See 3 more Smart Citations