2022
DOI: 10.48550/arxiv.2205.11465
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SQuALITY: Building a Long-Document Summarization Dataset the Hard Way

Abstract: Summarization datasets are often assembled either by scraping naturally occurring publicdomain summaries-which are nearly always in difficult-to-work-with technical domainsor by using approximate heuristics to extract them from everyday text-which frequently yields unfaithful summaries. In this work, we turn to a slower but more straightforward approach to developing summarization benchmark data: We hire highly-qualified contractors to read stories and write original summaries from scratch. To amortize reading… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 20 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?