Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1206
|View full text |Cite
|
Sign up to set email alerts
|

Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

Abstract: We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?". We collect a real-world, large scale dataset for this task by harvesting online articles from the British Broadcasting Corporation (BBC). We propose a novel abstractive model which is conditioned on the article's topics and based entirel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
738
1
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 825 publications
(868 citation statements)
references
References 28 publications
4
738
1
2
Order By: Relevance
“…XSum contains 226,711 news articles accompanied with a one-sentence summary, answering the question "What is this article about?". We used the splits of Narayan et al (2018a) for training, validation, and testing (204,045/11,332/11,334) and followed the pre-processing introduced in their work. Input documents were truncated to 512 tokens.…”
Section: Summarization Datasetsmentioning
confidence: 99%
“…XSum contains 226,711 news articles accompanied with a one-sentence summary, answering the question "What is this article about?". We used the splits of Narayan et al (2018a) for training, validation, and testing (204,045/11,332/11,334) and followed the pre-processing introduced in their work. Input documents were truncated to 512 tokens.…”
Section: Summarization Datasetsmentioning
confidence: 99%
“…Lin and Hovy (1997) studied the position hypothesis, especially in the news article writing (Hong and Nenkova, 2014;Narayan et al, 2018a) but not in other domains such as conversations (Kedzie et al, 2018). Narayan et al (2018a) collected a new corpus to address the bias by compressing multiple contents of source document in the single target summary. In the bias analysis of systems, Lin andBilmes (2012, 2011) studied the sub-aspect hypothesis of summarization systems.…”
Section: Related Workmentioning
confidence: 99%
“…• Summarization of personal post and news articles except for XSum (Narayan et al, 2018a) are biased to the position aspect, while academic papers are well balanced among the three aspects (see Figure 1 (a)). Summarizing long documents (e.g.…”
Section: Introductionmentioning
confidence: 99%
“…Several datasets have been used to aid the development of text summarization models. These datasets are predominantly from the news domain and have several drawbacks such as limited training data (Document Understanding Conference 2 ), shorter summaries (Gigaword (Napoles et al, 2012), XSum (Narayan et al, 2018), and Newsroom (Grusky et al, 2018)), and near-extractive summaries (CNN / Daily Mail dataset (Hermann et al, 2015) news reporting, summary-worthy content is nonuniformly distributed within each article. ArXiv and PubMed datasets (Cohan et al, 2018), which are collected from scientific repositories, are limited in size and have longer yet extractive summaries.…”
Section: Related Workmentioning
confidence: 99%