Recent years have witnessed the success of abstractive summarization using the encoder-decoder framework with sequence-to-sequence models (Rush et al., 2015; Nallapati et al., 2016; See et al., 2017; Celikyilmaz et al., 2018). The encoder, which compresses the source text into a latent representation, can be implemented with recurrent neural networks (Chopra et al., 2016; Tan et al., 2017; Chen and Bansal, 2018), convolutional networks (Allamanis et al., 2016; Liu et al., 2018), or transformer-based methods (Devlin et al., 2019; Song et al., 2020b). To handle the out-of-vocabulary (OOV) words generated by a vanilla sequence-to-sequence decoder, the copy mechanism was proposed, which at each step either copies a word from the source text or selects a word from the vocabulary (See et al., 2017; Zhou et al., 2018).
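As a concrete illustration of the copy mechanism described above, the following is a minimal numpy sketch of the pointer-generator mixing step in the style of See et al. (2017): the final output distribution is a convex combination of the vocabulary distribution (weighted by a generation probability `p_gen`) and the attention distribution scattered onto the source tokens' ids. The function name `copy_distribution` and the toy values are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def copy_distribution(p_vocab, attention, src_ids, p_gen, extended_size):
    """Mix generation and copy distributions (pointer-generator style sketch).

    p_vocab:       probabilities over the fixed vocabulary, shape (V,)
    attention:     decoder attention over source positions, shape (T,)
    src_ids:       vocabulary ids of the source tokens (source-only OOV
                   words get temporary ids >= V in an extended vocabulary)
    p_gen:         scalar in [0, 1], probability of generating vs. copying
    extended_size: V plus the number of source-only OOV words
    """
    final = np.zeros(extended_size)
    final[: p_vocab.size] = p_gen * p_vocab
    # Scatter-add the copy probability onto each source token's id, so OOV
    # source words (ids >= V) become reachable in the output distribution.
    np.add.at(final, src_ids, (1.0 - p_gen) * attention)
    return final

# Toy example: a vocabulary of 5 words; the source has three tokens,
# one of which is an OOV word assigned the extended id 5.
p_vocab = np.array([0.1, 0.2, 0.3, 0.25, 0.15])
attention = np.array([0.5, 0.3, 0.2])
src_ids = np.array([1, 5, 3])
dist = copy_distribution(p_vocab, attention, src_ids, p_gen=0.8, extended_size=6)
assert abs(dist.sum() - 1.0) < 1e-9  # remains a valid distribution
```

With `p_gen = 0.8`, 20% of the probability mass is routed through attention to source positions, which is how the OOV token (id 5) receives nonzero probability despite being absent from the fixed vocabulary.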