Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2014
DOI: 10.3115/v1/p14-2052

Single Document Summarization based on Nested Tree Structure

Abstract: Many methods of text summarization combining sentence selection and sentence compression have recently been proposed. Although the dependency between words has been used in most of these methods, the dependency between sentences, i.e., rhetorical structures, has not been exploited in such joint methods. We used both dependency between words and dependency between sentences by constructing a nested tree, in which nodes in the document tree representing dependency between sentences were replaced by a sentence tree representing dependency between words.

Cited by 53 publications (42 citation statements) · References 13 publications · Citing publications: 2015–2023
“…Earlier attempts in this paradigm used Hidden Markov Models and rule-based systems (Jing and McKeown, 2000), statistical models based on parse trees (Knight and Marcu, 2000), and integer linear programming based methods (Martins and Smith, 2009; Gillick and Favre, 2009; Clarke and Lapata, 2010; Berg-Kirkpatrick et al., 2011). Recent approaches investigated discourse structures (Louis et al., 2010; Hirao et al., 2013; Kikuchi et al., 2014; Wang et al., 2015), graph cuts (Qian and Liu, 2013), and parse trees (Li et al., 2014; Bing et al., 2015). For neural models, Cheng and Lapata (2016) used a second neural net to select words from an extractor's output.…”
Section: Related Work
confidence: 99%
“…The more central units of each RST relation are nuclei, while the more peripheral ones are satellites. Prior work in document compression (Daumé and Marcu, 2002) and single-document summarization (Marcu, 1999; Louis et al., 2010; Hirao et al., 2013; Kikuchi et al., 2014; Yoshida et al., 2014) has shown that the structure of discourse trees, especially the nuclearity of non-terminal discourse relations in the tree, is valuable for content selection in summarization. The Penn Discourse Treebank (PDTB) (Prasad et al., 2008), on the other hand, is theory-neutral and does not define a recursive structure for the entire document as RST does.…”
Section: Background and Related Work
confidence: 99%
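As a concrete illustration of how nuclearity can drive content selection, here is a minimal Python sketch in the spirit of Marcu (1999): EDUs reachable from the root through nucleus links only receive the highest salience, and each satellite link along the path costs a point. The tree encoding and scoring rule are illustrative simplifications, not the exact algorithm from any of the cited papers.

```python
# Illustrative nuclearity-based salience over an RST tree, loosely after
# Marcu (1999). Encoding and scoring are simplified; names are hypothetical.
from __future__ import annotations
from dataclasses import dataclass, field

@dataclass
class RSTNode:
    nuclearity: str = "nucleus"        # role w.r.t. the parent relation
    edu: str | None = None             # leaf payload: an elementary discourse unit
    children: list[RSTNode] = field(default_factory=list)

def salience(node: RSTNode, score: int = 0) -> list[tuple[int, str]]:
    """Score each EDU by promotion: every satellite link on the path from
    the root costs one point, so nucleus-only EDUs score highest."""
    if node.edu is not None:
        return [(score, node.edu)]
    ranked: list[tuple[int, str]] = []
    for child in node.children:
        penalty = 0 if child.nuclearity == "nucleus" else 1
        ranked += salience(child, score - penalty)
    return ranked

tree = RSTNode(children=[
    RSTNode("nucleus", edu="The company posted record profits."),
    RSTNode("satellite", children=[
        RSTNode("nucleus", edu="Revenue grew in every region,"),
        RSTNode("satellite", edu="despite a weak fourth quarter."),
    ]),
])
for s, edu in sorted(salience(tree), reverse=True):
    print(s, edu)   # the top-level nucleus EDU ranks first
```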
“…Finer-grained units such as n-grams are frequently used for quantifying content salience and redundancy prior to summarization over sentences (Filatova and Hatzivassiloglou, 2004; Thadani and McKeown, 2008; Gillick and Favre, 2009; Lin and Bilmes, 2011; Cao et al., 2015). In contrast, when the task at hand is more abstractive, the units are more fine-grained, e.g., n-grams and phrases in abstractive summarization (Kikuchi et al., 2014; Liu et al., 2015; Bing et al., 2015), and n-grams and human-annotated concept units in summarization evaluation (Lin, 2004; Hovy et al., 2006). Recently, subject-verb-object triplets were used to automatically identify concept units (Yang et al., 2016) and in abstractive summarization (Li, 2015); however, this requires semantic processing, while EDU segmentation is presently more accurate and scalable.…”
Section: Background and Related Work
confidence: 99%
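The concept-weighting idea above is easy to make concrete. Below is a minimal, hypothetical Python sketch in the spirit of Gillick and Favre (2009): bigram "concepts" are weighted by how many input sentences contain them, and sentences are greedily selected to cover high-weight, not-yet-covered bigrams under a word budget. The original work solves this selection exactly with an ILP; the greedy loop here is only a stand-in, and all function names are illustrative.

```python
# Illustrative bigram-concept selection in the style of Gillick and
# Favre (2009): weight bigram "concepts" by sentence frequency, then
# greedily cover uncovered weight under a word budget. A real system
# would solve this selection exactly with an ILP.
from collections import Counter

def bigrams(sentence: str) -> set:
    toks = sentence.lower().split()
    return set(zip(toks, toks[1:]))

def greedy_summary(sentences: list, budget: int) -> list:
    weights = Counter(bg for s in sentences for bg in bigrams(s))
    chosen, covered, length = [], set(), 0
    while True:
        candidates = [s for s in sentences
                      if s not in chosen and length + len(s.split()) <= budget]
        if not candidates:
            break
        # marginal gain counts only not-yet-covered bigrams, which
        # penalizes redundancy with already selected sentences
        best = max(candidates,
                   key=lambda s: sum(weights[bg] for bg in bigrams(s) - covered))
        if sum(weights[bg] for bg in bigrams(best) - covered) == 0:
            break   # nothing new to cover
        chosen.append(best)
        covered |= bigrams(best)
        length += len(best.split())
    return chosen
```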
“…When we need to generate a summary with a high compression rate (small L_max), it is difficult to produce informative summaries by simply extracting predefined textual units such as EDUs or sentences. Recently, Kikuchi et al. [37] proposed a discourse-based summarization method that integrates sentence extraction and compression. They build a nested dependency tree that represents the dependency relationships between sentences and words and then obtain a summary by trimming the tree.…”
Section: Decoding Time and Summary Examples
confidence: 99%
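To make the nested tree concrete, here is a small, hypothetical Python sketch of the structure described above: a document-level dependency tree whose nodes each carry that sentence's word-level dependency tree. The paper obtains a summary by trimming this structure jointly under a length budget via combinatorial optimization; the sketch only models the structure and the rooted-subtree property a valid sentence trim must preserve (a kept word's head must also be kept). Names are illustrative, not from the paper.

```python
# Hypothetical model of a nested dependency tree as in Kikuchi et al.
# (2014): a sentence dependency tree over the document, where each node
# holds that sentence's word dependency tree. Only the structure and the
# rooted-subtree trimming constraint are modeled here.
from __future__ import annotations
from dataclasses import dataclass, field

@dataclass
class WordNode:                  # word dependency tree inside one sentence
    word: str
    children: list[WordNode] = field(default_factory=list)

@dataclass
class SentenceNode:              # sentence dependency tree over the document
    sentence_root: WordNode
    children: list[SentenceNode] = field(default_factory=list)

def trim_sentence(node: WordNode, keep: set) -> list:
    """Return kept words; descent stops at the first dropped word, so the
    result is always a rooted subtree of the word dependency tree."""
    if node.word not in keep:
        return []
    out = [node.word]
    for child in node.children:
        out += trim_sentence(child, keep)
    return out
```

Deciding which sentence nodes and which word nodes to keep, so that the total length stays under the budget while the retained content is as informative as possible, is what the paper's optimization solves jointly rather than greedily.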