Doo Soon Kim scite author profile

Neural abstractive summarization models have led to promising results in summarizing relatively short documents. We propose the first model for abstractive summarization of single, longer-form documents (e.g., research papers). Our approach consists of a new hierarchical encoder that models the discourse structure of a document, and an attentive discourse-aware decoder to generate the summary. Empirical results on two large-scale datasets of scientific papers show that our model significantly outperforms state-of-the-art models.

show abstract

Scoring Sentence Singletons and Pairs for Abstractive Summarization

Lebanoff

Song

Dernoncourt

et al. 2019

View full text Add to dashboard Cite

When writing a summary, humans tend to choose content from one or two sentences and merge them into a single summary sentence. However, the mechanisms behind the selection of one or multiple source sentences remain poorly understood. Sentence fusion assumes multi-sentence input; yet sentence selection methods only work with single sentences and not combinations of them. There is thus a crucial gap between sentence selection and fusion to support summarizing by both compressing single sentences and fusing pairs. This paper attempts to bridge the gap by ranking sentence singletons and pairs together in a unified space. Our proposed framework attempts to model human methodology by selecting either a single sentence or a pair of sentences, then compressing or fusing the sentence(s) to produce a summary sentence. We conduct extensive experiments on both single-and multidocument summarization datasets and report findings on sentence selection and abstraction.

show abstract

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Cohan¹,

Dernoncourt²,

Kim³

et al. 2018

Preprint

View full text Add to dashboard Cite

A Compare-Aggregate Model with Latent Clustering for Answer Selection

Yoon

Dernoncourt

Kim

et al. 2019

View full text Add to dashboard Cite

In this paper, we propose a novel method for a sentence-level answer-selection task that is one of the fundamental problems in natural language processing. First, we explore the effect of additional information by adopting a pretrained language model to compute the vector representation of the input text and by applying transfer learning from a large-scale corpus. Second, we enhance the compare-aggregate model by proposing a novel latent clustering method to compute additional information within the target corpus and by changing the objective function from listwise to pointwise. To evaluate the performance of the proposed approaches, experiments are performed with the WikiQA and TREC-QA datasets. The empirical results demonstrate the superiority of our proposed approach, which achieve state-of-the-art performance on both datasets.

show abstract

Analyzing Sentence Fusion in Abstractive Summarization

Lebanoff¹,

Muchovej²,

Dernoncourt³

et al. 2019

View full text Add to dashboard Cite

While recent work in abstractive summarization has resulted in higher scores in automatic metrics, there is little understanding on how these systems combine information taken from multiple document sentences. In this paper, we analyze the outputs of five state-of-the-art abstractive summarizers, focusing on summary sentences that are formed by sentence fusion. We ask assessors to judge the grammaticality, faithfulness, and method of fusion for summary sentences. Our analysis reveals that system sentences are mostly grammatical, but often fail to remain faithful to the original article.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Doo Soon Kim

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Scoring Sentence Singletons and Pairs for Abstractive Summarization

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

A Compare-Aggregate Model with Latent Clustering for Answer Selection

Analyzing Sentence Fusion in Abstractive Summarization

Contact Info

Product

Resources

About