Findings of the Association for Computational Linguistics: EMNLP 2021
DOI: 10.18653/v1/2021.findings-emnlp.377
An Exploratory Study on Long Dialogue Summarization: What Works and What’s Next

Abstract: Dialogue summarization helps readers capture salient information from long conversations in meetings, interviews, and TV series. However, real-world dialogues pose a great challenge to current summarization models, as the dialogue length typically exceeds the input limits imposed by recent transformer-based pretrained models, and the interactive nature of dialogues makes relevant information more context-dependent and sparsely distributed than news articles. In this work, we perform a comprehensive study on lo…

Cited by 17 publications (6 citation statements)
References 21 publications
“…This extra step reduces the burden on the neural summarizers, which have to generate an abstractive summary and select important content at the same time. Some also refer to models that use this hybrid approach as retrieve-then-summarize models, because they retrieve a subset of the long document text before summarizing it [131]. TLM+Ext [94] first implemented this method by limiting the input for scientific articles in the arXiv dataset to the introduction of the document plus a subset of carefully selected sentences extracted from the original article, and, finally, including the remaining text if extra space is left for the Transformer-based decoder.…”
Section: Discourse Bias
confidence: 99%
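As a concrete illustration of the retrieve-then-summarize idea described in the statement above, the following minimal Python sketch retrieves a query-relevant subset of dialogue turns under a token budget before handing the condensed text to an abstractive summarizer. The overlap-based scorer, the budget, and all function names are illustrative assumptions, not the implementation of any cited system.

# Minimal retrieve-then-summarize sketch (illustrative assumptions only;
# scorer, budget, and names are not taken from any cited system).
from collections import Counter

def score_turn(turn: str, query: str) -> float:
    # Score a dialogue turn by how often words from the query appear in it.
    words = Counter(turn.lower().split())
    return float(sum(words[w] for w in set(query.lower().split())))

def retrieve(turns: list[str], query: str, budget: int = 512) -> str:
    # Greedily keep the highest-scoring turns until a rough token budget is
    # filled, then restore the original order so the summarizer sees a
    # coherent slice of the conversation.
    ranked = sorted(range(len(turns)), key=lambda i: score_turn(turns[i], query), reverse=True)
    kept, used = [], 0
    for i in ranked:
        n = len(turns[i].split())
        if used + n <= budget:
            kept.append(i)
            used += n
    return " ".join(turns[i] for i in sorted(kept))

if __name__ == "__main__":
    dialogue = [
        "PM: Let's go over the remote control design.",
        "ID: The rubber case tested well with users.",
        "ME: Battery costs push us over budget.",
        "PM: We will revisit the budget next meeting.",
    ]
    condensed = retrieve(dialogue, query="What was decided about the budget?", budget=40)
    print(condensed)

In practice the condensed input would then be passed to a pretrained abstractive model (e.g., a BART or Longformer-based summarizer) whose input limit it now fits.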
“…Other than the discourse bias mechanism, we observe that (a) efficient attention and (b) content selection are the two most notable long-document mechanisms. As the content selection mechanism requires a separate retriever to extract salient content from the source (i.e., the hybrid approach), we distinguish Transformer models with a content selection mechanism as retrieve-then-summarize models [131] and the pure encoder-decoder Transformer without this mechanism as an end-to-end model for the rest of this work. Lastly, it is also important to note that both mechanisms can be jointly implemented within a single architecture, where the content selection mechanism extracts a longer subset of the input to be processed by a Transformer with efficient attention [77].…”
Section: Supervised Hybrid
confidence: 99%
“…(3) Our summaries adhere to personalized user preferences. In comparisons with strong LLM summarization methods on the standard datasets MACSum [4] and arXiv [5], which contain long-document summarization examples, our approach demonstrated clear advantages.…”
Section: Introduction
confidence: 98%
“…The results of the Longformer-based model on AMI and ICSI are from (Fabbri et al. 2021), and its results on QMSum come from (Zhang et al. 2021b). In the screenplay domain, the results of Longformer are from Chen et al. (2021a). BART-large-CNN refers to further fine-tuning BART-large on the news summarization dataset CNN/DailyMail.…”
confidence: 99%