Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.445

Highlight-Transformer: Leveraging Key Phrase Aware Attention to Improve Abstractive Multi-Document Summarization

Abstract: Abstractive multi-document summarization aims to generate a comprehensive summary covering salient content from multiple input documents. Compared with previous RNN-based models, Transformer-based models employ the self-attention mechanism to capture dependencies in the input documents and can generate better summaries. Existing works have not considered key phrases when determining the attention weights of self-attention. Consequently, some of the tokens within key phrases receive only small attention weights. It can af…
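To make the idea concrete, here is a minimal sketch (not the paper's exact formulation) of key-phrase-aware self-attention: a scalar boost is added to the pre-softmax attention scores of positions that fall inside key phrases, so those tokens are no longer under-weighted. The names key_phrase_mask and boost are illustrative assumptions.

import torch
import torch.nn.functional as F

def key_phrase_aware_attention(q, k, v, key_phrase_mask, boost=1.0):
    # q, k, v: (batch, seq, dim); key_phrase_mask: (batch, seq) bool,
    # True where the token belongs to a detected key phrase.
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d ** 0.5  # (batch, seq, seq)
    # Hypothetical boost: raise the score of attending TO key-phrase
    # tokens before the softmax normalization.
    scores = scores + boost * key_phrase_mask.unsqueeze(1).float()
    weights = F.softmax(scores, dim=-1)
    return weights @ v

In a full Transformer this adjustment would be applied per head inside multi-head attention; the sketch shows a single head for clarity.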

Cited by 3 publications (5 citation statements)
References 26 publications
“…BERT [5] and BART [22], two well-known language models, use the Transformer encoder-decoder structure to produce summaries. Building on this structure, previous research introduces Entity Aggregation [10], Key Phrase Detection [14], Sentence Structure Relations [1], and Time Content Selection [4] to generate summaries.…”
Section: Abstractive Summarization (mentioning)
confidence: 99%
“…To compose the input-output pairs, they match body sections with the abstract's sections by section titles. Meng et al. [2021] likewise align body sections with the target summary's sections. These SDS works can use the explicit structure (e.g., the division into sections) of the input documents and target summaries to determine the alignment between input and output.…”
Section: Related Work (mentioning)
confidence: 99%
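The title-based alignment the quoted passage describes can be sketched as a simple matching step; the helper below is a hypothetical illustration under that reading, not code from any of the cited works.

def align_sections_by_title(body_sections, abstract_sections):
    # Both arguments: dict mapping section title -> section text.
    # Pair a body section with a summary section when their
    # normalized titles match; unmatched sections are dropped.
    def norm(title):
        return " ".join(title.lower().split())
    body = {norm(t): text for t, text in body_sections.items()}
    return [(body[norm(t)], summary)
            for t, summary in abstract_sections.items()
            if norm(t) in body]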
“…But writing a survey paper takes a lot of time and effort, making it difficult to cover the latest papers and all research topics. Multi-document summarization (MDS) techniques [Liu et al., 2018; Fabbri et al., 2019; Liu et al., 2021; Liu et al., 2022] can be used to automatically produce summaries as a supplement to human-written ones. To cover the latest papers and more research topics at low cost, people can flexibly adjust the input papers and let the summarization methods produce summaries for them.…”
Section: Introduction (mentioning)
confidence: 99%
“…However, extractive methods often suffer from coherence problems (Wu and Hu, 2018). Therefore, instead of directly extracting sentences from the articles, abstractive methods that can rewrite the articles have achieved great success, aided by large annotated corpora (Pang et al., 2021; Zhou et al., 2021; Liu et al., 2021a; Zhong et al., 2020; Liu and Lapata, 2019a).…”
Section: Related Work (mentioning)
confidence: 99%
“…We compare REFLECT with several strong baselines (Liu and Lapata, 2019a; Gehrmann et al., 2018; Fabbri et al., 2019; Perez-Beltrachini and Lapata, 2021; Liu et al., 2021a; Zhong et al., 2020; Zhang et al., 2020a; Pasunuru et al., 2021) on the Multi-News (Fabbri et al., 2019), Multi-XScience (Lu et al., 2020), and WikiCatSum (Perez-Beltrachini et al., 2019) corpora, derived from news, academic, and Wikipedia domains, respectively. Due to the space limit, the results for Multi-XScience and WikiCatSum are provided in Appendix A.…”
Section: Settings (mentioning)
confidence: 99%