A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal

Ghalandari, Demian Gholipour; Hokamp, Chris; Pham, Nghia The; Glover, John; Ifrim, Georgiana

doi:10.48550/arxiv.2005.10070

Cited by 1 publication

(2 citation statements)

References 3 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Supervised methods in abstractive summarization always use the encoder-decoder transformer architecture with data sets of large, paired document-summary examples. Ghalandari et al (2020) propose an end-to-end Hierarchical MMR-Attention Pointergenerator (Hi-MAP) model to address the information redundancy. Li et al (2020) develop a neural abstractive MDS model which can leverage similarity graph or discourse graph representations of documents, to more effectively capture cross-document relations.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Chain-of-event prompting for multi-document summarization by large language models

Bao,

Li,

Cao

2024

IJWIS

View full text Add to dashboard Cite

Purpose In the era of big data, various industries are generating large amounts of text data every day. Simplifying and summarizing these data can effectively serve users and improve efficiency. Recently, zero-shot prompting in large language models (LLMs) has demonstrated remarkable performance on various language tasks. However, generating a very “concise” multi-document summary is a difficult task for it. When conciseness is specified in the zero-shot prompting, the generated multi-document summary still contains some unimportant information, even with the few-shot prompting. This paper aims to propose a LLMs prompting for multi-document summarization task. Design/methodology/approach To overcome this challenge, the authors propose chain-of-event (CoE) prompting for multi-document summarization (MDS) task. In this prompting, the authors take events as the center and propose a four-step summary reasoning process: specific event extraction; event abstraction and generalization; common event statistics; and summary generation. To further improve the performance of LLMs, the authors extend CoE prompting with the example of summary reasoning. Findings Summaries generated by CoE prompting are more abstractive, concise and accurate. The authors evaluate the authors’ proposed prompting on two data sets. The experimental results over ChatGLM2-6b show that the authors’ proposed CoE prompting consistently outperforms other typical promptings across all data sets. Originality/value This paper proposes CoE prompting to solve MDS tasks by the LLMs. CoE prompting can not only identify the key events but also ensure the conciseness of the summary. By this method, users can access the most relevant and important information quickly, improving their decision-making processes.

show abstract

Section: Related Workmentioning

confidence: 99%

“…WCEP (Ghalandari et al, 2020). WCEP data set contains human-written summaries of recent news events.…”

Section: Large Language Modelsmentioning

confidence: 99%

Chain-of-event prompting for multi-document summarization by large language models

Bao,

Li,

Cao

2024

IJWIS

View full text Add to dashboard Cite

show abstract

A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal

Cited by 1 publication

References 3 publications

Chain-of-event prompting for multi-document summarization by large language models

Chain-of-event prompting for multi-document summarization by large language models

Contact Info

Product

Resources

About