Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
DOI: 10.18653/v1/2021.acl-long.474
Focus Attention: Promoting Faithfulness and Diversity in Summarization

Abstract: Professional summaries are written with document-level information, such as the theme of the document, in mind. This is in contrast with most seq2seq decoders which simultaneously learn to focus on salient content, while deciding what to generate, at each decoding step. With the motivation to narrow this gap, we introduce Focus Attention Mechanism, a simple yet effective method to encourage decoders to proactively generate tokens that are similar or topical to the input document. Further, we propose a Focus Sa…
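The abstract's core idea, biasing each decoding step toward tokens that are topical to the source document, can be sketched as a source-conditioned bias added to the decoder's vocabulary logits. The PyTorch snippet below is a minimal, hypothetical reconstruction: the module name FocusBias, the mean-pooling of encoder states, and the additive mixing into the logits are illustrative assumptions rather than the authors' released implementation.

```python
# Hypothetical sketch of a source-conditioned vocabulary bias, in the spirit of
# focus-style attention: the names, pooling choice, and additive mixing below
# are assumptions for illustration, not the paper's actual code.
import torch
import torch.nn as nn


class FocusBias(nn.Module):
    """Projects pooled encoder states to a per-token vocabulary bias."""

    def __init__(self, hidden_size: int, vocab_size: int):
        super().__init__()
        self.proj = nn.Linear(hidden_size, vocab_size)

    def forward(self, encoder_states: torch.Tensor, source_mask: torch.Tensor) -> torch.Tensor:
        # Mean-pool encoder states over non-padding source positions.
        mask = source_mask.unsqueeze(-1).float()                # (batch, src_len, 1)
        pooled = (encoder_states * mask).sum(1) / mask.sum(1)   # (batch, hidden)
        return self.proj(pooled)                                # (batch, vocab)


# Usage: add the source-conditioned bias to the decoder's token logits before
# the softmax, nudging generation toward tokens that are topical to the input.
batch, src_len, hidden, vocab = 2, 16, 32, 100
encoder_states = torch.randn(batch, src_len, hidden)
source_mask = torch.ones(batch, src_len, dtype=torch.bool)
decoder_logits = torch.randn(batch, vocab)

focus = FocusBias(hidden, vocab)
biased_logits = decoder_logits + focus(encoder_states, source_mask)
probs = torch.softmax(biased_logits, dim=-1)
print(probs.shape)  # torch.Size([2, 100])
```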

Cited by 29 publications (29 citation statements)
References 68 publications (85 reference statements)
“…The encoder is expected to turn input text into meaningful representations so that a model can comprehend the input. When encoders learn wrong correlations between different parts of the training data, this can result in erroneous generation that diverges from the input [2,48,98,172].…”
Section: Hallucination from Training and Inference (mentioning)
confidence: 99%
“…The attention mechanism is an integral component that selectively concentrates on relevant parts of the input while ignoring others, based on dependencies in neural networks [4,177]. To encourage the generator to pay more attention to the source, Aralikatte et al. [2] introduce a short circuit from the input document to the vocabulary distribution via a source-conditioned bias. Krishna et al. [85] employ sparse attention to improve the model's long-range dependencies, in the hope of modeling more retrieved documents to mitigate hallucination in the answer.…”
Section: Information Augmentation (mentioning)
confidence: 99%
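The sparse attention mentioned in the statement above amounts, in its generic form, to restricting each position to a local window of nearby positions. The sketch below shows only that generic banded masking; the window size and masking scheme are assumptions for illustration and not the cited work's exact configuration.

```python
# Generic local-window sparse attention mask; window size and masking scheme
# are illustrative assumptions, not the cited paper's exact setup.
import torch


def local_attention_mask(seq_len: int, window: int) -> torch.Tensor:
    """Boolean mask: True where attention is allowed (|i - j| <= window)."""
    idx = torch.arange(seq_len)
    return (idx[None, :] - idx[:, None]).abs() <= window


def sparse_attention(q, k, v, window: int = 2):
    # Scaled dot-product attention with positions outside the window masked out.
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    mask = local_attention_mask(q.size(-2), window)
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v


q = k = v = torch.randn(1, 8, 16)   # (batch, seq_len, dim)
out = sparse_attention(q, k, v)
print(out.shape)                    # torch.Size([1, 8, 16])
```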
“…First, a separate correction model is learned to fix errors made by the summarizers (Zhao et al., 2020), including replacing entities absent from the source or revising all possible errors (Cao et al., 2020). The second type modifies the sequence-to-sequence architecture to incorporate relation triplets (Cao et al., 2018), knowledge graphs (Zhu et al., 2021), and topics (Aralikatte et al., 2021) to inform the summarizers of article facts. Yet additional engineering effort and model retraining are often needed.…”
Section: Related Work (mentioning)
confidence: 99%