Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
DOI: 10.18653/v1/2021.acl-long.316
Generation-Augmented Retrieval for Open-Domain Question Answering

Abstract: We propose Generation-Augmented Retrieval (GAR) for answering open-domain questions, which augments a query through text generation of heuristically discovered relevant contexts without external resources as supervision. We demonstrate that the generated contexts substantially enrich the semantics of the queries and GAR with sparse representations (BM25) achieves comparable or better performance than state-of-the-art dense retrieval methods such as DPR (Karpukhin et al., 2020). We show that generating diverse …
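As a rough illustration of the idea in the abstract, the sketch below scores documents with a minimal plain-Python BM25 implementation, then rescores after appending a "generated context" to the query. The corpus, query, and augmentation text are invented for illustration; GAR itself produces the context with a trained seq2seq generator (BART), not a hard-coded string.

```python
import math
from collections import Counter

def bm25_scores(query_tokens, corpus_tokens, k1=1.5, b=0.75):
    """Score every document in the corpus against the query with BM25."""
    N = len(corpus_tokens)
    avgdl = sum(len(d) for d in corpus_tokens) / N
    df = Counter()                       # document frequency per term
    for doc in corpus_tokens:
        df.update(set(doc))
    scores = []
    for doc in corpus_tokens:
        tf = Counter(doc)                # term frequency in this document
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            idf = math.log(1 + (N - df[term] + 0.5) / (df[term] + 0.5))
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(doc) / avgdl))
        scores.append(score)
    return scores

corpus = [
    "the eiffel tower is in paris france".split(),
    "bm25 is a ranking function used by search engines".split(),
]
query = "where is the eiffel tower".split()
# GAR-style augmentation: append a generated relevant context.
# Hard-coded here purely for illustration.
augmented = query + "the eiffel tower is located in paris".split()
print(bm25_scores(query, corpus))
print(bm25_scores(augmented, corpus))
```

The augmented query shares more (and more discriminative) terms with the relevant document, so its BM25 score rises without touching the index, which is why GAR can pair with off-the-shelf sparse retrieval.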

Cited by 60 publications (58 citation statements)
References 38 publications
“…DeepCT (Dai & Callan, 2019) uses BERT to dynamically generate lexical weights to augment BM25 systems. doc2Query (Nogueira et al., 2019b), docTTTTTQuery (Nogueira et al., 2019a), and GAR (Mao et al., 2021a) use text generation to expand queries or documents to make better use of BM25. The middle block lists the results of strong dense retrieval methods, including DPR (Karpukhin et al., 2020), ANCE (Xiong et al., 2021), RDR (Yang & Seo, 2020), RocketQA (Qu et al., 2021), Joint and Individual Top-k (Sachan et al., 2021b), PAIR (Ren et al., 2021), DPR-PAQ (Oguz et al., 2021), and Condenser (Gao & Callan, 2021b).…”
Section: Results
confidence: 99%
“…To remedy the vocabulary gap between queries and documents, Nogueira and Lin [29,28] employed the seq2seq Transformer model [39] and later T5 [33] to generate document expansions, which brings significant gains for BM25. In the same vein, Mao et al. [27] adopted the seq2seq model BART [20] to generate query expansions, which outperforms RM3 [15], a highly performant lexical query-expansion method.…”
Section: Related Work
confidence: 99%
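The doc2Query/docTTTTTQuery line of work described above expands the *document* side before indexing: predicted queries are appended to the document text so that BM25 can match query vocabulary. A minimal sketch, with a hard-coded stand-in for the trained seq2seq generator (docTTTTTQuery uses T5; the `toy_generator` below is purely illustrative):

```python
def expand_document(doc_text, query_generator, n_queries=3):
    """doc2Query-style expansion: append predicted queries to the
    document text before indexing, closing the vocabulary gap for BM25."""
    predicted = query_generator(doc_text, n_queries)
    return doc_text + " " + " ".join(predicted)

def toy_generator(doc_text, n):
    """Stand-in for a trained seq2seq model; returns canned queries."""
    return ["where is the eiffel tower located"][:n]

doc = "the eiffel tower stands in paris"
print(expand_document(doc, toy_generator))
```

Note the symmetry with GAR: doc2Query enriches documents at indexing time, while GAR enriches queries at search time; both leave the BM25 scoring function itself unchanged.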
“…In another line of work, Mao et al (2021) seek to generate clarification texts for input questions to improve the retrieval quality in open-domain QA (answering factoid questions without a prespecified domain). The most common approach for this problem involves a retriever-reader architecture (Chen et al, 2017), which first retrieves a small subset of documents in the pool using the input question as the query and then analyzes the retrieved documents to extract (or generate) an answer.…”
Section: Question Generation
confidence: 99%
“…The most common approach for this problem involves a retriever-reader architecture (Chen et al, 2017), which first retrieves a small subset of documents in the pool using the input question as the query and then analyzes the retrieved documents to extract (or generate) an answer. To generate augmented texts for the input question in the first retrieval component, Mao et al (2021) fine-tune BART to consume the input question and attempt to produce the answer and the sentence or title of the paragraph containing the answer. This method demonstrates superior performance for both retrieval and end-to-end QA performance.…”
Section: Question Generation
confidence: 99%
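The retriever-reader architecture described in the excerpts above can be sketched end to end: a retriever narrows the pool to a few candidate documents, and a reader extracts an answer from them. The sketch below uses token overlap as a stand-in for both stages (real systems use BM25 or DPR for retrieval and an extractive neural reader); the corpus and question are invented for illustration.

```python
import re

def tokens(text):
    """Lowercase word tokens as a set, ignoring punctuation."""
    return set(re.findall(r"\w+", text.lower()))

def retrieve(question, corpus, k=1):
    """Retriever stage: rank documents by token overlap with the
    question (stand-in for BM25 or a dense retriever such as DPR)."""
    return sorted(corpus,
                  key=lambda d: len(tokens(question) & tokens(d)),
                  reverse=True)[:k]

def read(question, docs):
    """Reader stage: return the sentence sharing the most tokens with
    the question (stand-in for an extractive neural reader)."""
    sentences = [s for d in docs for s in d.split(". ")]
    return max(sentences, key=lambda s: len(tokens(question) & tokens(s)))

corpus = [
    "The Eiffel Tower is in Paris. It was completed in 1889.",
    "BM25 is a bag-of-words ranking function used by search engines.",
]
question = "Where is the Eiffel Tower?"
print(read(question, retrieve(question, corpus)))
```

Question-side augmentation such as GAR plugs into the `retrieve` step only: the enriched query improves which documents reach the reader, leaving the reader unchanged.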