2020
DOI: 10.1609/aaai.v34i05.6282

TANDA: Transfer and Adapt Pre-Trained Transformer Models for Answer Sentence Selection

Abstract: We propose TandA, an effective technique for fine-tuning pre-trained Transformer models for natural language tasks. Specifically, we first transfer a pre-trained model into a model for a general task by fine-tuning it with a large and high-quality dataset. We then perform a second fine-tuning step to adapt the transferred model to the target domain. We demonstrate the benefits of our approach for answer sentence selection, which is a well-known inference task in Question Answering. We built a large scale datas…
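
A minimal sketch of the "transfer then adapt" procedure described in the abstract, written against the Hugging Face transformers Trainer API. The library choice, model name, hyperparameters, output paths, and the (question, candidate, label) data format are assumptions for illustration, not the authors' exact setup.

import torch
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

class AS2Dataset(torch.utils.data.Dataset):
    # Question/candidate sentence pairs with binary labels (1 = candidate answers the question).
    def __init__(self, pairs, tokenizer, max_len=128):
        self.enc = tokenizer([q for q, s, _ in pairs], [s for q, s, _ in pairs],
                             truncation=True, padding="max_length", max_length=max_len)
        self.labels = [label for _, _, label in pairs]
    def __len__(self):
        return len(self.labels)
    def __getitem__(self, i):
        item = {k: torch.tensor(v[i]) for k, v in self.enc.items()}
        item["labels"] = torch.tensor(self.labels[i])
        return item

def finetune(model_name_or_path, train_pairs, out_dir, epochs=2):
    tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
    model = AutoModelForSequenceClassification.from_pretrained(model_name_or_path, num_labels=2)
    args = TrainingArguments(output_dir=out_dir, num_train_epochs=epochs,
                             per_device_train_batch_size=32, learning_rate=2e-5)
    Trainer(model=model, args=args, train_dataset=AS2Dataset(train_pairs, tokenizer)).train()
    model.save_pretrained(out_dir)
    tokenizer.save_pretrained(out_dir)
    return out_dir

# Step 1 ("transfer"): fine-tune the pre-trained Transformer on a large, general AS2 dataset.
# Step 2 ("adapt"): fine-tune the resulting checkpoint on the smaller target-domain dataset.
# transferred = finetune("bert-base-uncased", general_as2_pairs, "out/transfer")
# adapted = finetune(transferred, target_domain_pairs, "out/adapt")
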

Cited by 164 publications (251 citation statements)
References 16 publications

“…Yang et al [23] applied it to Ad Hoc Document Retrieval, obtaining significant improvement. Garg et al [8] fine-tuned BERT for AS2, achieving the state of the art. However, BERT's high computational cost prevents its use in most real-world applications.…”
Section: Related Work (mentioning)
confidence: 99%
“…AS2, given a question and a set of answer sentence candidates, consists in selecting sentences (e.g., retrieved by a search engine) that correctly answer the question. Neural models have contributed significant new techniques to AS2, e.g., [8,11]. More recently, neural language models, e.g., ELMO [13], GPT [14], BERT [5], RoBERTa [10], XLNet [3], have led to major advancements in NLP.…”
Section: Introduction (mentioning)
confidence: 99%
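
To illustrate the AS2 task as defined in the statement above, the following is a minimal inference sketch: a fine-tuned sequence-classification model scores each candidate sentence against the question, and candidates are ranked by the probability of the positive class. The model directory, function name, and the binary cross-encoder setup are illustrative assumptions, not details taken from the paper.

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

def rank_candidates(model_dir, question, candidates):
    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForSequenceClassification.from_pretrained(model_dir).eval()
    enc = tokenizer([question] * len(candidates), candidates,
                    truncation=True, padding=True, return_tensors="pt")
    with torch.no_grad():
        # Probability of the positive ("correct answer") class for each candidate.
        scores = torch.softmax(model(**enc).logits, dim=-1)[:, 1]
    order = scores.argsort(descending=True)
    return [(candidates[i], scores[i].item()) for i in order]

# ranked = rank_candidates("out/adapt", "Who wrote Hamlet?", retrieved_sentences)
# best_sentence, best_score = ranked[0]
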
“…However, more recent neural models, such as BERT-based models [Garg et al 2019; Raffel et al 2019], have achieved largely improved performance on the same tasks. It is worth exploring the integration of quantum models with the new BERT architecture [Devlin et al 2019] in the future.…”
Section: Quantum-inspired Neural Representation Models (mentioning)
confidence: 99%
“…Note that the current state-of-the-art BERT-based neural model for TREC-QA has achieved MAP and MRR of 0.943 and 0.974, respectively [Garg et al 2019]. The TANDA model mentioned above currently gives the best performance on the WikiQA dataset (MAP and MRR of 0.92 and 0.933 respectively, as compared to 0.695 and 0.71 by [Zhang et al 2018d]).…”
(mentioning)
confidence: 95%
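
The MAP and MRR figures quoted in these statements are standard ranking metrics for answer sentence selection. Below is a minimal sketch of how they can be computed, assuming each question's candidates are already sorted by model score and carry binary relevance labels; the function names and the example data are illustrative.

def average_precision(labels_in_rank_order):
    # Precision averaged over the positions of the correct answers.
    hits, precisions = 0, []
    for rank, relevant in enumerate(labels_in_rank_order, start=1):
        if relevant:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / max(hits, 1)

def reciprocal_rank(labels_in_rank_order):
    # 1 / rank of the first correct answer; 0 if no candidate is correct.
    for rank, relevant in enumerate(labels_in_rank_order, start=1):
        if relevant:
            return 1.0 / rank
    return 0.0

def map_mrr(per_question_labels):
    # Mean Average Precision and Mean Reciprocal Rank over all questions.
    aps = [average_precision(labels) for labels in per_question_labels]
    rrs = [reciprocal_rank(labels) for labels in per_question_labels]
    return sum(aps) / len(aps), sum(rrs) / len(rrs)

# Example: two questions whose candidates are already sorted by model score.
# map_score, mrr_score = map_mrr([[0, 1, 0, 1], [1, 0, 0]])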