Proceedings of the Web Conference 2020
DOI: 10.1145/3366423.3380270

Asking Questions the Human Way: Scalable Question-Answer Generation from Text Corpus

Abstract: The ability to ask questions is important in both human and machine intelligence. Learning to ask questions helps knowledge acquisition, improves question-answering and machine reading comprehension tasks, and helps a chatbot to keep the conversation flowing with a human. Existing question generation models are ineffective at generating a large amount of high-quality question-answer pairs from unstructured text, since given an answer and an input passage, question generation is inherently a one-to-many mapping…


Cited by 61 publications (31 citation statements)
References 57 publications (130 reference statements)
“…As a dual task of question answering, QG can be used to improve QA performance. Some works [7,11,26,46,58] use QG as a generator to harvest question-answer pairs from passages and use the harvested data to pre-train QA models, which subsequently improves QA model effectiveness. QG is also widely used in IR tasks, such as improving search system effectiveness by generating clarifying questions [57] or generating questions from e-commerce customer reviews [55].…”
Section: Question Generation
Mentioning confidence: 99%
“…We used heuristics to filter out low-quality generated QA pairs: dropping questions longer than 20 words or shorter than 5 words and answers longer than 10 words, keeping only questions that contain at least one interrogative word, and removing questions with n-gram repetition. While some existing works used a BERT QA model or an entailment model as a data filter (Zhang and Bansal, 2019; Liu et al., 2020), our heuristics are sufficient to obtain improvements on the downstream QA task, as shown in §4.6. Some samples from our datasets are given in Table 3, showing that diverse QA pairs are generated.…”
Mentioning confidence: 99%
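The excerpt above states the filtering thresholds precisely, so a short sketch can make the procedure concrete. The Python below is a minimal illustration assuming simple word-level tokenization, a fixed interrogative-word list, and n=3 for the repetition check; the function and variable names are placeholders for illustration, not the cited authors' code.

import re
from collections import Counter

INTERROGATIVES = {"what", "who", "whom", "whose", "when", "where",
                  "why", "which", "how"}

def has_ngram_repetition(tokens, n=3):
    # True if any n-gram occurs more than once in the token list
    # (n=3 is an assumption; the excerpt does not specify n).
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return any(count > 1 for count in Counter(ngrams).values())

def keep_qa_pair(question, answer, min_q_len=5, max_q_len=20, max_a_len=10):
    # Length, interrogative-word, and repetition heuristics from the excerpt above.
    q_tokens = re.findall(r"\w+", question.lower())
    a_tokens = re.findall(r"\w+", answer.lower())
    if not (min_q_len <= len(q_tokens) <= max_q_len):
        return False  # question too short or too long
    if len(a_tokens) > max_a_len:
        return False  # answer too long
    if not INTERROGATIVES & set(q_tokens):
        return False  # no interrogative word in the question
    if has_ngram_repetition(q_tokens):
        return False  # repeated n-gram in the question
    return True

# Example: filter a list of generated (question, answer) pairs.
pairs = [("What was the original host city of the Web Conference 2020?", "Taipei"),
         ("What is the name of the name of the host city?", "Taipei")]
filtered = [p for p in pairs if keep_qa_pair(*p)]  # only the first pair survives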
“…We also chose 100 samples from the SQuAD Du test set. In addition to the three items proposed by Liu et al. (2020), we asked annotators whether a given answer is important, i.e., worth being asked about. We showed the workers a (passage, question, answer) triple and asked them to answer the four questions shown in Table 4.…”
Section: Human Evaluation
Mentioning confidence: 99%
“…As in real-world QA, most existing works use an encoder-decoder framework that takes as input a question and a sequence of words from a given context and then generates the answer words [55]-[57]. Methods that generate answers as well as questions have also been proposed to augment question-answer pairs and improve QA performance [58], [59]. However, these works mainly focus on a single domain for answer generation.…”
Section: Natural Answer Generation
Mentioning confidence: 99%
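To make the encoder-decoder formulation concrete, here is a minimal Python sketch using the Hugging Face transformers library. The checkpoint name (google/flan-t5-small) and the "question: ... context: ..." prompt format are illustrative assumptions; the cited works train their own encoder-decoder models rather than relying on an off-the-shelf checkpoint.

# A minimal sketch: the encoder reads the question together with the context,
# and the decoder generates the answer tokens.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "google/flan-t5-small"  # placeholder choice of seq2seq model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

context = ("The Web Conference 2020 was held online in April 2020 "
           "and was originally planned to take place in Taipei.")
question = "Where was The Web Conference 2020 originally planned to take place?"

inputs = tokenizer(f"question: {question} context: {context}",
                   return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))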