“…Rather than learning representations of the question and the candidate answer separately, researchers have recently introduced various attention mechanisms into the answer selection task (Bian, Li, Yang, Chen, & Lin, 2017; Deng et al., 2019; Kim, Kang, & Kwak, 2019; Shen et al., 2018; Tay, Tuan, & Hui, 2018b), which focus better on the relevant parts of the input QA pairs. To enrich feature representations, researchers have integrated knowledge bases into neural networks, capturing more relevant information to improve performance (Deng et al., 2018; Guo et al., 2017; Shijia, Xu, & Xiang, 2018; F. Wang, Wu, Li, & Zhou, 2017; J. Wang, Wang, Zhang, & Yan, 2017; Zhu, Cheng, & Su, 2020). More recently, a new paradigm has emerged: achieving better performance with large pre-trained models (e.g., ELMo, BERT) (Li, Yu, Chen, & Li, 2019; Mozafari, Fatemi, & Nematbakhsh, 2019).…”
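To make the contrast concrete, the core idea shared by the attention-based approaches cited above is that answer representations are conditioned on the question (and vice versa) rather than encoded in isolation. The following is a minimal NumPy sketch of generic cross-attention between a question and a candidate answer; it illustrates the general mechanism only, not any specific cited model, and all names and dimensions are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(question, answer):
    """Let each answer token attend over all question tokens.

    question: (m, d) token embeddings; answer: (n, d) token embeddings.
    Returns (n, d) question-aware answer representations.
    """
    d = question.shape[1]
    scores = answer @ question.T / np.sqrt(d)   # (n, m) similarity scores
    weights = softmax(scores, axis=1)           # attention over question tokens
    return weights @ question                   # weighted sum of question embeddings

rng = np.random.default_rng(0)
q = rng.standard_normal((5, 8))   # 5 question tokens, embedding dim 8
a = rng.standard_normal((7, 8))   # 7 answer tokens, same dim
out = cross_attention(q, a)
print(out.shape)  # (7, 8): one question-aware vector per answer token
```

In practice, the resulting question-aware answer vectors (and, symmetrically, answer-aware question vectors) are pooled and scored to rank candidate answers.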