2013
DOI: 10.1007/978-3-642-40802-1_29
QA4MRE 2011-2013: Overview of Question Answering for Machine Reading Evaluation

Abstract: This paper describes the methodology for testing the performance of Machine Reading systems through Question Answering and Reading Comprehension Tests. This was the aim of the QA4MRE challenge, which was run as a Lab at CLEF 2011-2013. The traditional QA task was replaced by a new Machine Reading task, whose intention was to ask questions that required a deep knowledge of individual short texts and in which systems were required to choose one answer, by analysing the corresponding test document in conjunction …
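For context on how such multiple-choice runs are scored, the following is a minimal sketch of the c@1 measure reported in the QA4MRE overviews, which credits a system for leaving a question unanswered rather than answering it incorrectly. The list-based input layout and the use of None to mark an unanswered question are illustrative assumptions, not the official submission format.

```python
from typing import Optional, Sequence


def c_at_1(gold: Sequence[str], predicted: Sequence[Optional[str]]) -> float:
    """c@1 = (n_correct + n_unanswered * n_correct / n) / n.

    `predicted[i] is None` marks a question the system chose not to answer.
    The gold/predicted layout here is illustrative, not an official format.
    """
    n = len(gold)
    if n == 0:
        return 0.0
    n_correct = sum(1 for g, p in zip(gold, predicted) if p is not None and p == g)
    n_unanswered = sum(1 for p in predicted if p is None)
    return (n_correct + n_unanswered * (n_correct / n)) / n


# Example: 3 questions answered correctly, 1 answered wrongly, 1 left unanswered.
print(c_at_1(["b", "a", "d", "c", "a"], ["b", "a", "d", "a", None]))  # 0.72
```

Under this measure, withholding an answer (0.72 above) scores better than guessing it wrong (which would give plain accuracy 0.6), which is the behaviour the task was designed to reward.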

Cited by 39 publications (26 citation statements). References 8 publications.
“…Inherently, answers cannot be derived from external resources for this type of question. Although such questions may be reduced into textual entailment between a given text and choices [Kasahara et al. 2010; Peñas et al. 2011], these cases were excluded from the present study.…”
Section: Reading Comprehensionmentioning
confidence: 99%
“…Third, the performance of entailment recognition can be mapped to the test scores, which are comparable with human performance for the same task. Dagan et al [2006], Kasahara et al [2010], and Peñas et al [2011] investigated the concept of using reading comprehension tests for the evaluation of textual entailment recognition. They considered reading comprehension tests to be intended to test human ability to compute entailment relations, and they considered such tests to be applicable to the evaluation of automatic systems.…”
Section: Textual Entailment Recognitionmentioning
confidence: 99%
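To make this reduction concrete, below is a minimal sketch of answering a multiple-choice reading comprehension question via textual entailment: each candidate answer is turned into a hypothesis and scored against the passage, and the best-entailed candidate is selected. The entailment_score callable is a hypothetical stand-in for whatever RTE component a system provides; its name, signature, and the naive hypothesis construction are assumptions for illustration, not part of the cited work.

```python
from typing import Callable, Sequence


def answer_by_entailment(
    passage: str,
    question: str,
    candidates: Sequence[str],
    entailment_score: Callable[[str, str], float],
) -> int:
    """Return the index of the candidate whose hypothesis is best entailed.

    `entailment_score(text, hypothesis)` stands in for any RTE model that
    returns a higher value when `text` entails `hypothesis`.
    """
    # Turn each (question, candidate) pair into a rough hypothesis string.
    # A real system would perform proper question-to-statement rewriting.
    hypotheses = [f"{question} {candidate}" for candidate in candidates]
    scores = [entailment_score(passage, h) for h in hypotheses]
    return max(range(len(candidates)), key=scores.__getitem__)


# Toy usage with a trivial lexical-overlap "entailment" score.
def overlap_score(text: str, hypothesis: str) -> float:
    t, h = set(text.lower().split()), set(hypothesis.lower().split())
    return len(t & h) / max(len(h), 1)


passage = "The QA4MRE lab ran at CLEF from 2011 to 2013."
question = "Where did the QA4MRE lab run?"
print(answer_by_entailment(passage, question, ["at TREC", "at CLEF"], overlap_score))  # 1
```

The point of the sketch is only the mapping itself: once each choice is phrased as a hypothesis, the multiple-choice test score becomes a direct measure of how well the entailment component works.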
“…These clues may include Tang poems and Song iambic verse, domain-specific expressions, and even mixtures of modern Chinese with excerpts from ancient books, etc. The dependence on background knowledge makes models designed for reading comprehension, such as (Peñas et al., 2013; Richardson et al., 2013), fail. Thirdly, the diversity of candidates' granularity, i.e.…”
Section: Introductionmentioning
confidence: 99%
“…The quality of such systems is evaluated on test data developed by a community of experts and presented as a text fragment, a question about that text, and an expected answer. If, for a given query, the system returns a result that does not match the expected answer, the result is not counted. In 2012, a competition on the quality of question answering systems in the biomedical domain was held; the best result (with 55% accuracy) was achieved by the system described in [2]. No domestic systems currently exist that can compete with foreign systems in terms of quality.…”
unclassified