2017 Brazilian Conference on Intelligent Systems (BRACIS) 2017
DOI: 10.1109/bracis.2017.12
|View full text |Cite
|
Sign up to set email alerts
|

MilkQA: A Dataset of Consumer Questions for the Task of Answer Selection

Abstract: Abstract-We introduce MilkQA, a question answering dataset from the dairy domain dedicated to the study of consumer questions. The dataset contains 2,657 pairs of questions and answers, written in the Portuguese language and originally collected by the Brazilian Agricultural Research Corporation (Embrapa). All questions were motivated by real situations and written by thousands of authors with very different backgrounds and levels of literacy, while answers were elaborated by specialists from Embrapa's custome… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 9 publications
0
3
0
1
Order By: Relevance
“…The MilkQA dataset [ 57 ] is a closed-domain dataset prepared for the Portuguese language. The questions are about dairy.…”
Section: Related Workmentioning
confidence: 99%
“…The MilkQA dataset [ 57 ] is a closed-domain dataset prepared for the Portuguese language. The questions are about dairy.…”
Section: Related Workmentioning
confidence: 99%
“…We mention, for instance, versions of SQuAD 3 and GLUE 4 in Portuguese. The two exceptions are the ENEM-Challeng [36] and MilkQA [7]. The ENEM-Challenge is based on ENEM, the entrance examination valid for almost all universities in Brazil.…”
Section: Existing Resources For Portuguesementioning
confidence: 99%
“…In our dataset, however, the rewriting process was executed by a different individual, bringing an additional factor of diversity to the generation of QA sets. 7 Some examples of paraphrases are shown in Table 4.…”
Section: Assessmentmentioning
confidence: 99%
“…No que concerne especificamente ao português, e tanto quanto conhecemos, o mais próximo com uma coleção de FAQ e respetivas respostas será a coleção MilkQA (Criscuolo et al, 2017), que inclui perguntas colocadas de forma mais densa, no domínio dos laticínios, seguidas pelas suas respostas. A disponibilização do corpo AIA-BDE é mais uma contribuição neste sentido, já que inclui FAQ em português, no domínio da administração pública, e está preparada para avaliar um conjunto de tarefas relevantes para sistemas de IR, RAP, e mesmo diálogo, com foco na associação de interações em linguagem natural com perguntas conhecidas.…”
Section: Trabalho Relacionadounclassified