2020
DOI: 10.1109/access.2020.3035701

Enhancing Lexical-Based Approach With External Knowledge for Vietnamese Multiple-Choice Machine Reading Comprehension

Abstract: Although Vietnamese is the 17th most popular native-speaker language in the world, there are not many research studies on Vietnamese machine reading comprehension (MRC), the task of understanding a text and answering questions about it. One reason is the lack of high-quality benchmark datasets for this task. In this work, we construct a dataset which consists of 2,783 pairs of multiple-choice questions and answers based on 417 Vietnamese texts which are commonly used for teaching reading …

Cited by 27 publications (16 citation statements)
References 56 publications

“…The MRC task is a popular natural language processing task with several studies in recent years [1], [10]-[12]. Among these, some studies focus on generalization capability and transfer learning using supervised or unsupervised learning approaches across question answering (QA) or MRC models.…”
Section: Related Work (mentioning)
confidence: 99%
“…For the Vietnamese language, the UIT-ViQuAD [10] (Wikipedia domain) and ViNewQA [15] (Health news domain) are two extractive MRC corpora for machine reading comprehension. Besides, the ViMMRC [9] is the multiple-choice reading comprehension corpus on the Vietnamese students' textbook for primary schools domain.…”
Section: Related Work (mentioning)
confidence: 99%
“…Jing et al (2019) crowdsourced parallel paragraphs from novels in Chinese and English. A few datasets investigated multiple-choice school QA (Hardalov et al, 2019; Van Nguyena et al, 2020), albeit in a limited domain, and for lower school grades (1st-5th). Other efforts focused on building bi-lingual datasets that are similar in spirit to SQuAD (Rajpurkar et al, 2016) - extractive reading comprehension over open-domain articles.…”
Section: Related Work (mentioning)
confidence: 99%