Son T. Luu scite author profile

Although Vietnamese is the 17 th most popular native-speaker language a in the world, there are not many research studies on Vietnamese machine reading comprehension (MRC), the task of understanding a text and answering questions about it. One of the reasons is because of the lack of high-quality benchmark datasets for this task. In this work, we construct a dataset which consists of 2,783 pairs of multiple-choice questions and answers based on 417 Vietnamese texts which are commonly used for teaching reading comprehension for elementary school pupils. In addition, we propose a lexicalbased MRC method that utilizes semantic similarity measures and external knowledge sources to analyze questions and extract answers from the given text. We compare the performance of the proposed model with several baseline lexical-based and neural network-based models. Our proposed method achieves 61.81% by accuracy, which is 5.51% higher than the best baseline model. We also measure human performance on our dataset and find that there is a big gap between machine-model and human performances. This indicates that significant progress can be made on this task. The dataset is freely available on our website b for research purposes.

show abstract

Measure of the Content Creation Score on Social Network Using Sentiment Score and Passion Point

Nguyen

Huynh

Luu

et al. 2020

View full text Add to dashboard Cite

Social network is one of efficient tools for spreading information. The evaluation of the content creation of a user is a useful feature to improve the ability of information propagation on social network. In this paper, the measures for evaluating the user’s content creation are proposed. They include the passion point of a user with a brand and the quality of the user’s posts. The passion point is computed based on the sentiment score of posting and the activity of the user. The quality of the user’s posts is computed through the analyzing of the post’s content. Those measures are combined to analyze the interesting of posts. The proposed method has been tested and get the positive experimental results.

show abstract

An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension

Luu

Nguyen

et al. 2021

View full text Add to dashboard Cite

Multi-Level Sentiment Analysis of Product Reviews Based on Grammar Rules

Nguyen

Tran

et al. 2021

View full text Add to dashboard Cite

Vietnamese is a tonal and isolated language. Its highly ambiguity makes the designing of methods for sentiment analysis being difficult. For getting the most effectiveness, the designed method has to analyze sentiment of sentences based on combining the grammar and syllable structures of Vietnamese. In this paper, a method to build a Vietnamese dataset of product reviews with many sentiment levels, including very negative, negative, neutral, positive and very positive, is proposed. This method can be scaled to a large dataset using for analyzing sentiment of product reviews. Moreover, a solution to add more grammar rules of Vietnamese into the pre-processing of sentiment analysis is also constructed. Those rules simulate the sentiment recognition of humans and help to increase the accuracy of sentiment determination. The combination of grammar rules and some methods for sentiment analysis are experimented on the Vietnamese dataset of product reviews to classify them into sentiment-levels. The testing results show that their accuracy and F-measure are improved and suitable to apply in the practical business analyzing of customer behaviors.

show abstract

Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts

Luu¹,

Bui²,

Nguyen³

et al. 2021

View full text Add to dashboard Cite

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Son T. Luu

A Large-Scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts

VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension

Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

Enhancing Lexical-Based Approach With External Knowledge for Vietnamese Multiple-Choice Machine Reading Comprehension

Measure of the Content Creation Score on Social Network Using Sentiment Score and Passion Point

An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension

Multi-Level Sentiment Analysis of Product Reviews Based on Grammar Rules

Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts

Contact Info

Product

Resources

About