Young-Kil Kim scite author profile

One of the main challenges in Spoken Language Understanding (SLU) is dealing with 'open-vocabulary' slots. Recently, SLU models based on neural network were proposed, but it is still difficult to recognize the slots of unknown words or 'open-vocabulary' slots because of the high cost of creating a manually tagged SLU dataset. This paper proposes data noising, which reflects the characteristics of the 'open-vocabulary' slots, for data augmentation. We applied it to an attention based bi-directional recurrent neural network (Liu and Lane, 2016) and experimented with three datasets: Airline Travel Information System (ATIS), Snips, and MIT-Restaurant. We achieved performance improvements of up to 0.57% and 3.25 in intent prediction (accuracy) and slot filling (f1-score), respectively. Our method is advantageous because it does not require additional memory and it can be applied simultaneously with the training process of the model.

show abstract

Phrase-Based Statistical Model for Korean Morpheme Segmentation and POS Tagging

Kim

2018

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

SUMMARY In this paper, we propose a novel phrase-based model for Korean morphological analysis by considering a phrase as the basic processing unit, which generalizes all the other existing processing units. The impetus for using phrases this way is largely motivated by the success of phrase-based statistical machine translation (SMT), which convincingly shows that the larger the processing unit, the better the performance. Experimental results using the SEJONG dataset show that the proposed phrasebased models outperform the morpheme-based models used as baselines. In particular, when combined with the conditional random field (CRF) model, our model leads to statistically significant improvements over the state-of-the-art CRF method.

show abstract

A chatbot for a dialogue-based second language learning system

Huang¹,

Lee²,

Kwon³

et al. 2017

View full text Add to dashboard Cite

This paper presents a chatbot for a Dialogue-Based Computer-Assisted second Language Learning (DB-CALL) system. A DB-CALL system normally leads dialogues by asking questions according to given scenarios. User utterances outside the scenarios are normally considered as semantically improper and simply rejected. In this paper, we assume that raising the freedom of dialogue can stimulate the user's interest in learning. For this, a chatbot based on a search engine with a dialogue corpus has been developed to deal with conversations out of the scenarios. We evaluate the chatbot separately in two different cases: as an independent bot and as an auxiliary system. The results showed that, unlike the independent chatbot system, the chatbot as an auxiliary system showed a much lower turn success ratio.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Young-Kil Kim

Characterization and temporal expression of a ω-6 fatty acid desaturase cDNA from sesame (Sesamum indicum L.) seeds

GenieTutor: A Computer-Assisted Second-Language Learning System Based on Spoken Language Understanding

Data Augmentation by Data Noising for Open-vocabulary Slots in Spoken Language Understanding

Phrase-Based Statistical Model for Korean Morpheme Segmentation and POS Tagging

A chatbot for a dialogue-based second language learning system

Contact Info

Product

Resources

About