Zae Myung Kim scite author profile

Zae Myung Kim

14Publications

15Citation Statements Received

172Citation Statements Given

How they've been cited

How they cite others

182

172

Affiliations

Korea Advanced Institute of Science and Technology

Publications

Order By: Most citations

Modeling long-term human activeness using recurrent neural networks for biometric data

Kim

et al. 2017

BMC Med Inform Decis Mak

View full text Add to dashboard Cite

BackgroundWith the invention of fitness trackers, it has been possible to continuously monitor a user’s biometric data such as heart rates, number of footsteps taken, and amount of calories burned. This paper names the time series of these three types of biometric data, the user’s “activeness”, and investigates the feasibility in modeling and predicting the long-term activeness of the user.MethodsThe dataset used in this study consisted of several months of biometric time-series data gathered by seven users independently. Four recurrent neural network (RNN) architectures–as well as a deep neural network and a simple regression model–were proposed to investigate the performance on predicting the activeness of the user under various length-related hyper-parameter settings. In addition, the learned model was tested to predict the time period when the user’s activeness falls below a certain threshold.ResultsA preliminary experimental result shows that each type of activeness data exhibited a short-term autocorrelation; and among the three types of data, the consumed calories and the number of footsteps were positively correlated, while the heart rate data showed almost no correlation with neither of them. It is probably due to this characteristic of the dataset that although the RNN models produced the best results on modeling the user’s activeness, the difference was marginal; and other baseline models, especially the linear regression model, performed quite admirably as well. Further experimental results show that it is feasible to predict a user’s future activeness with precision, for example, a trained RNN model could predict–with the precision of 84%–when the user would be less active within the next hour given the latest 15 min of his activeness data.ConclusionsThis paper defines and investigates the notion of a user’s “activeness”, and shows that forecasting the long-term activeness of the user is indeed possible. Such information can be utilized by a health-related application to proactively recommend suitable events or services to the user.

show abstract

Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?

Kim¹,

Besacier²,

Nikoulina³

et al. 2021

View full text Add to dashboard Cite

Recent studies on the analysis of the multilingual representations focus on identifying whether there is an emergence of languageindependent representations, or whether a multilingual model partitions its weights among different languages. While most of such work has been conducted in a "black-box" manner, this paper aims to analyze individual components of a multilingual neural translation (NMT) model. In particular, we look at the encoder self-attention and encoder-decoder attention heads (in a many-to-one NMT model) that are more specific to the translation of a certain language pair than others by (1) employing metrics that quantify some aspects of the attention weights such as "variance" or "confidence", and (2) systematically ranking the importance of attention heads with respect to translation quality. Experimental results show that surprisingly, the set of most important attention heads are very similar across the language pairs and that it is possible to remove nearly one-third of the less important heads without hurting the translation quality greatly.

show abstract

Temporal Information Extraction from Korean Texts

Jeong¹,

Kim²,

Do³

et al. 2015

View full text Add to dashboard Cite

As documents tend to contain temporal information, extracting such information is attracting much research interests recently. In this paper, we propose a hybrid method that combines machine-learning models and hand-crafted rules for the task of extracting temporal information from unstructured Korean texts. We address Korean-specific research issues and propose a new probabilistic model to generate complementary features. The performance of our approach is demonstrated by experiments on the TempEval-2 dataset, and the Korean TimeBank dataset which we built for this study.

show abstract

A Multilingual Neural Machine Translation Model for Biomedical Data

Bérard¹,

Kim²,

Nikoulina³

et al. 2020

Preprint

View full text Add to dashboard Cite

We release a multilingual neural machine translation model, which can be used to translate text in the biomedical domain. The model can translate from 5 languages (French, German, Italian, Korean and Spanish) into English. It is trained with large amounts of generic and biomedical data, using domain tags. Our benchmarks show that it performs near stateof-the-art both on news (generic domain) and biomedical test sets, and that it outperforms the existing publicly released models. We believe that this release will help the large-scale multilingual analysis of the digital content of the COVID-19 crisis and of its effects on society, economy, and healthcare policies. We also release a test set of biomedical text for Korean-English. It consists of 758 sentences from official guidelines and recent papers, all about COVID-19.

show abstract

Investigating the Impact of Possession-Way of a Smartphone on Action Recognition

Kim

Jeong

et al. 2016

Sensors

View full text Add to dashboard Cite

For the past few decades, action recognition has been attracting many researchers due to its wide use in a variety of applications. Especially with the increasing number of smartphone users, many studies have been conducted using sensors within a smartphone. However, a lot of these studies assume that the users carry the device in specific ways such as by hand, in a pocket, in a bag, etc. This paper investigates the impact of providing an action recognition system with the information of the possession-way of a smartphone, and vice versa. The experimental dataset consists of five possession-ways (hand, backpack, upper-pocket, lower-pocket, and shoulder-bag) and two actions (walking and running) gathered by seven users separately. Various machine learning models including recurrent neural network architectures are employed to explore the relationship between the action recognition and the possession-way recognition. The experimental results show that the assumption of possession-ways of smartphones do affect the performance of action recognition, and vice versa. The results also reveal that a good performance is achieved when both actions and possession-ways are recognized simultaneously.

show abstract

A Multilingual Neural Machine Translation Model for Biomedical Data

Bérard¹,

Kim²,

Nikoulina³

et al. 2020

View full text Add to dashboard Cite

We release a multilingual neural machine translation model, which can be used to translate text in the biomedical domain. The model can translate from 5 languages (French, German, Italian, Korean and Spanish) into English. It is trained with large amounts of generic and biomedical data, using domain tags. Our benchmarks show that it performs near stateof-the-art both on news (generic domain) and biomedical test sets, and that it outperforms the existing publicly released models. We believe that this release will help the large-scale multilingual analysis of the digital content of the COVID-19 crisis and of its effects on society, economy, and healthcare policies.We also release a test set of biomedical text for Korean-English. It consists of 758 sentences from official guidelines and recent papers, all about COVID-19.

show abstract

An adaptive vocabulary learning application through modeling learner's linguistic proficiency and interests

Kim

et al. 2017

View full text Add to dashboard Cite

Visualizing Cross‐Lingual Discourse Relations in Multilingual TED Corpora

Kim¹,

Nikoulina²,

Kang³

et al. 2021

View full text Add to dashboard Cite

This paper presents an interactive data dashboard that provides users with an overview of the preservation of discourse relations among 28 language pairs. We display a graph network depicting the cross-lingual discourse relations between a pair of languages for multilingual TED talks and provide a search function to look for sentences with specific keywords or relation types, facilitating ease of analysis on the cross-lingual discourse relations.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zae Myung Kim

Modeling long-term human activeness using recurrent neural networks for biometric data

Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?

Temporal Information Extraction from Korean Texts

A Multilingual Neural Machine Translation Model for Biomedical Data

Investigating the Impact of Possession-Way of a Smartphone on Action Recognition

A Multilingual Neural Machine Translation Model for Biomedical Data

An adaptive vocabulary learning application through modeling learner's linguistic proficiency and interests

Visualizing Cross‐Lingual Discourse Relations in Multilingual TED Corpora

Contact Info

Product

Resources

About