Sang‐Ki Ko scite author profile

We propose a sign language translation system based on human keypoint estimation. It is well-known that many problems in the field of computer vision require a massive amount of dataset to train deep neural network models. The situation is even worse when it comes to the sign language translation problem as it is far more difficult to collect high-quality training data. In this paper, we introduce the KETI (short for Korea Electronics Technology Institute) sign language dataset which consists of 14,672 videos of high resolution and quality. Considering the fact that each country has a different and unique sign language, the KETI sign language dataset can be the starting line for further research on the Korean sign language translation. Using the KETI sign language dataset, we develop a neural network model for translating sign videos into natural language sentences by utilizing the human keypoints extracted from a face, hands, and body parts. The obtained human keypoint vector is normalized by the mean and standard deviation of the keypoints and used as input to our translation model based on the sequence-to-sequence architecture. As a result, we show that our approach is robust even when the size of the training data is not sufficient. Our translation model achieves 93.28% (55.28%, respectively) translation accuracy on the validation set (test set, respectively) for 105 sentences that can be used in emergency situations. We compare several types of our neural sign translation models based on different attention mechanisms in terms of classical metrics for measuring the translation performance.

show abstract

A movie recommendation algorithm based on genre correlations

Choi

Han

2012

Expert Systems with Applications

129

View full text Add to dashboard Cite

SoftRegex: Generating Regex from Natural Language Descriptions using Softened Regex Equivalence

Park¹,

Ko²,

Cognetta³

et al. 2019

View full text Add to dashboard Cite

We continue the study of generating semantically correct regular expressions from natural language descriptions (NL). The current stateof-the-art model, SemRegex, produces regular expressions from NLs by rewarding the reinforced learning based on the semantic (rather than syntactic) equivalence between two regular expressions. Since the regular expression equivalence problem is PSPACE-complete, we introduce the EQ Reg model for computing the similarity of two regular expressions using deep neural networks. Our EQ Reg model essentially softens the equivalence of two regular expressions when used as a reward function. We then propose a new regex generation model, SoftRegex, using the EQ Reg model, and empirically demonstrate that SoftRegex substantially reduces the training time (by a factor of at least 3.6) and produces state-ofthe-art results on three benchmark datasets.

show abstract

Pseudo-inversion on Formal Languages

Cho

Han

Kang

et al. 2014

View full text Add to dashboard Cite

Sign language recognition with recurrent neural network using human keypoint detection

Son

Jung

2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sang‐Ki Ko

Neural Sign Language Translation Based on Human Keypoint Estimation

A movie recommendation algorithm based on genre correlations

SoftRegex: Generating Regex from Natural Language Descriptions using Softened Regex Equivalence

Pseudo-inversion on Formal Languages

Sign language recognition with recurrent neural network using human keypoint detection

Contact Info

Product

Resources

About