Indian Sign Language (ISL) serves as a vital means of communication for the hearing-impaired community in India, and accurate recognition of ISL gestures through computer vision is of paramount importance for enhancing accessibility and inclusivity. This research therefore focuses on translating sign language gestures used by the hearing-impaired community into formats understandable by the general population, in order to bridge the communication gap between the two communities. This requires a continuous sign language recognition module, which is challenging to design because the grammar of sign language differs from that of spoken language: continuous ISL must first be converted into glosses, and these glosses are then used to generate the spoken language. The literature shows that sign language translators have been built for American, Chinese, and Argentinian Sign Languages, but very little work has been done on ISL, and many existing ISL translators are built either on static data or on a very small number of video gestures [20]. In this work, we propose a combinational network that converts ISL directly to speech over 76 video gestures. The proposed network uses pre-trained architectures such as ResNet18, ResNet50, GoogLeNet, and InceptionV3 to efficiently extract spatial features from video frames; these features are then processed by a two-layer bidirectional Long Short-Term Memory (BiLSTM) network to model the temporal dependencies between the frames of a gesture. BiLSTM models are used because, compared to conventional RNNs, they better capture the longer temporal dependencies across frames. To validate the proposed approach, a standard balanced database of 76 gestures covering letters, words, and phrases, with each gesture enacted five times by each of 10 individuals, was created in the Anechoic Chamber lab at JNTUHCEH using a Sony HXR-NX100 camera sponsored under a UGC-MRP grant. We explored various combinations of pre-trained networks and BiLSTM layers on this database to strike a balance between computational resources and accuracy, achieving high gesture-classification accuracy while minimizing training time and memory usage. GoogLeNet combined with the BiLSTM gave the best results, with an average test accuracy of 94.21%, compared to the other combinational networks.
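To make the combinational architecture concrete, the following is a minimal PyTorch sketch, not the authors' released implementation: a pre-trained GoogLeNet backbone extracts a 1024-dimensional feature vector per frame, and a two-layer BiLSTM models the temporal dependencies across the frames of a clip. The hidden size, frame count, and input resolution are illustrative assumptions; only the 76-class output follows from the gesture vocabulary described above.

```python
import torch
import torch.nn as nn
from torchvision import models

class CNNBiLSTM(nn.Module):
    """Pre-trained CNN per frame + two-layer BiLSTM over the frame sequence."""

    def __init__(self, num_classes=76, hidden_size=256):  # hidden_size is an assumption
        super().__init__()
        # Pre-trained GoogLeNet; replacing the classifier head with Identity
        # makes the backbone emit its 1024-d pooled feature for each frame.
        backbone = models.googlenet(weights=models.GoogLeNet_Weights.DEFAULT)
        backbone.fc = nn.Identity()
        self.backbone = backbone
        # Two stacked bidirectional LSTM layers over the per-frame features.
        self.bilstm = nn.LSTM(
            input_size=1024,
            hidden_size=hidden_size,
            num_layers=2,
            batch_first=True,
            bidirectional=True,
        )
        # 2 * hidden_size: forward and backward states are concatenated.
        self.classifier = nn.Linear(2 * hidden_size, num_classes)

    def forward(self, clips):
        # clips: (batch, frames, channels, height, width)
        b, t, c, h, w = clips.shape
        feats = self.backbone(clips.view(b * t, c, h, w))  # (b*t, 1024)
        feats = feats.view(b, t, -1)                       # (b, t, 1024)
        out, _ = self.bilstm(feats)                        # (b, t, 2*hidden)
        # Classify from the final time step's concatenated features.
        return self.classifier(out[:, -1, :])

model = CNNBiLSTM()
dummy = torch.randn(2, 16, 3, 224, 224)  # 2 clips of 16 RGB frames each
logits = model(dummy)                    # shape: (2, 76)
```

In practice the backbone would typically be frozen for the first epochs and fine-tuned afterwards, and mean-pooling the BiLSTM outputs over time is a common alternative to taking only the last step; either choice fits the combinational design described above.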