A trajectory-based writing system refers to writing a linguistic character or word in free space by moving a finger, marker, or handheld device. It is widely applicable where traditional pen-up and pen-down writing systems are impractical. Owing to its simple writing style, it has a great advantage over gesture-based systems. However, recognition is challenging because of non-uniform characters and varying writing styles. In this research, we developed an air-writing recognition system using three-dimensional (3D) trajectories collected by a depth camera that tracks the fingertip. For better feature selection, nearest-neighbor and root-point translation were used to normalize the trajectories. We employed a long short-term memory (LSTM) network and a convolutional neural network (CNN) as recognizers. The model was tested and verified on a self-collected dataset. To evaluate the robustness of our model, we also employed the 6D motion gesture (6DMG) alphanumeric character dataset and achieved 99.32% accuracy, the highest reported to date. This verifies that the proposed model generalizes across both digits and characters. Moreover, we publish a dataset containing 21,000 digit trajectories, which addresses the lack of public datasets in this area of research.
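The root-point translation mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the paper's exact pipeline: the function name and the synthetic trajectory are assumptions, and the paper's nearest-neighbor step is omitted. The idea is simply to shift every fingertip sample so the first (root) point lands at the origin, making recognition invariant to where in the air the user started writing.

```python
import numpy as np

def root_point_translate(trajectory):
    """Translate a 3D fingertip trajectory so its first (root) point
    sits at the origin. trajectory: (N, 3) array of (x, y, z) samples.
    """
    trajectory = np.asarray(trajectory, dtype=float)
    return trajectory - trajectory[0]

# Example: a short synthetic fingertip trajectory (illustrative values).
traj = np.array([[10.0, 5.0, 2.0],
                 [11.0, 6.0, 2.5],
                 [12.0, 8.0, 3.0]])
normalized = root_point_translate(traj)
print(normalized[0])  # [0. 0. 0.]
```

After this step, two users writing the same digit in different regions of the camera's field of view produce identical trajectories, which simplifies the job of the downstream LSTM/CNN recognizer.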
The integral imaging microscopy system provides three-dimensional visualization of a microscopic object. However, it suffers from low resolution due to the fundamental F-number (aperture stop) limitation imposed by the micro lens array (MLA) and poor illumination conditions. In this paper, a generative adversarial network (GAN)-based super-resolution algorithm is proposed to enhance the resolution, with the directional view image fed directly as input. In a GAN, the generator regresses the high-resolution output from the low-resolution input image, whereas the discriminator distinguishes between original and generated images. In the generator, we use consecutive residual blocks with a content loss to retrieve the photo-realistic original image. The model can restore edges and enhance the resolution by factors of ×2, ×4, and even ×8 without seriously degrading image quality. It is tested with a variety of low-resolution microscopic sample images and successfully generates high-resolution directional view images with better illumination. Quantitative analysis shows that the proposed model outperforms existing algorithms on microscopic images.
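To make the ×2/×4/×8 scale factors concrete, the sketch below shows a naive nearest-neighbor upscaler, the baseline a trained GAN generator is meant to beat: it only repeats pixels, whereas the generator hallucinates plausible high-frequency detail. The function name and toy image are assumptions for illustration; this is not the paper's network.

```python
import numpy as np

def nearest_neighbor_upscale(image, factor):
    """Upscale a 2D image by an integer factor by repeating each pixel
    into a factor x factor block (np.kron with a block of ones)."""
    return np.kron(image, np.ones((factor, factor), dtype=image.dtype))

# A 4x4 toy "low-resolution" image.
low_res = np.arange(16, dtype=float).reshape(4, 4)
for factor in (2, 4, 8):
    high_res = nearest_neighbor_upscale(low_res, factor)
    print(factor, high_res.shape)  # ×2 -> (8, 8), ×4 -> (16, 16), ×8 -> (32, 32)
```

A super-resolution GAN learns the same low-resolution-to-high-resolution mapping, but replaces pixel repetition with learned reconstruction guided by the content loss, so edges stay sharp at ×8 instead of turning into blocks.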
This paper presents a convolutional neural network (CNN) architecture to recognize Bangla Sign Language (BdSL) characters. The proposed architecture also translates the recognized signs into the corresponding textual Bangla characters and provides real-time prediction. Automatic recognition and translation of sign language makes communication between deaf or mute people and hearing people more natural. The recognition structure is validated on a dataset of 4,600 hand-sign images collected from volunteers using an HD webcam. The proposed pipeline also supports extending the dataset by capturing new images from different people in real time. The proposed model recognizes all 36 letters and 10 digits of BdSL with significant accuracy. A state-of-the-art result, with a validation accuracy of 99.57% and a validation loss of 0.56%, has been achieved in the recognition and translation of BdSL characters.
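The core operation a sign-recognition CNN stacks many times is a 2D convolution followed by a nonlinearity. The sketch below shows that single building block on a tiny synthetic image; it is a generic illustration of how a CNN extracts edge features from hand images, not the paper's architecture, and the edge-detector kernel is an assumed example.

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Single-channel 'valid' 2D convolution (cross-correlation, as in
    most deep-learning frameworks): slide the kernel over the image and
    take the elementwise-product sum at each position."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))
    for r in range(out.shape[0]):
        for c in range(out.shape[1]):
            out[r, c] = np.sum(image[r:r + kh, c:c + kw] * kernel)
    return out

# Tiny synthetic image: dark left half, bright right half.
image = np.array([[0, 0, 1, 1],
                  [0, 0, 1, 1],
                  [0, 0, 1, 1],
                  [0, 0, 1, 1]], dtype=float)
# Vertical-edge detector: responds where brightness increases left-to-right.
kernel = np.array([[-1, 1],
                   [-1, 1]], dtype=float)
features = np.maximum(conv2d_valid(image, kernel), 0)  # ReLU activation
print(features.shape)  # (3, 3)
```

In the full model, many such filters are learned from the 4,600 training images, and the resulting feature maps are pooled and fed to a final classification layer over the 46 BdSL classes (36 letters plus 10 digits).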