Macaranga monandra

In pattern recognition, the handwritten character recognition (HCR) is considered as the classical challenge. Due to the unavailability of the dataset for different languages, it is complex to train the recognition system. In particular, the benchmark dataset for HCR in the Gujarati language is limited. To overcome this challenge, a proper dataset is required for experimentation. Hence, this work introduces dataset generation for the Gujarati language using pre-processing and classification techniques. Initially, the handwritten data is collected from various native Gujarati writers. In this work, there are three processes carried out to generate the dataset. They are pre-processing, segmentation and classification. Initially, the pre-processing stages like a selection of image, noise removal, normalization, conversion of integer value to double, grayscale image into a binary image, dimensionality reduction, and vector conversation are performed. Then, the pre-processed image is segmented using line segmentation, character segmentation and word segmentation. Then, for testing and training, the data are transformed into CSV file format (for converting the information to numbers). Finally, the data are classified using a Convolutional neural network (CNN). The kappa and FPR values achived by the CNN are 0.981 and0.189.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sanket B. Suthar

Smart Water Hardness Monitoring System

A Concise Review on Automatic Text Summarization

Performance Scrutiny of Thinning Algorithms on Printed Gujarati Characters and Handwritten Numerals

Segmentation of Gujarati Handwritten Characters and Numerals with and Without Modifiers from the Scanned Document

Dataset Generation for Gujarati Language Using Handwritten Character Images

Contact Info

Product

Resources

About