2019
DOI: 10.36548/jiip.2019.1.004

A Smart Image Processing Algorithm for Text Recognition, Information Extraction and Vocalization for the Visually Challenged

Abstract: This paper proposes a smart algorithm for image processing by means of recognition of text, extraction of information and vocalization for the visually challenged. The system uses LattePanda Alpha system on board that processes the scanned images. The image is categorized into its equivalent alphanumeric characters following pre-processing, segmentation, extraction of features and post-processing of the scanned or image based information. Further, a text to speech synthesizer is used for vocalization processed…

Citations: cited by 85 publications (17 citation statements)
References: 16 publications
“…NLP module of the general TTS synthesizer consists of the Pre-processor, text analyzer, and contextual analyzer (2). The components are automatic phonetization, text analysis, and prosody generation (3,4). There are several factors which is affected NLP and the final output of digital signal processing work like, quality of the microphone, environmental echo, noise, and sampling frequency.…”
Section: Related Work (mentioning)
confidence: 99%
“…Synthesized speech can be created by concatenating part of recorded speech which is stored in a database. Speech is often based on concatenation of natural speech that is the units, which are taken from natural speech put together to form a word or sentence (3). Text-to-speech synthesis system has a wide range of applications in everyday life and a text-to-speech synthesizer is used for vocalization processed content (4). In the last decade, a great deal of TTS-Synthesis system has done much work in various languages as well as different synthesis techniques such as Unit-selection, Formant, Hidden Markov Model, and Articulatory synthesis was done by researchers (5).…”
Section: Introduction (mentioning)
confidence: 99%
“…The author Manoharan, Samuel et al [9] proposes the utilization of "the image processing techniques in the extracting the texts, recognizing and vocalizing for the people with visual impairments" the method utilizes the "latte panda alpha" for processing scanned images.…” [Journal of Artificial Intelligence and Capsule Networks (2020), Vol. 02, No. 01, Pages: 1-10, http://irojournals.com/aicn/, DOI: https://doi.org/10.36548/jaicn.2020.1.001]
Section: Related Work (mentioning)
confidence: 99%
“…It can play a significant role in many aspects such as post-office automation, national ID number recognition, parking lot management system, and online banking (Alom et al, 2018). This recognition system can also play an essential part in signboard translation, digital character conversation, keyword spotting, scene image analysis, text-to-speech conversion (Manoharan, 2019), meaning translation, and most notably in Bangla optical character recognition (OCR) (Manisha, Sreenivasa & Sundara Krishna, 2016). But it has been a great challenge to provide such a system for Bangla than most other languages.…”
Section: Introduction (mentioning)
confidence: 99%