Applications of Text Detection and its Challenges

Nevetha, M. P.; Baskar, A.

doi:10.1145/2791405.2791555

Cited by 13 publications

(1 citation statement)

References 74 publications

(120 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…continued on following page challenges,solutions,andconstraints.Further,theydiscussedcontentforms,imageminingmethods, andlanguage/scriptidentificationandclassificationschemes,whichwerecomparedforcriteriaof languagesused,scriptandlanguagedetection,featureextraction,grayscaleorcoloredimage,font variation,resolution,printerorscannertype,andclassifier.Theirobservationsfoundthemaximum contributionintextualcontentformwithmonoandmulti-lingualdocumentsalongwiththescript identification,anduseof300DPI,grayscale,andSVMclassifier.Pal(2014)discussedareview onlanguageandscriptidentificationmethodswithfont-cum-stylerecognitionandfurtherprovided languageoverview,origin,difficulties,singleandmulti-scriptidentificationtechniquesforprintedand handwrittendocuments,challenges,andfinally,fontstyle,generation,variation,andtheirrecognition methods Nevetha and Baskar (2015). demonstrated text detection applications and techniques withtheirchallengesongeneral,scientific,unconstrainedandscenedocumentimagesoftextual information.Theyfurtherdiscussedtextrecognitionphasesofpreprocessing,segmentation,feature extraction,andrecognition.Felhi,TabboneandSegovia(2014)providedamulti-scalestroke-based pagesegmentationapproachtogetthetext,lines,photosandbackground.Theyfollowedthesteps ofglobalstrokewidthvariation-basedtextandlineCCdetection,imagesegmentationintophoto andbackgroundregionswithactivecontourmodel,textclassification,lineseparation,textcandidate clustering by mean-shift analysis, and finally, horizontal and vertical text regions separation word Recognition and Spotting Thissectionillustratesvariouswordrecognitionandspottingmethods.Thescalable,statistical,script independentline-basedwordspottingmethodperformedminimumpreprocessing,nosegmentation, fillermodelcreationinnon-keywordregions,featureextraction,and,finally,HiddenMarkovModel (HMM)basedrecognition(Wshah,KumarandGovindaraju,2014).Thismethodhasbeentested onEnglishdocumentsfromIAMdatasets,ArabicdocumentsfromAMAdatasets,andDevanagari documentsfromLAWdatasetsandfoundsystemcomplexityofO(K 2 L)+(R 2 L*)usinglexicon-…”

mentioning

confidence: 99%

A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

Puri

Singh

2020

Journal of Information Technology Research

View full text Add to dashboard Cite

This article proposes a bi-leveled image classification system to classify printed and handwritten English documents into mutually exclusive predefined categories. The proposed system follows the steps of preprocessing, segmentation, feature extraction, and SVM based character classification at level 1, and word association and fuzzy matching based document classification at level 2. The system architecture and its modular structure discuss various task stages and their functionalities. Further, a case study on document classification is discussed to show the internal score computations of words and keywords with fuzzy matching. The experiments on proposed system illustrate that the system achieves promising results in the time-efficient manner and achieves better accuracy with less computation time for printed documents than handwritten ones. Finally, the performance of the proposed system is compared with the existing systems and it is observed that proposed system performs better than many other systems.

show abstract

mentioning

confidence: 99%