2020
DOI: 10.4018/jitr.2020040110
|View full text |Cite
|
Sign up to set email alerts
|

A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

Abstract: This article proposes a bi-leveled image classification system to classify printed and handwritten English documents into mutually exclusive predefined categories. The proposed system follows the steps of preprocessing, segmentation, feature extraction, and SVM based character classification at level 1, and word association and fuzzy matching based document classification at level 2. The system architecture and its modular structure discuss various task stages and their functionalities. Further, a case study o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 60 publications
0
3
0
Order By: Relevance
“…This method can quickly extract the text in the image, but for the text with shadows or uneven illumination, the detection effect is poor because the edges or corner points of the text cannot be detected accurately. Puri et al proposed a classification-based text detection algorithm for natural scenes based on the idea of sparse representation of distinguished dictionaries [7]. The algorithm first detects image edges by wavelet transform while sliding window scans the detected image edges as patches, then obtains text candidate regions by a simple classification process using two learned discriminative dictionaries, and finally uses adaptive tour smoothing algorithm and contour projection analysis to further fine filter the candidate regions to form stable text regions [8].…”
Section: Current Status Of Researchmentioning
confidence: 99%
“…This method can quickly extract the text in the image, but for the text with shadows or uneven illumination, the detection effect is poor because the edges or corner points of the text cannot be detected accurately. Puri et al proposed a classification-based text detection algorithm for natural scenes based on the idea of sparse representation of distinguished dictionaries [7]. The algorithm first detects image edges by wavelet transform while sliding window scans the detected image edges as patches, then obtains text candidate regions by a simple classification process using two learned discriminative dictionaries, and finally uses adaptive tour smoothing algorithm and contour projection analysis to further fine filter the candidate regions to form stable text regions [8].…”
Section: Current Status Of Researchmentioning
confidence: 99%
“…Like PriceSpy, it uses web scraping techniques to gather pricing data from a range of websites. However, PriceRunner also includes features for analyzing pricing trends and providing users with recommendations for finding the best deals [13] .…”
Section: D) Pricerunnermentioning
confidence: 99%
“…The important means of education informatization is to apply information technology and network technology to education to realize the mode of "Internet + education" [1]. Education informatization covers various aspects such as education management, education process, and education resources.…”
Section: Introductionmentioning
confidence: 99%