Image Preprocessing and Modified Adaptive Thresholding for Improving Ocr

Kshetry, Rohan Lal

doi:10.2139/ssrn.4135966

Cited by 2 publications

(1 citation statement)

References 1 publication

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The weighted average method is the most commonly used grayscale method [16]. The following is the formula for the weighted average method: In some special cases, the obtained images may have problems such as angular tilt, unclear images, noise, or information loss [14], so before performing character recognition, it is necessary to pre-process the image to improve the accuracy of subsequent recognition. Common pre-processing operations include geometric transformation, image grayscale, binarization, denoising, etc.…”

Section: Ocr Recognition Technologymentioning

confidence: 99%

Research on a Web System Data-Filling Method Based on Optical Character Recognition and Multi-Text Similarity

Su,

Kang,

Fan

2024

Applied Sciences

View full text Add to dashboard Cite

In the development of web systems, data uploading is a relatively important function. The traditional method of uploading data is to manually fill out forms, but when the data to be uploaded mostly exist in the form of form images, and the form content contains a lot of similar field information and irrelevant edge information, using traditional methods is not only time-consuming and labor-intensive, but also prone to errors. This requires a technology that can automatically fill in complex form images. OCR is an optical character recognition technology that can convert images into digitized text data using computer vision methods. However, using this technology alone cannot complete the tasks of extracting relevant data and filling corresponding fields. To address this issue, this article proposes a method that combines OCR technology and Levenshtein multi-text similarity. This method can effectively solve the problem of data filling after parsing complex form images, and the application results of this method in web systems show that the filling accuracy for complex form images can reach over 90%.

show abstract

Section: Ocr Recognition Technologymentioning

confidence: 99%