This paper presents a system for offline recognition of cursive Arabic handwritten text based on Hidden Markov Models (HMMs). The proposed work reports an effective method taking into account the context of character by applying an embedded training-based HMMs to perform and enhance the character models. The system is analytical without explicit segmentation; extracted features preceded by baseline estimation are statistical and structural to integrate both the peculiarities of the text and the pixel distribution characteristics of the word image. The experiments are done on benchmark IFN/ENIT database. The proposed work shows the effectiveness of using embedded training-based HMMs for enhancing the recognition rate, and the obtained results are promising and encouraging.
In this work, we propose a new method to enhance text in document-image. Firstly, we introduce a classical model and a way to solve it by means of a non-convex optimization problem. So, a simoultaneaous estimation of the reflectance and the luminance is obtained when the non uniform illumination (also called luminance) is a smooth function and the reflectance is a function of bounded variation. We give an analyse of this problem and some conditions of existence and unicity. Then, we consider the "log" of the classical model. A new pde's model is proposed. This method is based on the resolution of an original partial differential equation (PDE) estimating the log of the luminance. We assume that the luminance is enough smooth and is the solution of a non classical second order's PDE.Then we deduce the reflectance from the estimated luminance and the acquired image. The effectiveness and the robustness of the proposed process are shown on numerical examples in real-world situation (images acquired from cameraphones). Then, we illustrate the ability of this method to improve an Optical Character Recognition (OCR) in text recognition.
In this paper we present a system for offline recognition cursive Arabic handwritten text based on Hidden Markov Models (HMMs). The system is analytical without explicit segmentation used embedded training to perform and enhance the character models. Extraction features preceded by baseline estimation are statistical and geometric to integrate both the peculiarities of the text and the pixel distribution characteristics in the word image. These features are modelled using hidden Markov models and trained by embedded training. The experiments on images of the benchmark IFN/ENIT database show that the proposed system improves recognition.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.