Recognition of documents has become a basic necessity for two reasons: first to secure the existing data in paper because of the limited of their lives duration and the high rate of destruction insects, fire or humidity secondly to reduce space of archives. The aim of this work is to realize a converter that detects images and text within a document image taken by a scanner and applying a system for the recognition of characters (OCR) in order to obtain a web page (HTML extension) ready to be used in the same computer or on the web hosts to be accessible by everyone.
In this paper, the authors came up with a different approach based on the combination of the different descriptors. For object recognition, regardless of orientation, size and position, feature vectors are computed with the help of Zernike moments and Centrist descriptors. For a large data base the fact of using the classic descriptors has never been a satisfying method for perfect recognition rates. The authors deduced that the combination of descriptors can have good recognition rates, accordingthe result of a comparative study of the different descriptors and the different combinations (Zernike + Centrist, Zernike + ACP, Centrist + ACP). The Zernike moment with Centrist descriptors ended up being the best hybrid description. For the recognition process, the authors opted for support vector machine (SVM) and Neural Networks (NN). The authors illustrate the proposed method on 3D objects using representations of two-dimensional images that are taken from different angles of view are the main features leading the authors to their objective.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.