Abstract: Optical character recognition (OCR) is a strategy to perceive character from optically checked and digitized pages.
OCR plays an important role for Indian script research. The official language of the state Odisha is Odia. OCR face an incredible difficulties to recognize Odia language due to similar shape characters, their complex nature, the complicated way in which they combine form to compound character, use of Matra etc. Each character and numbers are passed through several modules like binarization, noise removal, segmentation, line segmentation, word segmentation, skeletonization, deskewing, thinning, thickening. The input picture is standardized to a size of 50 x 50 2D pictures. HMM is a stochastic process which has utilized in various applications for example speech recognition, Handwriting recognition, Gesture recognition. In this paper we utilized HMM to recognize the Odia character and numbers. Hidden Markov Model have many advantages such as resistant to noise, handle contrast recorded as a hard copy and the HMM devices are effectively accessible. In our proposed method we have developed an efficient recognition algorithm using HiddenMarkov model based on moment based and structural feature to recognize Odia characters and numerals.
Abstract-Microarray data processing revolves around the pivotal issue of locating genes altering their expression in response to pathogens, other organisms or other multiple environmental conditions resulted out of a comparison between infected and uninfected cells or tissues. To have a comprehensive analysis of the corollaries of certain treatments, deseases and developmental stages embodied as a data matrix on gene expression data is possible through simultaneous observation and monitoring of the expression levels of multiple genes. Clustering is the mechanism of grouping genes into clusters based on different parameters. Clustering is the process of grouping genes into clusters either considering row at a time(row clustering) or considering column at a time(column clustering). The application of clustering approach is crippled by conditions which are unrelated to genes. To get better of these problems a unique form of clustering technique has evolved which offers simultaneous clustering (both rows and columns) which is known as biclustering. A bicluster is deemed to be a sub matrix consisting data values. A bicluster is resulted out of the removal of some of the rows as well as some of the columns of given data matrix in such a fashion that each row of what is left reads the same string. A fast, simple and efficient randomized algorithm is explored in this paper, which discovers the largest bicluster by random projections.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.