The computational handling of Modern Standard Arabic is a challenge in the field of natural language processing due to its highly rich morphology. However, several authors have pointed out that the Arabic morphological system is in fact extremely regular. The existing Arabic morphological analyzers have exploited this regularity to variable extent, yet we believe there is still some scope for improvement. Taking inspiration in traditional Arabic prosody, we have designed and implemented a compact and simple morphological system which in our opinion takes further advantage of the regularities encountered in the Arabic morphological system. The output of the system is a large-scale lexicon of inflected forms that has subsequently been used to create an Online Interface for a morphological analyzer of Arabic verbs. The Jabalín Online Interface is available at
OCR has seen major improvements in recent years, even though conventional OCR strategies don't yet exploit linguistic concepts on Arabic script analysis. We present a new, additional strategy that aims to enhance Arabic OCR. In this approach A. disambiguating dots are temporarily eliminated, which reduces classes of graphemes sharing the same base element to single archigraphemes and B. contextual behaviour of Arabic archigraphemes is redefined as fusing: archigraphemes merge unrecognizably into letter blocks according to a rule-based system called script grammar. The letter block is defined as the minimum unit of Arabic script formation. E.g., the word بحوث consists of two letter blocks, groups of fused allographs surrounded by graphic space, بحو and ب (BGW B). From an Arabic corpus of circa 85 million words we extracted
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.