LMF for Arabic

Khemakhem, Aida; Gargouri, Bilel; Haddar, Kais; Hamadou, Abdelmajid Ben

doi:10.1002/9781118712696.ch6

Cited by 10 publications

(2 citation statements)

References 6 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In order to accelerate the feeding of the dictionary, we converted the content of print dictionaries into a normalized dictionary (Khemakhem et al 2009). We carried out the conversion of El-Ghany dictionary [].…”

Section: Methodsmentioning

confidence: 99%

ISO standard modeling of a large Arabic dictionary

et al. 2015

Self Cite

View full text Add to dashboard Cite

In this paper, we address the problem of the large coverage dictionaries of Arabic language usable both for direct human reading and automatic Natural Language Processing. For these purposes, we propose a normalized and implemented modeling, based on Lexical Markup Framework (LMF-ISO 24613) and Data Registry Category (DCR-ISO 12620), which allows a stable and well-defined interoperability of lexical resources through a unification of the linguistic concepts. Starting from the features of the Arabic language, and due to the fact that a large range of details and refinements need to be described specifically for Arabic, we follow a finely structuring strategy. Besides its richness in morphology, syntax and semantics knowledge, our model includes all the Arabic morphological patterns to generate the inflected forms from a given lemma and highlights the syntactic-semantic relations. In addition, an appropriate codification has been designed for the management of all types of relationships among lexical entries and their related knowledge. According to this model, a dictionary named El Madar 1 has been built and is now publicly available on line. The data are managed by a user-friendly Web-based lexicographical workstation. This work has not been done in isolation, but is the result of a collaborative effort by an international team mainly within the ISO network during a period of eight years. A. Khemakhem et al.is to merge them in order to obtain a new richer resource. More generally, the exchange remains a difficult (and expensive) issue when nothing has been scheduled for this purpose. To meet this challenge, several projects were conducted such as ACQUILEX (Bogurev et al. These projects led to the emergence of the LMF (Lexical Markup Framework) ISO standard for the lexical structure modeling (ISO 24613) (Francopoulo 2003), (Francopoulo and George 2008) in association with the ISO Data Categories Registry (DCR) 3 following ISO 12620 (Ide and Romary 2004). These standards were designed by a group of sixty ISO experts coming from different cultures, languages and continents. Numerous developments followed in different parts of the world. 4 Unfortunately, the Arabic language did not immediately benefit from the emergence of these standards, although it is spoken by more than 300 million people around the world, and is the official language of more than twenty countries. The language still uses references to different printed dictionaries based on incompatible lexicographical schools. Only few works tried the application of LMF on the Arabic language out, according to previous revisions of this standard. Some developments were made in morphology (Khemakhem, Gargouri and Abdelwahed 2006), (Romary, Salmon-Alt and Francopoulo 2004), (Salmon-Alt, Akrout and Romary 2005) and some studies were conducted in syntax (Loukil, Haddar and Ben Hamadou 2008). However, these works were developed during the drafting of the LMF standard and were not updated according to the ISO validation.Obviously, the situation of the Arabic lexica...

show abstract

Section: Methodsmentioning

confidence: 99%

ISO standard modeling of a large Arabic dictionary

et al. 2015

Self Cite

View full text Add to dashboard Cite

show abstract

“…Some illustrations of LMF were already proposed (i.e. El-Madar Dictionary for the Arabic language [11] [12] and Morphalou [14] database for the French language). However, the LMF project is interesting only in lexical data representation.…”

Section: Introductionmentioning

confidence: 99%

SoLDES: Service-oriented Lexical Database Exploitation System

Abderrahmen¹,

Gargouri²,

Jmaïel³

2016

RCS

Self Cite

View full text Add to dashboard Cite

In this work, we focuses on the assisted exploitation of lexical databases designed according to the LMF standard (Lexical Markup Framework) ISO-24613. The proposed system is a service-oriented solution which relies on a requirement-based lexical web service generation approach that expedites the task of engineers when developing NLP (Natural Language Processing) systems. Using this approach, the developer will neither deal with the database content or its structure nor use any language query. Furthermore, this approach will promote a largescale reuse of LMF lexical databases by generating lexical web services for all languages. For evaluating this approach we have tested it on the Arabic language.

show abstract