Radu Ion scite author profile

Universal Dependencies

Universal Dependencies (UD) is a framework for morphosyntactic annotation of human language, which to date has been used to create treebanks for more than 100 languages. In this article, we outline the linguistic theory of the UD framework, which draws on a long tradition of typologically oriented grammatical theories. Grammatical relations between words are centrally used to explain how predicate–argument structures are encoded morphosyntactically in different languages, while morphological features and part-of-speech classes give the properties of words. We argue that this theory is a good basis for cross-linguistically consistent annotation of typologically diverse languages in a way that supports computational natural language understanding as well as broader linguistic studies.


Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets

Tufiș, Ion, Ide (2004)


The paper presents a method for word sense disambiguation (WSD) based on parallel corpora. The method exploits recent advances in word alignment and word clustering based on automatic extraction of translation equivalents, and is supported by available aligned wordnets for the languages in the corpus. The wordnets are aligned to the Princeton WordNet, according to the principles established by EuroWordNet. The evaluation of the WSD system implementing the method described herein showed very encouraging results. The same system, used in a validation mode, can check and spot alignment errors in multilingually aligned wordnets such as BalkaNet and EuroWordNet.


The Romanian wordnet in a nutshell

Tufiș, Mititelu, Ştefănescu, et al. (2013), Lang Resources & Evaluation


Adapting the TTL Romanian POS Tagger to the Biomedical Domain

Mitrofan, Ion (2017)


This paper presents the adaptation of the Hidden Markov Model-based TTL part-of-speech tagger to the biomedical domain. TTL is a text processing platform that performs sentence splitting, tokenization, POS tagging, chunking and Named Entity Recognition (NER) for a number of languages, including Romanian. The POS tagging accuracy obtained by the TTL POS tagger exceeds 97% when TTL's baseline model is updated with training information from a Romanian biomedical corpus. This corpus is being developed in the context of the CoRoLa project (a reference corpus for the contemporary Romanian language). An informative description and statistics of the Romanian biomedical corpus are also provided.


RACAI’s Question Answering System at QA@CLEF2007

Tufiș, Ştefănescu, Ion, et al. (2008)


