2013
DOI: 10.1007/s10579-013-9230-7

The Romanian wordnet in a nutshell

Cited by 17 publications (11 citation statements)
References 4 publications
“…First, to address the former issue, RoWordNet [3], [15] must be introduced. This is the Romanian version of the Princeton WordNet, and it is a lexical database which allows the modeling of linguistic knowledge with regards to aspects of the real world.…”
Section: Related Work (mentioning)
confidence: 99%
“…As far as the Romanian language is concerned, the existence of RoWordNet [15], a large semantic network that encompasses information in the form of synsets (i.e. groups of full synonyms) linked by lexical-semantic relations, marks the central point of Romanian language processing research due to its size, completeness, and information richness.…”
Section: Introduction (mentioning)
confidence: 99%
“…The textual resources available in the corpus were initially available in different formats such as .doc and unprotected .pdf and they had to be converted into a raw text format in order to be annotated by our processing tools (Tufiş et al, 2008). The conversion of the files involved a boilerplate removal step in which footers, headers, page numbers, figures, tables, footnotes, etc.…”
Section: Corpus Structure (mentioning)
confidence: 99%
“…According to [17], word-based diacritic restorations are often knowledge intensive, relying on the existence of linguistic resources like dictionaries, part-of-speech taggers, and morphological analyzers [24]. According to [23], the word-based model is often more appropriate for languages "where the change of diacritics has a grammatical or semantic role." Nevertheless, its major challenge is in handling Out of Vocabulary (OOV) or unknown words due to data sparsity [8,9].…”
Section: Word-level Restoration (mentioning)
confidence: 99%