2002
DOI: 10.1007/3-540-45715-1_44
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Lingual Document Similarity Calculation Using the Multilingual Thesaurus EUROVOC

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
52
0
2

Year Published

2006
2006
2021
2021

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 72 publications
(54 citation statements)
references
References 2 publications
0
52
0
2
Order By: Relevance
“…These systems map words or concepts, such as named entities, into a common representation space by means of a multilingual thesaurus (e.g. Eurovoc (Steinberger et al, 2002) or EuroWordnet (Vossen, 1998)). However, multilingual thesauri are not always available;…”
Section: Related Workmentioning
confidence: 99%
“…These systems map words or concepts, such as named entities, into a common representation space by means of a multilingual thesaurus (e.g. Eurovoc (Steinberger et al, 2002) or EuroWordnet (Vossen, 1998)). However, multilingual thesauri are not always available;…”
Section: Related Workmentioning
confidence: 99%
“…Leek et al (1999) use machine translation tools to help cross-language topic tracking. Steinberger et al (2002) represent European document contents using descriptor terms of a multilingual thesaurus EUROVOC and measure the semantic similarity based on the distance between the two documents' representations. Vu et al (2009) use Discrete Fourier Transform (DFT) score of a word's frequency chain to measure time distribution similarity as a feature to evaluate the bilingual news similarity.…”
Section: Related Workmentioning
confidence: 99%
“…Yang et al (1997) propose to use cross-lingual information retrieval techniques to identify the correspondence between multilingual texts. Steinberger et al (2002) calculate the semantic similarity of news in different languages using a multilingual thesaurus. Vu et al (2009) present a feature-based method to include useful information sources.…”
Section: Introductionmentioning
confidence: 99%
“…Eurovoc has previously been used for the identification of translated documents [9,10], in which, the Eurovoc concepts were enriched by a set of associative phrases extracted from a large manually (keywords) annotated corpus. The Eurovoc concepts are then assigned to the documents based on the similarity between the contents of the document and the enriched associative set.…”
Section: Related Workmentioning
confidence: 99%