Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management 2004
DOI: 10.1145/1031171.1031284
|View full text |Cite
|
Sign up to set email alerts
|

Distributional term representations

Abstract: A number of content management tasks, including term categorization, term clustering, and automated thesaurus generation, view natural language terms (e.g. words, noun phrases) as first-class objects, i.e. as objects endowed with an internal representation which makes them suitable for explicit manipulation by the corresponding algorithms. The information retrieval (IR) literature has traditionally used an extensional (aka distributional ) representation for terms according to which a term is represented by th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0
4

Year Published

2008
2008
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(34 citation statements)
references
References 37 publications
0
30
0
4
Order By: Relevance
“…LIA experimented with TF-IDF combined with Gini purity criteria; they also used a set of tokens extracted from the authors and entities metadata in the Twitter website as feature vector. UAMCLYR investigated the role of Distributional Term Representation [14] to represent terms by means of contextual information given by the term co-occurrence statistics. They used SVM classifier, and their best result was achieved with bag-of-word representation and Boolean weighting.…”
Section: Discussionmentioning
confidence: 99%
“…LIA experimented with TF-IDF combined with Gini purity criteria; they also used a set of tokens extracted from the authors and entities metadata in the Twitter website as feature vector. UAMCLYR investigated the role of Distributional Term Representation [14] to represent terms by means of contextual information given by the term co-occurrence statistics. They used SVM classifier, and their best result was achieved with bag-of-word representation and Boolean weighting.…”
Section: Discussionmentioning
confidence: 99%
“…Distributional term representations (DTRs) are tools for term representation that rely on term occurrence and co-occurrence statistics (Lavelli et al 2005). The intuition behind DTRs is that the meaning of a term can be deduced by its context; where the context for a term is determined by the other terms it co-occurs with frequently or by the documents in which the term occurs more frequently.…”
Section: Distributional Term Representationsmentioning
confidence: 99%
“…Little work has been reported on DTRs for (unimodal) information retrieval (Carrillo et al 2009;Lavelli et al 2005). In the latter field, DTRs have been only used for processing unimodal information (e.g., text) where the representation of a term is determined by its context in unimodal information.…”
Section: Distributional Term Representationsmentioning
confidence: 99%
“…The context vectors used in BoC are generated using RI and 'Document Occurrence Representation' (DOR). DOR is based on the work of Lavelli et al [13] and considers the meaning of a term as the bag of documents in which it occurs. When RI is used together with DOR, the term t is represented as a context vector:…”
Section: Random Indexingmentioning
confidence: 99%