1999
DOI: 10.1016/s0306-4573(98)00040-5
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of collocation-based similarity measures in query expansion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2001
2001
2017
2017

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 62 publications
(24 citation statements)
references
References 15 publications
0
23
0
Order By: Relevance
“…Another popular statistical method for choosing expansion terms uses the concept of co-occurrence. For a certain term x in a user query, terms that frequently co-occur with x in the document base are shown to be excellent query expansion candidates (Kim and Choi, 1999). This technique is also evident in web search engines in the form of query suggestions provided either while eliciting the query or after an initial search has been conducted.…”
Section: Query Adaptationmentioning
confidence: 93%
“…Another popular statistical method for choosing expansion terms uses the concept of co-occurrence. For a certain term x in a user query, terms that frequently co-occur with x in the document base are shown to be excellent query expansion candidates (Kim and Choi, 1999). This technique is also evident in web search engines in the form of query suggestions provided either while eliciting the query or after an initial search has been conducted.…”
Section: Query Adaptationmentioning
confidence: 93%
“…One of them is relevance feedback using the returned results and adding new terms related to the original query and selected documents [44]. Other methods include adding relevant terms based on term frequency, document frequency from top ranked documents [45], [46], co-occurrence based techniques [47], thesaurus based techniques [48][49][50][51], desktop specific techniques [7], probability of terms over search logs [52]. Our approach uses a user intention based keyword addition to expand the original query to handle ambiguous query terms.…”
Section: Query Expansionmentioning
confidence: 99%
“…The set-based similarities [14,18] represent records as sets of tokens and estimates the similarity of records by estimating the similarity of their token sets. Given two token setss,r ⊂ T we can estimate their similarity by assigning values tos,r,s ∪r ands ∩r, and then combining these values into a final similarity score.…”
Section: Definitions Of Pairwise Record Operationsmentioning
confidence: 99%
“…The first class exploits the connection between set operations and the conjunction and disjunction functions in Proposition 2 to obtain consistent extensions of set-similarity measures, such as Jaccard or Dice, to the vector space model. We refer to, e.g., [18] or [14] for the set-based definitions of these measures.The second class uses the p-norm of a record to define distance-based similarity functions.…”
Section: Generalized Similarity Functionsmentioning
confidence: 99%