2011
DOI: 10.1007/978-3-642-23954-0_20
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Web Search Results with Maximum Spanning Trees

Abstract: Abstract. We present a novel method for clustering Web search results based on Word Sense Induction. First, we acquire the meanings of a query by means of a graph-based clustering algorithm that calculates the maximum spanning tree of the co-occurrence graph of the query. Then we cluster the search results based on their semantic similarity to the induced word senses. We show that our approach improves classical search result clustering methods in terms of both clustering quality and degree of diversification.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 44 publications
(36 reference statements)
0
8
0
Order By: Relevance
“…-Within an application: a further way to evaluate the output of WSI is within an application. A paradigmatic example of this kind is the application of WSI to Web search result clustering [54,13], where WSI techniques have been shown to consistently surpass non-semantic state-of-the-art systems.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…-Within an application: a further way to evaluate the output of WSI is within an application. A paradigmatic example of this kind is the application of WSI to Web search result clustering [54,13], where WSI techniques have been shown to consistently surpass non-semantic state-of-the-art systems.…”
Section: Discussionmentioning
confidence: 99%
“…We can already see the first evidence that text understanding techniques improve the state of the art in fields such as search result clustering [54,13] and lexicography [20]. However, much work is still needed to prove that a proper injection of semantics into real-world applications is always beneficial.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, Navigli and Crisafulli () and Di Marco and Navigli () present in their papers a novel approach to web search result clustering, based on the automatic discovery of word senses from raw text. WSI is performed in order to dynamically acquire an inventory of senses of the input query.…”
Section: Related Workmentioning
confidence: 99%
“…This article extends previous conference work (Navigli and Crisafulli 2010;Di Marco and Navigli 2011) by performing a novel, in-depth study of the interactions between different corpora and several different WSI algorithms, including novel ones, within the same framework, and, additionally, by providing a comparison with a stateof-the-art search result clustering engine.…”
Section: Introductionmentioning
confidence: 95%
“…r We present novel versions of previously proposed WSI graph-based algorithms, namely, SquaT++ and Balanced Maximum Spanning Tree (B-MST) (the former is an enhancement of the original SquaT algorithm [Navigli and Crisafulli 2010], and the latter is a variant of MST [Di Marco and Navigli 2011] that produces more balanced clusters).…”
Section: Introductionmentioning
confidence: 99%