Web Search Clustering and Labeling with Hidden Topics

Nguyen, Cam-Tu; Phan, Xuan-Hieu; Horiguchi, Susumu; Nguyen, Thu-Trang; Ha, Quang-Thuy

doi:10.1145/1568292.1568295

Cited by 19 publications

(10 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Tseng's approach [19] labels clusters by mapping category terms to generic terms. Before this research, rules are used in Nguyen's [13] work to find readable phrases. However, the rules used in these papers are mostly lexical rules that cannot cover syntactical features.…”

Section: Related Workmentioning

confidence: 99%

“…Keywords are used in some existing research but a single term rarely gives users enough information. Existing research has reported that phrases [10,13] are more informative than keywords for understanding. However, the readability of phrases is rarely studied in existing research because it is very difficult to formalize the measurement of readability for phrases.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

Liao

et al. 2015

Knowledge-Based Systems

View full text Add to dashboard Cite

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

Liao

et al. 2015

Knowledge-Based Systems

View full text Add to dashboard Cite

“…The basic idea is that if an n-word sequence tends to appear together, the sequence is more likely to be an n-word phrase. The phrases extracted from a text collection are considered as the label candidates of the text collection [37,38]. Mei et al [39] and Lau et al [40] reported that twoword phrases (bigrams) usually work better than other n-word phrases for label generation.…”

Section: Evaluation Of Facet Labeling Based On Degree Centrality and mentioning

confidence: 99%

DF-Miner: Domain-specific facet mining by leveraging the hyperlink structure of Wikipedia

Wei

Zheng

et al. 2015

Knowledge-Based Systems

View full text Add to dashboard Cite

“…Very little further work on this topic has been done: vector-based WSI was successfully shown to improve bag-of-words adhoc Information Retrieval [36] and experimental studies [10] have provided interesting, though preliminary, insights into the use of WSI for Web search result clustering. More recently the use of hidden topics has been proposed to identify query meanings [29]. However, topics -estimated from a universal dataset -are query-independent and thus their number needs to be found beforehand.…”

Section: Related Workmentioning

confidence: 99%

Clustering Web Search Results with Maximum Spanning Trees

Marco

Navigli

2011

AI*IA 2011: Artificial Intelligence Around Man and Beyond

View full text Add to dashboard Cite

Abstract. We present a novel method for clustering Web search results based on Word Sense Induction. First, we acquire the meanings of a query by means of a graph-based clustering algorithm that calculates the maximum spanning tree of the co-occurrence graph of the query. Then we cluster the search results based on their semantic similarity to the induced word senses. We show that our approach improves classical search result clustering methods in terms of both clustering quality and degree of diversification.

show abstract

Web Search Clustering and Labeling with Hidden Topics

Cited by 19 publications

References 30 publications

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

DF-Miner: Domain-specific facet mining by leveraging the hyperlink structure of Wikipedia

Clustering Web Search Results with Maximum Spanning Trees

Contact Info

Product

Resources

About