Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005
DOI: 10.1145/1076034.1076138
|View full text |Cite
|
Sign up to set email alerts
|

Automatic web query classification using labeled and unlabeled training data

Abstract: Accurate topical categorization of user queries allows for increased effectiveness, efficiency, and revenue potential in general-purpose web search systems. Such categorization becomes critical if the system is to return results not just from a general web collection but from topic-specific databases as well. Maintaining sufficient categorization recall is very difficult as web queries are typically short, yielding few features per query. We examine three approaches to topical categorization of general web que… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2006
2006
2020
2020

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 64 publications
(43 citation statements)
references
References 4 publications
0
41
0
Order By: Relevance
“…Typical query understanding operations include refinements of the original query (Huang & Efthimiadis, 2009), such as spelling correction (Ahmad & Kondrak, 2005;Li et al, 2006), acronym expansion (Jain et al, 2007), stemming (Porter, 1980;Peng et al, 2007a) deletion (Kumaran & Allan, 2008;Kumaran & Carvalho, 2009), query segmentation (Risvik et al, 2003;Bergsma & Wang, 2007), and named entity recognition (Guo et al, 2009). Other common query understanding operations are query topic classification, aimed to restrict the scope of the retrieved documents (Beitzel et al, 2005;Shen et al, 2006), and query expansion, aimed to enhance the query representation with useful terms from the local corpus (Rocchio, 1971;Lavrenko & Croft, 2001;Carpineto & Romano, 2012), or from external resources, such as a query log (Cui et al, 2002) or a knowledge base such as Wikipedia (He & Ounis, 2007;Li et al, 2007;Xu et al, 2009).…”
Section: Query Processingmentioning
confidence: 99%
“…Typical query understanding operations include refinements of the original query (Huang & Efthimiadis, 2009), such as spelling correction (Ahmad & Kondrak, 2005;Li et al, 2006), acronym expansion (Jain et al, 2007), stemming (Porter, 1980;Peng et al, 2007a) deletion (Kumaran & Allan, 2008;Kumaran & Carvalho, 2009), query segmentation (Risvik et al, 2003;Bergsma & Wang, 2007), and named entity recognition (Guo et al, 2009). Other common query understanding operations are query topic classification, aimed to restrict the scope of the retrieved documents (Beitzel et al, 2005;Shen et al, 2006), and query expansion, aimed to enhance the query representation with useful terms from the local corpus (Rocchio, 1971;Lavrenko & Croft, 2001;Carpineto & Romano, 2012), or from external resources, such as a query log (Cui et al, 2002) or a knowledge base such as Wikipedia (He & Ounis, 2007;Li et al, 2007;Xu et al, 2009).…”
Section: Query Processingmentioning
confidence: 99%
“…Web user queries are classified into target taxonomy by Shen et al Some of similar wok has been done by Gravano et al [20] and Beitzel et al [21]. All their work is on classification of queries but the major difference between our approach and others is that the trained data is used for categorization of the result required.…”
Section: Related Workmentioning
confidence: 99%
“…The taxonomy based data marts will help to access the contents of data warehouse and use analysis tools. [21] Information processing is one of the applications of data warehouse which is based on queries. These queries are categorized with the use of Query classifier and easily answerable to queries that reflect information stored in database directly.…”
Section: Applicability Of Multilayered Architecturementioning
confidence: 99%