Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2004
DOI: 10.1145/1008992.1009048
|View full text |Cite
|
Sign up to set email alerts
|

Hourly analysis of a very large topically categorized web query log

Abstract: We review a query log of hundreds of millions of queries that constitute the total query traffic for an entire week of a generalpurpose commercial web search service. Previously, query logs have been studied from a single, cumulative view. In contrast, our analysis shows changes in popularity and uniqueness of topically categorized queries across the hours of the day. We examine query traffic on an hourly basis by matching it against lists of queries that have been topically pre-categorized by human editors. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
166
1

Year Published

2006
2006
2018
2018

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 237 publications
(172 citation statements)
references
References 26 publications
(17 reference statements)
5
166
1
Order By: Relevance
“…However, when the workload includes items whose duration is on the scale of hours (such as parallel jobs), the daily cycle cannot be ignored [248]. Also, in some cases it is wrong to assume that the daytime workload is more important; for example, in web search workloads the activity of home users in the evening and at night is no less important than the activity of professionals during the day [63]. Over long ranges, a nonstationary workload can be the result of changing usage patterns as users get to know the system better.…”
Section: Correlation With Timementioning
confidence: 99%
See 4 more Smart Citations
“…However, when the workload includes items whose duration is on the scale of hours (such as parallel jobs), the daily cycle cannot be ignored [248]. Also, in some cases it is wrong to assume that the daytime workload is more important; for example, in web search workloads the activity of home users in the evening and at night is no less important than the activity of professionals during the day [63]. Over long ranges, a nonstationary workload can be the result of changing usage patterns as users get to know the system better.…”
Section: Correlation With Timementioning
confidence: 99%
“…The diurnal cycle is a well-known and widely recognized phenomenon, which appears in practically all workloads (e.g., [326,320,524,63,761,745,756,381,192,42]). An example from parallel supercomputers is shown in Figure 6.34.…”
Section: Periodicity and The Diurnal Cyclementioning
confidence: 99%
See 3 more Smart Citations