2003
DOI: 10.1016/s0306-4573(02)00084-5
|View full text |Cite
|
Sign up to set email alerts
|

Engineering a multi-purpose test collection for Web retrieval experiments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
83
0
1

Year Published

2004
2004
2017
2017

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 119 publications
(86 citation statements)
references
References 10 publications
2
83
0
1
Order By: Relevance
“…9. The probability that the degree of a vertex exceeds x in (a) the largest five networks of Table I and (b) the smallest five networks in Table I. 042309-12 [12], (b) Baidu encyclopedia [12], (c) WordNet [13], (d) TREC-WT10g web graph [14], (e) Google web graph [11], (f) Internet on the autonomous systems level [11], (g) Catster and Dogster social networks [15], (h) Gowalla social network [11], and (i) Wikipedia communication network [11]. The different shadings indicate the theoretical boundaries of the regimes as in Fig.…”
Section: Appendix C: From Hidden Variables To Degreesmentioning
confidence: 99%
“…9. The probability that the degree of a vertex exceeds x in (a) the largest five networks of Table I and (b) the smallest five networks in Table I. 042309-12 [12], (b) Baidu encyclopedia [12], (c) WordNet [13], (d) TREC-WT10g web graph [14], (e) Google web graph [11], (f) Internet on the autonomous systems level [11], (g) Catster and Dogster social networks [15], (h) Gowalla social network [11], and (i) Wikipedia communication network [11]. The different shadings indicate the theoretical boundaries of the regimes as in Fig.…”
Section: Appendix C: From Hidden Variables To Degreesmentioning
confidence: 99%
“…The collection to be searched for this task is the WT10g collection [2], a 10 gigabytes subset of the VLC2 collection which in turn is a subset of a 1997 crawl of the WWW done by the Internet Archive 5 . WT10g is designed to have a relatively high density of inter-server hyperlinks.…”
Section: Worldnet Africa Hunt Memorial Library Haas Business School Umentioning
confidence: 99%
“…Tables III, IV, V and Tables VI, VII, VIII, respectively, we can see the effect of using the two different test collections. Both collections were designed to possess the basic properties of the Web (Bailey et al, 2003). However, it has been noted in TREC10 (Hawking and Craswell, 2001) and TREC11 (Craswell and Hawking, 2002) that using link analysis for the TREC10 topic relevance task and the TREC11 topic distillation task, hardly improves the retrieval effectiveness.…”
Section: Analysis Of Resultsmentioning
confidence: 99%