Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval 2010
DOI: 10.1145/1835449.1835468
|View full text |Cite
|
Sign up to set email alerts
|

A joint probabilistic classification model for resource selection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(22 citation statements)
references
References 25 publications
0
20
0
Order By: Relevance
“…A resource is considered relevant if has more than a threshold (τ ) number of documents among the top T documents from the full result. Hong et al [10] extend this work for cases where a full dataset search is infeasible. Instead of the full dataset result, they build the 'full result' using just the top-T documents from each resource.…”
Section: Related Workmentioning
confidence: 87%
“…A resource is considered relevant if has more than a threshold (τ ) number of documents among the top T documents from the full result. Hong et al [10] extend this work for cases where a full dataset search is infeasible. Instead of the full dataset result, they build the 'full result' using just the top-T documents from each resource.…”
Section: Related Workmentioning
confidence: 87%
“…While current shard-selection techniques do not combine multiple types of evidence to make predictions, prior work on text-based federated search used machine learning to combine a wide range of features for the task of resource selection [Arguello et al, 2009a;Hong et al, 2010]. In particular, because shards are topically focused, the query category features discussed later in Section 2.3 might contribute valuable evidence for shard selection.…”
Section: Selective Searchmentioning
confidence: 99%
“…In our IE scenario, to estimate the number of useful documents we should define f (d) = 1{d is useful}, that is, as the indicator function that returns 1 if d is useful and 0 otherwise. Various methods have been proposed to estimate properties of (queryable) document collections (e.g., collection size, number of documents relevant to a query, average document length) [4,19,34,35], and these methods can be classified in three broad classes: (i) surrogate-based methods, (ii) query pool-based methods, and (iii) query pool-free methods.…”
Section: Overview Of Estimation Approachesmentioning
confidence: 99%
“…19 Now, to normalize DCG@k-and obtain nDCG@k-, we need to calculate the DCG@k of an ideal ranking, namely, IDCG@k. Finally, nDCG@k = DCG@k IDCG@k .…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation