2018
DOI: 10.48550/arxiv.1812.04412
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation

Abstract: Online ranker evaluation is one of the key challenges in information retrieval. While the preferences of rankers can be inferred by interleaved comparison methods, how to effectively choose the pair of rankers to generate the result list without degrading the user experience too much can be formalized as a K-armed dueling bandit problem, which is an online partial-information learning framework, where feedback comes in the form of pair-wise preferences. A commercial search system may evaluate a large number of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 39 publications
0
2
0
Order By: Relevance
“…In future work, we intend to conduct experiments on live systems, where feedback is obtained from multiple users so as to test whether CascadeHybrid can learn across users. Another direction is to adapt Thompson sampling [34] to our hybrid model, since Thompson sampling generally outperforms UCB-based algorithms [21,38].…”
Section: Discussionmentioning
confidence: 99%
“…In future work, we intend to conduct experiments on live systems, where feedback is obtained from multiple users so as to test whether CascadeHybrid can learn across users. Another direction is to adapt Thompson sampling [34] to our hybrid model, since Thompson sampling generally outperforms UCB-based algorithms [21,38].…”
Section: Discussionmentioning
confidence: 99%
“…One popular method is 'grid search', in which a set of possible values is defined for each parameter, then grid search determines the best value for each parameter to maximize the system effectiveness on a query set [48]. Alternative methods include 'line search' [43], 'Bayesian optimization' [39], and 'transfer learning' [45].…”
Section: Introductionmentioning
confidence: 99%