2013
DOI: 10.1007/s10898-013-0050-5
|View full text |Cite
|
Sign up to set email alerts
|

Optimal learning for sequential sampling with non-parametric beliefs

Abstract: We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel estimation set uses a dierent bandwidth to achieve better aggregation.The nal estimate uses a weighting scheme with the inverse mean square errors of the kernel estimators as weights. This weighting scheme is shown to b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
6
3

Relationship

3
6

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 27 publications
0
5
0
Order By: Relevance
“…The knowledge gradient policy can handle the presence of a variety of belief models such as (generalized) linear [30,36] or nonparametric [29,5].…”
Section: Knowledge Gradientmentioning
confidence: 99%
“…The knowledge gradient policy can handle the presence of a variety of belief models such as (generalized) linear [30,36] or nonparametric [29,5].…”
Section: Knowledge Gradientmentioning
confidence: 99%
“…The knowledge gradient policy can handle a variety of belief models such as linear [22] or nonparametric [22,19,4].…”
Section: Definition 42 the Knowledge Gradient (Kg) Policy Is Definementioning
confidence: 99%
“…Negoescu et al [2011] introduces the use of a parametric belief model, making it possible to solve problems with thousands of alternatives. For nonparametric beliefs, Mes et al [2011] proposes a hierarchical aggregation technique using the common features shared by alternatives to learn about many alternatives from even a single measurement, while Barut and Powell [2013] estimates the belief function using kernel regression and aggregation of kernels. However, all the methods above assume low dimensional belief models, where the number of features is relatively small.…”
Section: Literaturementioning
confidence: 99%