“…The sampling procedures in the Bayesian branch aim to either maximize the PCS or minimize the expected opportunity cost subject to a given sampling budget (see, e.g., [15], [16], [17]). Chen et al [18], Zhang et al [19], and Gao and Chen [20] study sampling procedures to maximize the PCS for selecting an optimal subset; Xiao and Lee [21] derive the convergence rate of the false subset-selection probability, and offer an allocation rule achieving an asymptotically optimal convergence rate; and Gao and Chen [22] develop a sampling procedure based on the expected opportunity cost. In R&S, the alternatives are ranked by the expectations of their sample performance, which can be directly estimated by the sample average of each alternative, whereas in our problem, the nodes are ranked by the stationary probabilities of the Markov chain, which are estimated indirectly from the interaction samples between different nodes.…”