Mengbing Li scite author profile

Mengbing Li

1Publication

0Citation Statements Received

98Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Reinforcement Learning in Possibly Nonstationary Environments

Li¹,

Shi²,

Wu³

et al. 2022

Preprint

View full text Add to dashboard Cite

We consider reinforcement learning (RL) methods in offline nonstationary environments. Many existing RL algorithms in the literature rely on the stationarity assumption that requires the system transition and the reward function to be constant over time. However, the stationarity assumption is restrictive in practice and is likely to be violated in a number of applications, including traffic signal control, robotics and mobile health. In this paper, we develop a consistent procedure to test the nonstationarity of the optimal policy based on pre-collected historical data, without additional online data collection. Based on the proposed test, we further develop a sequential change point detection method that can be naturally coupled with existing state-of-the-art RL methods for policy optimisation in nonstationary environments. The usefulness of our method is illustrated by theoretical results, simulation studies, and a real data example from the 2018 Intern Health Study 1 . A Python implementation of the proposed procedure is available at https://github.com/limengbinggz/CUSUM-RL.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mengbing Li

Reinforcement Learning in Possibly Nonstationary Environments

Contact Info

Product

Resources

About