2021 IEEE International Symposium on Information Theory (ISIT) 2021
DOI: 10.1109/isit45174.2021.9518203
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Control of Differentially Private Linear Quadratic Systems

Abstract: In this paper we study the problem of regret minimization in reinforcement learning (RL) under differential privacy constraints. This work is motivated by the wide range of RL applications for providing personalized service, where privacy concerns are becoming paramount. In contrast to previous works, we take the first step towards non-tabular RL settings, while providing a rigorous privacy guarantee. In particular, we consider the adaptive control of differentially private linear quadratic (LQ) systems. We de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 17 publications
(5 reference statements)
0
2
0
Order By: Relevance
“…On the other hand, Garcelon et al (2020) design the first private RL algorithm -LDP-OBI -with regret and LDP guarantees. Recently, Chowdhury, Zhou, and Shroff (2021) study linear quadratic regulators under the JDP constraint. It is worth noting that all these prior work consider only value-based RL algorithms, and a study on policy-based private RL algorithms remains elusive.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, Garcelon et al (2020) design the first private RL algorithm -LDP-OBI -with regret and LDP guarantees. Recently, Chowdhury, Zhou, and Shroff (2021) study linear quadratic regulators under the JDP constraint. It is worth noting that all these prior work consider only value-based RL algorithms, and a study on policy-based private RL algorithms remains elusive.…”
Section: Introductionmentioning
confidence: 99%
“…Noting that the information structure in the decentralized control systems mentioned above has the following feature, that is, all or part of historical control inputs of the controllers are shared with the other controllers. However, the case, where the controllers have its own private control inputs, has not been addressed in decentralized control system, which has applications in a personalized healthcare setting, in the states of a virtual keyboard user (e.g., Google GBoard users) and in the social robot for second language education of children [27]. It should be noted that the information structure where the control information are unavailable to the other decision makers will cause the estimation gain depends on the control gain and vice versa, which means the forward and backward Riccati equations are coupled, and make the calculation more complicated.…”
Section: Introductionmentioning
confidence: 99%