2019
DOI: 10.48550/arxiv.1907.04360
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hybrid system identification using switching density networks

Abstract: Behaviour cloning is a commonly used strategy for imitation learning and can be extremely effective in constrained domains. However, in cases where the dynamics of an environment may be state dependent and varying, behaviour cloning places a burden on model capacity and the number of demonstrations required. This paper introduces switching density networks, which rely on a categorical reparametrisation for hybrid system identification. This results in a network comprising a classification layer that is followe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 25 publications
0
2
0
Order By: Relevance
“…Switching systems have also served as a powerful tool in various imitation learning approaches. Calinon et al ( 2010) combine traditional HMMs with Gaussian mixture regression to represent trajectory distributions, while Daniel et al (2016) use a hidden semi-Markov model to learn hierarchical policies and Burke et al (2019) introduced switching density networks for system identification and behavioral cloning. Finally, excellent work on hierarchical decomposition of policies in a fully Bayesian framework is introduced by Šošić et al (2017), albeit under known transition dynamics.…”
Section: Related Workmentioning
confidence: 99%
“…Switching systems have also served as a powerful tool in various imitation learning approaches. Calinon et al ( 2010) combine traditional HMMs with Gaussian mixture regression to represent trajectory distributions, while Daniel et al (2016) use a hidden semi-Markov model to learn hierarchical policies and Burke et al (2019) introduced switching density networks for system identification and behavioral cloning. Finally, excellent work on hierarchical decomposition of policies in a fully Bayesian framework is introduced by Šošić et al (2017), albeit under known transition dynamics.…”
Section: Related Workmentioning
confidence: 99%
“…The extraction of the policy implemented by a controller (being human or automatic) is often referred to as behavior cloning within the machine learning community, and we inherit here the same terminology. Specifically, the scope of behavior cloning is to learn a policy by imitation, i.e., the action to be performed in a given system state by extrapolating experience from a set of observationaction sequences [40]. With reference to system identification, the target is to imitate the behavior performed by an instructor (e.g., an automated control system or advanced logics on supervisory systems) by observing it operating in closed-loop over the controlled system (e.g., a device, a machine or a process).…”
Section: Behavior Cloning In Industrymentioning
confidence: 99%