2021
DOI: 10.1109/tase.2020.3043636
|View full text |Cite
|
Sign up to set email alerts
|

An Ergodic Measure for Active Learning From Equilibrium

Abstract: This paper develops KL-Ergodic Exploration from Equilibrium (KL-E 3 ), a method for robotic systems to integrate stability into actively generating informative measurements through ergodic exploration. Ergodic exploration enables robotic systems to indirectly sample from informative spatial distributions globally, avoiding local optima, and without the need to evaluate the derivatives of the distribution against the robot dynamics. Using hybrid systems theory, we derive a controller that allows a robot to expl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 21 publications
(19 citation statements)
references
References 39 publications
0
19
0
Order By: Relevance
“…( 3)) has linear time complexity, allowing us to avoid the computational complexity of IRL. As documented in prior work, ergodic control scales well to relatively high-dimensional spaces-it has been implemented on the half cheetah example with 26 observables and continuous actions [27].…”
Section: Related Workmentioning
confidence: 97%
“…( 3)) has linear time complexity, allowing us to avoid the computational complexity of IRL. As documented in prior work, ergodic control scales well to relatively high-dimensional spaces-it has been implemented on the half cheetah example with 26 observables and continuous actions [27].…”
Section: Related Workmentioning
confidence: 97%
“…This is due to the computational complexity and storage demanded in working with the ergodic metric and the control policy derived from it. Several authors deviated from the original ergodic control formulation to tackle this limitation, see e.g., [9], [10]. In [9] and [10], the authors relied on a different ergodic metric based on a Kullback-Leibler (KL) divergence measure for finite sensor footprint, where the control policy is obtained using sampling-based techniques.…”
Section: A Ergodic Controlmentioning
confidence: 99%
“…Several authors deviated from the original ergodic control formulation to tackle this limitation, see e.g., [9], [10]. In [9] and [10], the authors relied on a different ergodic metric based on a Kullback-Leibler (KL) divergence measure for finite sensor footprint, where the control policy is obtained using sampling-based techniques. Samplingbased methods avoid the curse of dimensionality but they are still computationally expensive to address the real-time computational requirements of robotics systems.…”
Section: A Ergodic Controlmentioning
confidence: 99%
See 2 more Smart Citations