2021
DOI: 10.48550/arxiv.2103.11881
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery

Abstract: End-to-end visuomotor control is emerging as a compelling solution for robot manipulation tasks. However, imitation learning-based visuomotor control approaches tend to suffer from a common limitation, lacking the ability to recover from an out-of-distribution state caused by compounding errors. In this paper, instead of using tactile feedback or explicitly detecting the failure through vision, we investigate using the uncertainty of a policy neural network. We propose a novel uncertainty-based approach to det… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…One possible solution is to improve the accuracy of the operation by increasing the number of its learning patterns. Future tasks include real-time failure determination and estimation of recovery operations [11,12,25] and improvements to stability through hybridization with conventional control [36].…”
Section: Failure Casementioning
confidence: 99%
See 1 more Smart Citation
“…One possible solution is to improve the accuracy of the operation by increasing the number of its learning patterns. Future tasks include real-time failure determination and estimation of recovery operations [11,12,25] and improvements to stability through hybridization with conventional control [36].…”
Section: Failure Casementioning
confidence: 99%
“…However, the learning approach is a black box because it acquires behavioral rules on the basis of data, and there are still problems with reliability. Some methods have been proposed to use a model-based controller as prior knowledge for deep reinforcement learning [8,9] or to monitor the state of a system in real time during operation generation and execute a predetermined recovery action when an anomaly is detected [10][11][12]. However, none of these methods focus on the reliability of behavior.…”
Section: Introductionmentioning
confidence: 99%