“…Moreover, to address the challenges of exploding and vanishing gradients, the gated recurrent unit (GRU) is used to learn time-series information, as it is more computationally efficient than the long short-term memory (LSTM) network [34]. Furthermore, the phases of actions are incorporated into the network construction so that the feedback model depends on the evolution of the phases [1], improving the scalability of the skill model in the time domain. Fig.…”
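The gating mechanism the passage refers to can be illustrated with a minimal NumPy sketch of the standard GRU cell. This is not the authors' exact architecture; the layer sizes, parameter names, and the idea of appending a phase variable to the input are illustrative assumptions. The update gate `z` interpolates between the previous hidden state and a bounded `tanh` candidate, which is what mitigates exploding and vanishing gradients over long sequences.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x, h, params):
    """One GRU time step: returns the new hidden state."""
    Wz, Uz, bz, Wr, Ur, br, Wh, Uh, bh = params
    z = sigmoid(Wz @ x + Uz @ h + bz)              # update gate
    r = sigmoid(Wr @ x + Ur @ h + br)              # reset gate
    h_cand = np.tanh(Wh @ x + Uh @ (r * h) + bh)   # candidate state
    return (1.0 - z) * h + z * h_cand              # gated interpolation

rng = np.random.default_rng(0)
n_in, n_h = 4, 3   # illustrative sizes; a phase variable could be
                   # concatenated to x to make the model phase-dependent
params = []
for _ in range(3):  # one (W, U, b) triple per gate/candidate
    params += [rng.normal(scale=0.1, size=(n_h, n_in)),
               rng.normal(scale=0.1, size=(n_h, n_h)),
               np.zeros(n_h)]

h = np.zeros(n_h)
for x in rng.normal(size=(5, n_in)):  # unroll over a short sequence
    h = gru_step(x, h, params)
```

Because the new state is a convex combination of the old state and a `tanh`-bounded candidate, the hidden activations stay in (-1, 1), and gradients flow through the near-identity path `(1 - z) * h` rather than through repeated matrix multiplications alone.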