2019 International Conference on Robotics and Automation (ICRA)
DOI: 10.1109/icra.2019.8794025
Uncertainty-Aware Data Aggregation for Deep Imitation Learning

Abstract: Estimating statistical uncertainties allows autonomous agents to communicate their confidence during task execution and is important for applications in safety-critical domains such as autonomous driving. In this work, we present the uncertainty-aware imitation learning (UAIL) algorithm for improving end-to-end control systems via data aggregation. UAIL applies Monte Carlo Dropout to estimate uncertainty in the control output of end-to-end systems, using states where it is uncertain to selectively acquire new …

Cited by 20 publications (24 citation statements) · References 22 publications
“…It is desired that the actual system output achieve the response of a typical second-order system at a damping ratio ξ = 0.707, so the reward function is defined as follows: where s(k) denotes the k-th tracking error of the actual system, e(k) denotes the k-th datum of the ideal dataset Er, and ρ serves to adjust the convergence rate of the algorithm. From equation (10), it can be seen that the closer the actual position tracking error is to the ideal tracking error, the closer the reward value r approaches 1; otherwise the reward value r is close to 0. With the RL method, the coach is trained and the network weights are updated.…”
Section: Stage Ⅱ Expert Model Evolution Through the Training Of The F...
confidence: 99%
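Equation (10) itself is elided in the excerpt, but the quoted behavior (r approaches 1 as the actual tracking error s(k) nears the ideal error e(k), and approaches 0 as they diverge, with ρ setting the convergence rate) is consistent with an exponential reward shape. A minimal sketch under that assumption — the exponential form and the default ρ are illustrative, not the cited authors' equation:

```python
import math

# Hypothetical reward shape consistent with the quoted description;
# the exponential form is an assumption, since equation (10) is elided.
def reward(s_k, e_k, rho=10.0):
    """r -> 1 as the actual tracking error s_k approaches the ideal
    tracking error e_k; r -> 0 as they diverge. rho sharpens the decay."""
    return math.exp(-rho * (s_k - e_k) ** 2)

print(reward(0.1, 0.1))   # identical errors give maximal reward
print(reward(1.0, 0.1))   # large deviation drives the reward toward 0
```

Larger ρ makes the reward drop off faster around the ideal trajectory, which is one way a single scalar can "adjust the convergence rate" as the quote describes.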
“…Second, traditional training methods for machine-learning-based intelligent controllers are generally used in isolation, so a different method must be reselected for each usage scenario. The effectiveness of imitation learning [9,10] under dataset limitations depends on the size and features of the dataset, and models trained by imitation learning predict only with a certain confidence level and may not make the best decisions. In reinforcement learning [11], the complex relationship between the cost or reward function and the optimal decision must be determined, and the ideal cost function is difficult to implement in practice.…”
Section: Introduction
confidence: 99%
“…• Uncertainty-Aware Data Aggregation for Deep Imitation Learning (UAIL) [39]: UAIL gathers training data by estimating the uncertainty of the control output at sub-optimal states. Monte Carlo Dropout is used for uncertainty estimation: the output distribution is computed using multiple dropout masks at each layer, the statistics of this distribution yield an uncertainty score, and that score is compared to an uncertainty threshold.…”
Section: Uncertainty Detection and Data Aggregation
confidence: 99%
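The quoted procedure — keep dropout active at inference, sample the control output under multiple dropout masks, take the spread of the samples as an uncertainty score, and compare it to a threshold to decide whether to query the expert — can be sketched as follows. The tiny two-layer network, its random weights, and the threshold value are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

# Hypothetical sketch of the Monte Carlo Dropout uncertainty check described
# above. The network, weights, and threshold are illustrative assumptions.
rng = np.random.default_rng(0)
W1, b1 = rng.standard_normal((32, 4)), np.zeros(32)   # hidden layer
W2, b2 = rng.standard_normal((1, 32)), np.zeros(1)    # control output head
P_DROP = 0.2  # dropout rate, deliberately kept active at inference time

def forward_with_dropout(state, rng):
    h = np.maximum(W1 @ state + b1, 0.0)       # ReLU hidden activations
    mask = rng.random(h.shape) >= P_DROP       # fresh dropout mask per pass
    h = h * mask / (1.0 - P_DROP)              # inverted-dropout scaling
    return W2 @ h + b2                         # control output

def mc_dropout_uncertainty(state, n_samples=100, rng=rng):
    """Sample the control output under different dropout masks; the std of
    the samples serves as the uncertainty score."""
    outs = np.array([forward_with_dropout(state, rng) for _ in range(n_samples)])
    return outs.mean(axis=0), outs.std(axis=0)

state = rng.standard_normal(4)
mean_action, score = mc_dropout_uncertainty(state)

THRESHOLD = 0.5  # assumed value; in UAIL this threshold is tuned
if score[0] > THRESHOLD:
    print("uncertain state: query the expert and aggregate its label")
else:
    print("confident: execute the policy's own action")
```

States whose score exceeds the threshold are exactly the "uncertain" states the abstract says UAIL uses to selectively acquire new expert-labeled data.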
“…(Table entry: fuzzy neural network; trajectory tracking under varied dynamics; effective for less complex DNNs [39].) The online learning of a pretrained deep fuzzy neural-network-based controller improves control of nonlinear systems under diverse and varied operating conditions (i.e., different payloads, heights, and speeds).…”
Section: Fast Adaptation
confidence: 99%
“…In their approach, they gathered human demonstrations for grasping the sheet and for failure detection, utilizing pre-trained YOLO features to facilitate the learning of deep neural network policies. Other works on cloth-folding execution can be found in [199]-[203]. Instead of improving synthetic objects to be indistinguishable from real objects, Abolghasemi and Bölöni [204] trained the vision system to accept synthetic objects as real.…”
Section: Robots Learning From Demonstration
confidence: 99%