Ching-An Cheng scite author profile

We present an end-to-end imitation learning system for agile, off-road autonomous driving using only low-cost on-board sensors. By imitating a model predictive controller equipped with advanced sensors, we train a deep neural network control policy to map raw, high-dimensional observations to continuous steering and throttle commands. Compared with recent approaches to similar tasks, our method requires neither state estimation nor on-the-fly planning to navigate the vehicle. Our approach relies on, and experimentally validates, recent imitation learning theory. Empirically, we show that policies trained with online imitation learning overcome well-known challenges related to covariate shift and generalize better than policies trained with batch imitation learning. Built on these insights, our autonomous driving system demonstrates successful high-speed off-road driving, matching the state-of-the-art performance.

show abstract

RMPflow: A Computational Graph for Automatic Motion Policy Generation

Cheng

Mukadam

Issac

et al. 2020

120

View full text Add to dashboard Cite

We develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies designed to parameterize non-Euclidean behaviors as dynamical systems in intrinsically nonlinear task spaces. Given a set of RMPs designed for individual tasks, RMPflow can consistently combine these local policies to generate an expressive global policy, while simultaneously exploiting sparse structure for computational efficiency. We study the geometric properties of RMPflow and provide sufficient conditions for stability. Finally, we experimentally demonstrate that accounting for the geometry of task policies can simplify classically difficult problems, such as planning through clutter on high-DOF manipulation systems.

show abstract

Nutritional factors and survival of patients with oral cancer

et al. 2006

View full text Add to dashboard Cite

show abstract

Imitation learning for agile autonomous driving

Pan

Cheng

Saigol

et al. 2019

The International Journal of Robotics Research

View full text Add to dashboard Cite

show abstract

A modified Newton method for radial distribution system power flow analysis

Zhang

Cheng²

1997

IEEE Trans. Power Syst.

176

View full text Add to dashboard Cite

Socio-demographic factors in the prognosis of oral cancer patients

et al. 2006

View full text Add to dashboard Cite

Bellman-consistent Pessimism for Offline Reinforcement Learning

Xie¹,

Cheng²,

Jiang³

et al. 2021

Preprint

View full text Add to dashboard Cite

The use of pessimism, when reasoning about datasets lacking exhaustive exploration, has recently gained prominence in offline reinforcement learning. Despite the robustness it adds to the algorithm, overly pessimistic reasoning can be equally damaging in precluding the discovery of good policies, which is an issue for the popular bonus-based pessimism. In this paper, we introduce the notion of Bellman-consistent pessimism for general function approximation: instead of calculating a point-wise lower bound for the value function, we implement pessimism at the initial state over the set of functions consistent with the Bellman equations. Our theoretical guarantees only require Bellman closedness as standard in the exploratory setting, in which case bonus-based pessimism fails to provide guarantees. Even in the special case of linear MDPs where stronger function-approximation assumptions hold, our result improves upon a recent bonus-based approach by O(d) in its sample complexity when the action space is finite. Remarkably, our algorithms automatically adapt to the best bias-variance tradeoff in the hindsight, whereas most prior approaches require tuning extra hyperparameters a priori.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ching-An Cheng

A three-phase power flow method for real-time distribution system analysis

Agile Autonomous Driving using End-to-End Deep Imitation Learning

RMPflow: A Computational Graph for Automatic Motion Policy Generation

Nutritional factors and survival of patients with oral cancer

Imitation learning for agile autonomous driving

A modified Newton method for radial distribution system power flow analysis

Socio-demographic factors in the prognosis of oral cancer patients

Bellman-consistent Pessimism for Offline Reinforcement Learning

Contact Info

Product

Resources

About