Yang Guan scite author profile

Decision making for self-driving cars is usually tackled by manually encoding rules from drivers' behaviors or imitating drivers' manipulation using supervised learning techniques. Both of them rely on mass driving data to cover all possible driving scenarios. This paper presents a hierarchical reinforcement learning method for decision making of self-driving cars, which does not depend on a large amount of labeled driving data. This method comprehensively considers both high-level maneuver selection and low-level motion control in both lateral and longitudinal directions. We firstly decompose the driving tasks into three maneuvers, including driving in lane, right lane change and left lane change, and learn the sub-policy for each maneuver. Then, a master policy is learned to choose the maneuver policy to be executed in the current state. All policies including master policy and maneuver policies are represented by fully-connected neural networks and trained by using asynchronous parallel reinforcement learners (APRL), which builds a mapping from the sensory outputs to driving decisions. Different state spaces and reward functions are designed for each maneuver. We apply this method to a highway driving scenario, which demonstrates that it can realize smooth and safe decision making for self-driving cars.

show abstract

MobiCacher: Mobility-aware content caching in small-cell networks

Guan¹,

Xiao

Feng

et al. 2014

View full text Add to dashboard Cite

Abstract-Small-cell networks have been proposed to meet the demand of ever growing mobile data traffic. One of the prominent challenges faced by small-cell networks is the lack of sufficient backhaul capacity to connect small-cell base stations (small-BSs) to the core network. We exploit the effective application layer semantics of both spatial and temporal locality to reduce the backhaul traffic. Specifically, small-BSs are equipped with storage facility to cache contents requested by users. As the cache hit ratio increases, most of the users' requests can be satisfied locally without incurring traffic over the backhaul. To make informed caching decisions, the mobility patterns of users must be carefully considered as users might frequently migrate from one small cell to another. We study the issue of mobility-aware content caching, which is formulated into an optimization problem with the objective to maximize the caching utility. As the problem is NP-complete, we develop a polynomial-time heuristic solution termed MobiCacher with bounded approximation ratio. We also conduct trace-based simulations to evaluate the performance of MobiCacher, which show that MobiCacher yields better caching utility than existing solutions.

show abstract

Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization

Guan¹,

Ren

et al. 2020

IEEE Trans. Veh. Technol.

104

View full text Add to dashboard Cite

Phosphorescence doping in a flexible ultramicroporous framework for high and tunable oxygen sensing efficiency

Liu

Lin

et al. 2013

Chem. Commun.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yang Guan

Estimating the relationship between urban forms and energy consumption: A case study in the Pearl River Delta, 2005–2008

Hierarchical reinforcement learning for self‐driving decision‐making without reliance on labelled driving data

MobiCacher: Mobility-aware content caching in small-cell networks

Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization

Phosphorescence doping in a flexible ultramicroporous framework for high and tunable oxygen sensing efficiency

Contact Info

Product

Resources

About