2019
DOI: 10.1109/mie.2019.2913015
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning for Hybrid and Plug-In Hybrid Electric Vehicle Energy Management: Recent Advances and Prospects

Abstract: E nergy management is a critical technology in plug-in hybrid-electric vehicles (PHEVs) for maximizing efficiency, fuel economy, and range, as well as reducing pollutant emissions. At the same time, deep reinforcement learning (DRL) has become an effective and important methodology to formulate model-free and realtime energy-management strategies for HEVs and PHEVs. In this article, we describe the energy-management issues of HEVs/PHEVs and summarize a variety of potential DRL applications for onboard energy m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
60
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 191 publications
(70 citation statements)
references
References 48 publications
0
60
0
1
Order By: Relevance
“…Therefore, the objectives (O1 and O2) of the study are to combat the drawbacks of conventional practices, i.e., calibration time and the inability to customize the control strategy to a specific driver. Furthermore, as suggested in literature [37], RL methods can improve fuel economy when compared to the rule based methods. However, these RL techniques come at the cost of learning time and this forms the third objective (O3), wherein the learning time must be minimized:…”
Section: Objectivesmentioning
confidence: 94%
See 1 more Smart Citation
“…Therefore, the objectives (O1 and O2) of the study are to combat the drawbacks of conventional practices, i.e., calibration time and the inability to customize the control strategy to a specific driver. Furthermore, as suggested in literature [37], RL methods can improve fuel economy when compared to the rule based methods. However, these RL techniques come at the cost of learning time and this forms the third objective (O3), wherein the learning time must be minimized:…”
Section: Objectivesmentioning
confidence: 94%
“…For continuous-spaces, the actor-critic method was used for the power management in a PHEV [36]. A qualitative study on RL techniques on HEVs and PHEVs shows potential for RL controllers to replace rule based controllers [37]. Similarly, learning based techniques have been used to train neural networks to predict the driving environment and generate an optimal torque split, achieving fuel savings [33].…”
Section: Torque Split Controlmentioning
confidence: 99%
“…The goal of the agent is to search an optimal sequence of control actions based on feedback from the environment. Owing to its characteristics of self-evaluation and self-promotion, RL is widely used in many research fields [28][29][30][31][32].…”
Section: A Rl Conceptmentioning
confidence: 99%
“…Rule-based strategies rely on predefined rules which require abundant human experiences and high calibration efforts and not adaptive to varied driving conditions and vehicle types [7][8][9]. Optimization-based strategies are considered as practical alternatives which can obtain the optimal control through the known or predicted driving conditions [10].…”
Section: Introductionmentioning
confidence: 99%