2020
DOI: 10.1016/j.apenergy.2020.114900
|View full text |Cite
|
Sign up to set email alerts
|

Heuristic action execution for energy efficient charge-sustaining control of connected hybrid vehicles with model-free double Q-learning

Abstract: Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document. When citing, please reference the published version. Take down policy While the University of Birmingham exercises care and attention in making items available there are rare occasions when an item has been uploaded in error or has been deemed to be commercially or otherwise sensitive.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
8

Relationship

3
5

Authors

Journals

citations
Cited by 50 publications
(29 citation statements)
references
References 43 publications
0
23
0
Order By: Relevance
“…where batt is the power supplied by the battery pack and egu is the power supplied by the engine generator; egu_max is the maximum power that can be provided by the engine-generator; and dem is the vehicle's power demand. Energy management strategies, including exponential functions [7], model-based predictive control [9], [10], and model-free control [13], [14], build a nonlinear relationship between the power distribution and the vehicle states (e.g. battery SoC).…”
Section: ) Energy Management Modulementioning
confidence: 99%
See 1 more Smart Citation
“…where batt is the power supplied by the battery pack and egu is the power supplied by the engine generator; egu_max is the maximum power that can be provided by the engine-generator; and dem is the vehicle's power demand. Energy management strategies, including exponential functions [7], model-based predictive control [9], [10], and model-free control [13], [14], build a nonlinear relationship between the power distribution and the vehicle states (e.g. battery SoC).…”
Section: ) Energy Management Modulementioning
confidence: 99%
“…( 13) and Eq. (14). Logistic map, which follows a principle of biological evidencing behavior, is proposed as a unified chaotic attraction strategy in this paper because it has only one tuning parameter and is easy to be implemented in engineering applications [47].…”
Section: Chaotic Attraction With Logistic Mapmentioning
confidence: 99%
“…The vehicle speed dataset is collected from the real driving conditions. To reduce the training time and improve the control accuracy, the vehicle velocity is divided into three-speed intervals that are [0-12] m/s, [12][13][14][15][16][17][18][19][20][21][22][23][24] m/s, [24][25][26][27][28][29][30][31][32][33][34][35][36] m/s, and they represent low, medium, and high speed, respectively. Then, the classified speed intervals are adopted to train the DDPG algorithm separately until the algorithm converges, the trained neural network is stored, as depicted in Fig.…”
Section: A a Bi-level Frameworkmentioning
confidence: 99%
“…In Ref. [30], B Shuai. et al employed the max-value-based policy and the random policy to reduce the overestimation for double Q-learning.…”
Section: Introductionmentioning
confidence: 99%
“…The effectiveness of RL has been demonstrated in various vehiclecontrol-related applications [34]. Remarkable improvements in vehicle energy efficiency have been achieved by RL methods, e.g., Q-learning [35], deep Q-learning [36], double Q-learning [37], and multiple-step Q-learning [38]. Most research on RL-based power management control focus on learning from scratch [39], [40].…”
mentioning
confidence: 99%