Enrico Anderlini scite author profile

This work presents the application of reinforcement learning for the optimal resistive control of a point absorber. The model-free Q-learning algorithm is selected in order to maximise energy absorption in each sea state. Step changes are made to the controller damping, observing the associated penalty, for excessive motions, or reward, i.e. gain in associated power. Due to the general periodicity of gravity waves, the absorbed power is averaged over a time horizon lasting several wave periods. The performance of the algorithm is assessed through the numerical simulation of a point absorber subject to motions in heave in both regular and irregular waves. The algorithm is found to converge towards the optimal controller damping in each sea state. Additionally, the model-free approach ensures the algorithm can adapt to changes to the device hydrodynamics over time and is unbiased by modelling errors.

show abstract

Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter

Anderlini

Husain

Parker

et al. 2020

JMSE

View full text Add to dashboard Cite

The levellised cost of energy of wave energy converters (WECs) is not competitive with fossil fuel-powered stations yet. To improve the feasibility of wave energy, it is necessary to develop effective control strategies that maximise energy absorption in mild sea states, whilst limiting motions in high waves. Due to their model-based nature, state-of-the-art control schemes struggle to deal with model uncertainties, adapt to changes in the system dynamics with time, and provide real-time centralised control for large arrays of WECs. Here, an alternative solution is introduced to address these challenges, applying deep reinforcement learning (DRL) to the control of WECs for the first time. A DRL agent is initialised from data collected in multiple sea states under linear model predictive control in a linear simulation environment. The agent outperforms model predictive control for high wave heights and periods, but suffers close to the resonant period of the WEC. The computational cost at deployment time of DRL is also much lower by diverting the computational effort from deployment time to training. This provides confidence in the application of DRL to large arrays of WECs, enabling economies of scale. Additionally, model-free reinforcement learning can autonomously adapt to changes in the system dynamics, enabling fault-tolerant control.

show abstract

Control of a Realistic Wave Energy Converter Model Using Least-Squares Policy Iteration

Anderlini

Forehand

Bannon³

et al. 2017

IEEE Trans. Sustain. Energy

View full text Add to dashboard Cite

Reactive control of a wave energy converter using artificial neural networks

Anderlini

Forehand

Bannon³

et al. 2017

International Journal of Marine Energy

View full text Add to dashboard Cite

Reactive control of a two-body point absorber using reinforcement learning

Anderlini

Forehand

Bannon³

et al. 2018

Ocean Engineering

View full text Add to dashboard Cite

Unsupervised anomaly detection for underwater gliders using generative adversarial networks

Harris

Salavasidis

et al. 2021

Engineering Applications of Artificial Intelligence

View full text Add to dashboard Cite

Control of a ROV carrying an object

2018

View full text Add to dashboard Cite

Unoccupied Underwater Vehicles (UUVs) are growing in importance and capabilities. Here, the trajectory control of an UUV carrying an object is investigated, with the consequent changes in system dynamics. For the first time, an Adaptive Model Predictive Control (AMPC) scheme for UUVs is developed, which selects optimal actions at the start of every time step to minimise the trajectory tracking error and prevent excessive changes in the control action over a receding time horizon. Prediction error minimisation is used to identify the linear model of the UUV in real time. The performance of AMPC is compared with existing PID and sliding-mode control (SMC) strategies through simulations. The latter is improved to prevent integral wind-up. While SMC results in best tracking performance, it imposes a strong burden on the motors due to its bang-bang action selection. AMPC presents smoother changes in applied thrust, but higher tracking errors due to non-linear effects and inaccuracies in the on-line system identification process. PID presents best overall performance, but its behaviour is expected to degrade on an actual ROV application due to sensor noise. This study will contribute to the selection of a suitable control scheme for future UUVs performing maintenance tasks autonomously.

show abstract

Hydrodynamic Modelling of An Oscillating Wave Surge Converter Including Power Take-Off

Benites-Munoz

Huang

Anderlini

et al. 2020

JMSE

View full text Add to dashboard Cite

To estimate the response of wave energy converters to different sea environments accurately is an ongoing challenge for researchers and industry, considering that there has to be a balance between guaranteeing their integrity whilst extracting the wave energy efficiently. For oscillating wave surge converters, the incident wave field is changed due to the pitching motion of the flap structure. A key component influencing this motion response is the Power Take-Off system used. Based on OpenFOAM, this paper includes the Power Take-off to establish a realistic model to simulate the operation of a three-dimensional oscillating wave surge converter by solving Reynolds Averaged Navier-Stokes equations. It examines the relationship between incident waves and the perturbed fluid field near the flap, which is of great importance when performing in arrays as neighbouring devices may influence each other. Furthermore, it investigates the influence of different control strategy systems (active and passive) in the energy extracted from regular waves related to the performance of the device. This system is estimated for each wave frequency considered and the results show the efficiency of the energy extracted from the waves is related to high amplitude pitching motions of the device in short periods of time.

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.