Transfer of recent advances in deep reinforcement learning to real-world
applications is hindered by high data demands and thus low efficiency and
scalability.
Through independent improvements to components such as replay buffers and
more stable learning algorithms, and through massively distributed systems,
training times for standard benchmark tasks have been reduced from several
days to several hours.
However, while rewards in simulated environments are well-defined and easy
to compute, reward evaluation becomes the bottleneck in many real-world
environments, e.g., in molecular optimization tasks, where computationally
demanding simulations or even experiments are required to evaluate
states and to quantify rewards.
When ground-truth evaluations become orders of magnitude more expensive than
in research scenarios, directly transferring recent advances would demand
massive compute resources just for evaluating rewards rather than for
training the models.
We propose to alleviate this problem by replacing costly ground-truth
rewards with rewards modeled by neural networks, while counteracting the
non-stationarity of state and reward distributions during training with an
active learning component.
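As a minimal illustration of this idea, a surrogate reward model can be trained online, querying the costly ground-truth evaluation only for states where an ensemble of reward models disagrees. The sketch below is not our implementation; the ensemble size, uncertainty threshold, and the toy `expensive_reward` stand-in are all illustrative assumptions.

```python
# Sketch: learned surrogate reward with an active-learning query rule.
# All hyperparameters and the toy ground-truth function are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def expensive_reward(state):
    """Stand-in for a costly simulation or experiment."""
    return np.sin(3.0 * state).sum()

class RewardEnsemble:
    """Ensemble of ridge regressors on random features; their
    disagreement serves as an uncertainty estimate."""
    def __init__(self, n_models=5, n_features=64, dim=4):
        self.W = rng.normal(size=(n_models, n_features, dim))
        self.heads = np.zeros((n_models, n_features))

    def _phi(self, states):
        # Random nonlinear features, one set per ensemble member: (M, N, F).
        return np.tanh(states @ self.W.transpose(0, 2, 1))

    def predict(self, states):
        preds = np.einsum("mnf,mf->mn", self._phi(states), self.heads)
        return preds.mean(axis=0), preds.std(axis=0)  # mean, disagreement

    def fit(self, states, rewards):
        # Ridge regression per member on a bootstrap resample of the labels.
        for m in range(self.heads.shape[0]):
            idx = rng.integers(0, len(states), len(states))
            X = self._phi(states[idx])[m]
            y = rewards[idx]
            A = X.T @ X + 1e-3 * np.eye(X.shape[1])
            self.heads[m] = np.linalg.solve(A, X.T @ y)

model = RewardEnsemble()

# Seed the surrogate with a small initial batch of ground-truth labels.
init = rng.normal(size=(16, 4))
labeled_states = list(init)
labeled_rewards = [expensive_reward(s) for s in init]
model.fit(np.array(labeled_states), np.array(labeled_rewards))

threshold = 0.1  # disagreement above this triggers a ground-truth query

for step in range(200):
    states = rng.normal(size=(32, 4))      # states visited by the policy
    r_hat, sigma = model.predict(states)

    # Active learning: label only states where the ensemble disagrees,
    # which tracks the shifting state distribution during training.
    query = sigma > threshold
    for s in states[query]:
        labeled_states.append(s)
        labeled_rewards.append(expensive_reward(s))
    if query.any():
        model.fit(np.array(labeled_states), np.array(labeled_rewards))

    # r_hat would now be fed to the RL update in place of the true reward.
```

Using ensemble disagreement as an uncertainty proxy concentrates the expensive ground-truth evaluations on states where the surrogate is unreliable, keeping the number of costly queries small even as the state and reward distributions shift.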
We demonstrate that, using our proposed method, agents can be trained in
complex real-world environments orders of magnitude faster than would be
possible with ground-truth rewards.
By enabling the application of reinforcement learning methods to new
domains, we find interesting and non-trivial solutions to real-world
optimization problems in chemistry, materials science, and engineering.
We demonstrate speed-up factors of 50 to 3000 when applying our approach to
challenges in molecular design and airfoil optimization.