A Response Surface Model Approach to Parameter Estimation of Reinforcement Learning for the Travelling Salesman Problem

Ottoni, André Luiz Carvalho; Nepomuceno, Erivelton G.; Oliveira, Marcos Santos de

doi:10.1007/s40313-018-0374-y

Cited by 19 publications

(84 citation statements)

References 33 publications

(41 reference statements)


“…The RL model defined for solving the path planning problem with refueling consists of a set of states (S), actions (A), and reinforcements (R). The adopted formulation is based on previous work: (Bianchi et al, 2009; Lima Júnior et al, 2010; Ottoni et al, 2018). After analyzing the logic of the problem addressed, the following structure is proposed:…”

Section: Reinforcement Learning System (unclassified)

“…The experimental methodology was based on recent work: (Ottoni et al, 2018, 2019). The simulations were carried out in MATLAB® and comprised 16 groups of experiments (2 algorithms × 4 instances × 2 problem types):…”

Section: Experiments Performed (unclassified)
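
The 2 × 4 × 2 design quoted above can be enumerated as a full crossing of three factors. The snippet below is only an illustrative sketch: the statement gives the counts but not the names of the algorithms, instances, or problem types, so every label here is a hypothetical placeholder.

```python
from itertools import product

# Placeholder labels (assumptions): the quoted statement reports only the
# counts 2 x 4 x 2, not which algorithms, instances, or problem types were used.
algorithms = ["algorithm_1", "algorithm_2"]
instances = ["instance_1", "instance_2", "instance_3", "instance_4"]
problem_types = ["type_1", "type_2"]

# Full crossing of the three factors: one tuple per experiment group.
experiment_groups = list(product(algorithms, instances, problem_types))

for index, group in enumerate(experiment_groups, start=1):
    print(index, group)
```

Crossing every level of every factor is what produces the 16 groups; dropping any combination would make the design incomplete.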

“…An Artificial Intelligence technique with relevant applications in path planning and in the Travelling Salesman Problem (TSP) is Reinforcement Learning (RL) (Gambardella and Dorigo, 1995; Konar et al, 2013; Rakshit et al, 2013; Li et al, 2015; Ottoni et al, 2018). In RL, an agent learns from successes and failures by interacting with an environment (Sutton and Barto, 2018).…”

Section: Introduction (unclassified)

“…One of the main aspects of RL is the estimation of parameters that optimize learning, such as the learning rate (α) and the discount factor (γ) (Even-Dar and Mansour, 2003; Schweighofer and Doya, 2003; Ottoni et al, 2019). The choice of parameters can directly influence the learning of a good route (Ottoni et al, 2018). In this regard, Ottoni et al (2018) present a methodology for estimating RL parameters using Response Surface Methodology (RSM) models (Myers et al, 2009).…”

Section: Introduction (unclassified)
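
To make the idea of estimating α and γ with a response surface concrete, here is a minimal sketch. It assumes a generic full second-order model in the two factors, fitted by ordinary least squares on synthetic data; it is not necessarily the model form or procedure used by Ottoni et al (2018). The function names and the made-up response are assumptions introduced for illustration.

```python
import numpy as np

def design_matrix(alpha, gamma):
    """Columns of a full second-order model in two factors:
    1, alpha, gamma, alpha^2, gamma^2, alpha*gamma."""
    alpha = np.asarray(alpha, dtype=float)
    gamma = np.asarray(gamma, dtype=float)
    return np.column_stack(
        [np.ones_like(alpha), alpha, gamma, alpha**2, gamma**2, alpha * gamma]
    )

def fit_response_surface(alpha, gamma, response):
    """Least-squares estimate of the six model coefficients."""
    coeffs, *_ = np.linalg.lstsq(
        design_matrix(alpha, gamma), np.asarray(response, dtype=float), rcond=None
    )
    return coeffs

def stationary_point(coeffs):
    """Solve grad = 0 for the fitted quadratic. The point is a minimum
    only if the Hessian is positive definite."""
    _, b1, b2, b11, b22, b12 = coeffs
    hessian = np.array([[2 * b11, b12], [b12, 2 * b22]])
    return np.linalg.solve(hessian, -np.array([b1, b2]))

# Synthetic, noise-free response (assumption): a made-up "tour length"
# whose minimum sits at alpha = 0.7, gamma = 0.3.
levels = np.linspace(0.1, 0.9, 5)
alpha_grid, gamma_grid = np.meshgrid(levels, levels)
alpha_flat, gamma_flat = alpha_grid.ravel(), gamma_grid.ravel()
tour_length = 100 + 50 * (alpha_flat - 0.7) ** 2 + 80 * (gamma_flat - 0.3) ** 2

coeffs = fit_response_surface(alpha_flat, gamma_flat, tour_length)
best_alpha, best_gamma = stationary_point(coeffs)
print(best_alpha, best_gamma)
```

With real experiments the response would be noisy, so the stationary point should be checked against the fitted Hessian and the experimental region before being adopted as the parameter estimate.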


Estimação de Parâmetros do Aprendizado por Reforço para o Problema de Planejamento de Rotas com Reabastecimento (Parameter Estimation of Reinforcement Learning for the Path Planning Problem with Refueling)

Ottoni, Nepomuceno, Oliveira 2019

Anais do 14º Simpósio Brasileiro de Automação Inteligente


Path planning is an important problem in mobile robotics. One aspect of this kind of planning for autonomous vehicles is respecting fuel constraints. Accordingly, the objective of this work is to estimate the Reinforcement Learning parameters for the path planning problem with refueling. The results indicate that the parameters estimated with the Response Surface Methodology reached the best solutions in most of the experiments.


Hyperparameter tuning of convolutional neural networks for building construction image classification

2022


Tuning of reinforcement learning parameters applied to SOP using the Scott–Knott method

et al. 2019

Self Cite


In this paper, we present a technique to tune the reinforcement learning (RL) parameters applied to the sequential ordering problem (SOP) using the Scott-Knott method. RL has been widely recognized as a powerful tool for combinatorial optimization problems, such as the travelling salesman and multidimensional knapsack problems. It seems, however, that less attention has been paid to solving the SOP. Here, we have developed an RL structure to solve the SOP that can partially fill that gap. Two traditional RL algorithms, Q-learning and SARSA, have been employed. Three learning specifications have been adopted to analyze the performance of the RL: algorithm type, reinforcement learning function, and parameter. A complete factorial experiment and the Scott-Knott method are used to find the best combination of factor levels when the source of variation is statistically significant in the analysis of variance. The performance of the proposed RL has been tested using benchmarks from the TSPLIB library. In general, the selected parameters indicate that SARSA outperforms Q-learning.
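
For readers unfamiliar with the two algorithms compared in this abstract, the sketch below shows the textbook tabular update rules for Q-learning and SARSA. It is a generic illustration, not the authors' implementation: the table size, the reward value, and the function names are assumptions chosen for the example.

```python
import numpy as np

def q_learning_update(q, state, action, reward, next_state, alpha, gamma):
    """Off-policy update: bootstrap from the best action in next_state."""
    target = reward + gamma * np.max(q[next_state])
    q[state, action] += alpha * (target - q[state, action])

def sarsa_update(q, state, action, reward, next_state, next_action, alpha, gamma):
    """On-policy update: bootstrap from the action actually taken next."""
    target = reward + gamma * q[next_state, next_action]
    q[state, action] += alpha * (target - q[state, action])

# Toy 3-state, 3-action table (assumption); row 1 holds some learned values.
q_learning_table = np.zeros((3, 3))
q_learning_table[1] = [0.0, 2.0, 1.0]
sarsa_table = q_learning_table.copy()

# Same transition for both rules; the reward of -1.0 is a made-up value.
q_learning_update(q_learning_table, 0, 1, -1.0, 1, alpha=0.5, gamma=0.9)
sarsa_update(sarsa_table, 0, 1, -1.0, 1, 2, alpha=0.5, gamma=0.9)

print(q_learning_table[0, 1], sarsa_table[0, 1])
```

The only difference is the bootstrap term: Q-learning uses the maximum value in the next state, while SARSA uses the value of the next action the policy actually selects, which is why the two can rank differently under the same α and γ.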
