2017
DOI: 10.4236/ojop.2017.62006

A Novel Approach Based on Reinforcement Learning for Finding Global Optimum

Abstract: A novel approach to optimizing any given mathematical function, called the MOdified REinforcement Learning Algorithm (MORELA), is proposed. Although Reinforcement Learning (RL) was primarily developed for solving Markov decision problems, it can be used, with some improvements, to optimize mathematical functions. At the core of MORELA, a sub-environment is generated around the best solution found in the feasible solution space and compared with the original environment. Thus, MORELA makes it possible to discover …
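The abstract's core mechanism can be illustrated with a short sketch: candidates are sampled both from the whole feasible space (the environment) and from a smaller region around the best solution found so far (the sub-environment), and the better result is kept. This is a minimal Python sketch assuming uniform sampling, a fixed sub-environment radius, and greedy acceptance; the function name, parameters, and update scheme are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np

def morela_sketch(f, lower, upper, iters=200, samples=20, shrink=0.1, seed=0):
    """Toy global search with an environment + sub-environment, MORELA-style."""
    rng = np.random.default_rng(seed)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    best_x = rng.uniform(lower, upper)
    best_f = f(best_x)
    for _ in range(iters):
        # Sample the original environment (the whole feasible space).
        env = rng.uniform(lower, upper, size=(samples, lower.size))
        # Sample a sub-environment centered on the best solution so far.
        radius = shrink * (upper - lower)
        sub_lo = np.maximum(lower, best_x - radius)
        sub_hi = np.minimum(upper, best_x + radius)
        sub = rng.uniform(sub_lo, sub_hi, size=(samples, lower.size))
        # Compare both searches and keep the overall best solution.
        for x in np.vstack((env, sub)):
            fx = f(x)
            if fx < best_f:
                best_x, best_f = x.copy(), fx
    return best_x, best_f
```

Searching the sub-environment intensifies the search near the incumbent while the global samples keep the algorithm from committing to a single basin, which is the local-optimum-avoidance property the citing papers highlight.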

Cited by 5 publications (6 citation statements); references 22 publications (28 reference statements).
“…The RL approach exhibits a higher probability of finding a global optimum than existing heuristic optimization algorithms owing to its search and reward characteristics. MORELA [28] is a global-optimum-finding algorithm based on the model-free, Q-learning-based RL approach. One advantage of MORELA is its use of a sub-environment generated around the best solution determined in the previous learning step, which plays an important role in preventing the search from falling into local optima.…”
Section: Optimization Algorithm Based on Reinforcement Learning (mentioning)
confidence: 99%
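The statement above describes MORELA as built on model-free Q-learning. For context, this is a minimal sketch of the standard tabular Q-learning update rule, i.e. the general technique, not MORELA's specific formulation:

```python
import numpy as np

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One standard tabular Q-learning step; Q is a (n_states, n_actions) array."""
    td_target = r + gamma * Q[s_next].max()    # bootstrap from the next state
    Q[s, a] += alpha * (td_target - Q[s, a])   # move Q(s, a) toward the target
    return Q
```

Being model-free means the update needs only observed transitions (s, a, r, s'), with no transition model of the environment.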
“…To the best of our knowledge, this is the first wideband NUSLA optimization approach using RL. A global-minimum-finding algorithm based on RL, known as the modified reinforcement learning algorithm (MORELA) [28], presents a significant advantage over existing heuristic algorithms: it is less sensitive to hyper-parameter settings, demonstrates a higher probability of finding the global optimum, and is more efficient for high-dimensional cost functions.…”
Section: Introduction (mentioning)
confidence: 99%
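Claims like these are typically checked on multimodal benchmarks. Reusing the `morela_sketch` defined earlier, a hypothetical run on the Rastrigin function (a standard benchmark whose global minimum is 0 at the origin) might look like the following; it is not a reproduction of the cited papers' experiments.

```python
import numpy as np

def rastrigin(x):
    """Standard multimodal benchmark; global minimum f(0) = 0."""
    return 10.0 * x.size + float(np.sum(x**2 - 10.0 * np.cos(2.0 * np.pi * x)))

d = 5  # dimensionality of the search space
x_best, f_best = morela_sketch(rastrigin, lower=[-5.12] * d, upper=[5.12] * d)
print(x_best, f_best)  # expected: near the origin, objective close to 0
```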
“…Here, [·] is the penalty coefficient. To find the antenna placement and weights of the wideband NUSLA that minimize the cost function proposed in this paper, MORELA [17], a reinforcement-learning-based heuristic optimization algorithm, was used.…”
Section: In this paper, Figure… (unclassified)
“…As a reinforcement-learning-based heuristic optimization algorithm, it has the advantage of a higher probability of finding the global optimum than existing heuristic optimization algorithms, owing to the search and reward characteristics of reinforcement learning. MORELA is a model-free, Q-learning-based reinforcement learning algorithm that searches a sub-environment of limited range centered on the best solution from the previous step; this lowers the probability of falling into a local optimum and gives it performance exceeding that of existing heuristic optimization algorithms [17]. To design a wideband NUSLA that forms nulls at all frequencies within a specified range, the optimal antenna placement and weights minimizing the cost function proposed in this paper were found using MORELA.…”
(unclassified)
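The penalty coefficient mentioned in these statements follows the standard penalty method: the objective becomes the base cost plus a coefficient times the constraint violation, so infeasible solutions are penalized rather than excluded. This is a generic sketch with a placeholder cost and constraint, not the paper's actual NUSLA formulation:

```python
import numpy as np

def penalized_cost(x, base_cost, violation, penalty_coeff=100.0):
    """Penalty method: base cost plus penalty_coeff times constraint violation."""
    return base_cost(x) + penalty_coeff * violation(x)

# Example: minimize a quadratic subject to sum(x) >= 1 (violation when sum < 1).
cost = lambda x: float(np.sum(x**2))
viol = lambda x: max(0.0, 1.0 - float(np.sum(x)))
x = np.array([0.2, 0.3])
print(penalized_cost(x, cost, viol))  # 0.13 + 100 * 0.5 = 50.13
```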
“…Moreover, increasing the data flow rate, avoiding accidents in domestic areas, and making nodes move at modest velocity for minimal fuel consumption were addressed in [7]. A modified reinforcement learning algorithm with mathematical functions was introduced to improve the performance of reinforcement learning in parameters such as the average objective function value and the average number of learning episodes across various dimensions [8]. For long-distance travel, time prediction is an effective method for managing traffic; in [9], a Gradient Boosting (GB) method is introduced for time prediction.…”
Section: Introduction (mentioning)
confidence: 99%