2022
DOI: 10.1007/s11633-022-1314-7
|View full text |Cite
|
Sign up to set email alerts
|

A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 21 publications
(5 citation statements)
references
References 57 publications
0
3
0
Order By: Relevance
“…When transforming the encoding matrix into the positions of the particles, every priority within the path between every node is the coordinate of one axis of the particle, i.e., the dimension of the particle is 4a. Also, when initializing a population particle, its priority takes a value in the range of (0, 1], and if a transport mode does not exist within the path, the priority of that transport mode is set as 0. where 𝑣 (𝑑 + 1) and π‘₯ (𝑑 + 1) are the velocity and position of particle 𝑖 after one iteration, 𝑣 (𝑑) and π‘₯ (𝑑) are the velocity and position of particle 𝑖 before the iteration, πœ› is the inertia weight of the particle, 𝑐 and 𝑐 are the learning factors, π‘Ÿ and π‘Ÿ are random numbers between 0 and 1 (Dang et al, 2022), 𝑃 (𝑑) is the optimal position experienced by particle 𝑖 (excluding particles that exceed the limit), and 𝐺 (𝑑) is the best position experienced by the particle population after excluding particles that exceed the limit.…”
Section: Multimodal Transport Path Optimization Model Under Transport...mentioning
confidence: 99%
“…When transforming the encoding matrix into the positions of the particles, every priority within the path between every node is the coordinate of one axis of the particle, i.e., the dimension of the particle is 4a. Also, when initializing a population particle, its priority takes a value in the range of (0, 1], and if a transport mode does not exist within the path, the priority of that transport mode is set as 0. where 𝑣 (𝑑 + 1) and π‘₯ (𝑑 + 1) are the velocity and position of particle 𝑖 after one iteration, 𝑣 (𝑑) and π‘₯ (𝑑) are the velocity and position of particle 𝑖 before the iteration, πœ› is the inertia weight of the particle, 𝑐 and 𝑐 are the learning factors, π‘Ÿ and π‘Ÿ are random numbers between 0 and 1 (Dang et al, 2022), 𝑃 (𝑑) is the optimal position experienced by particle 𝑖 (excluding particles that exceed the limit), and 𝐺 (𝑑) is the best position experienced by the particle population after excluding particles that exceed the limit.…”
Section: Multimodal Transport Path Optimization Model Under Transport...mentioning
confidence: 99%
“…In Ref. [35], in order to distinguish the potentials of each subspace, the reinforcement learning method is used to dynamically allocate computing resources to each subspace. From the results, it is an effective strategy to assist the ZS in solving the MMOPs.…”
Section: Related Workmentioning
confidence: 99%
“…Intelligent algorithms improve the problem solving capability and optimize the resource allocation in solving multi-objective problems. Researchers combine multimodal multi-objective optimization with deep learning techniques to rationalize the space allocation to improve this model's area search capability and find optimal solution better [17]. In the field information analysis of drilling operations, some scholars have used support vector machines combined with simulated annealing algorithms for multi-objective optimization model building.…”
Section: Introductionmentioning
confidence: 99%