2021
DOI: 10.1016/j.rcim.2021.102202
|View full text |Cite
|
Sign up to set email alerts
|

Multi-agent reinforcement learning for online scheduling in smart factories

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 59 publications
(23 citation statements)
references
References 29 publications
0
15
0
Order By: Relevance
“…First, one of the great advantages of multi-agent systems is the distribution of a complex task into multiple simpler tasks among different agents to avoid the exponential growth of the joint action space [92,93]. However, multi-agent RL systems present different challenges such as agent heterogeneity, communication, the definition of collective goals (cooperation), scalability, design of compact representations of the true state of the environment, and the main problem of the non-stationarity [86,87].…”
Section: Discussionmentioning
confidence: 99%
“…First, one of the great advantages of multi-agent systems is the distribution of a complex task into multiple simpler tasks among different agents to avoid the exponential growth of the joint action space [92,93]. However, multi-agent RL systems present different challenges such as agent heterogeneity, communication, the definition of collective goals (cooperation), scalability, design of compact representations of the true state of the environment, and the main problem of the non-stationarity [86,87].…”
Section: Discussionmentioning
confidence: 99%
“… Zhou et al (2021) proposed a smart factory comprising varieties of components and developed a multi-agent actor–critic approach for decentralized job scheduling. The machines each correspond with an actor–critic agent (enhanced also with target policy and target critic networks), which can all observe all the states of the other agents ( via communication) in order to deal with the dynamic environment and also deal with non-stationarity.…”
Section: Applicationsmentioning
confidence: 99%
“…However, the corresponding information sharing costs reduce the scalability and efficiency of this approach. A close look at the most comprehensive approaches that incorporate a larger number of agents, such as Kim et al (2020) and Zhou et al (2021) , demonstrates the fact that scalable approaches tend to use more sparse communication strategies where agents are selectively chosen to exchange information to save time and cost. A more advanced approach, however, is the use of negotiation-based strategies incorporated in Wang (2020) and Kim et al (2020) , which can even be enhanced by taking advantage of negotiation learning.…”
Section: Applicationsmentioning
confidence: 99%
“…Message scheduling is a process in which action is carried out by a distributed system's broker (scheduler) mechanism [13], it makes use of message contexts or any other type of information that may be considered according to their priorities. Messages are categorized in the proposed system based on their features and quality of service (QoS) specifications, and applications are divided into two categories: critical and non-critical.…”
Section: Introductionmentioning
confidence: 99%