With the rapid development of modern industrialization in our country and the continuous improvement of people’s living standards, the changing market has put forward new requirements for traditional manufacturing enterprises. The order product demand of many manufacturing enterprises is changing from a single variety, large batch to multiple varieties, small batch. In view of this change, the traditional job-shop scheduling method is far from enough, which greatly affects the efficiency of the production job-shop. In order to solve the above problems, this paper proposes a real-time scheduling method based on reinforcement learning applied in the dynamic job-shop and a new type of neural network is designed at the same time. The neural network is designed with the high-dimensional data in the above problem as input, and a policy-based reinforcement learning algorithm is proposed based on this network. In the process of research, it was found that the reinforcement learning method not only enables the agent to use historical data for learning, but also enables it to explore and learn other possible high-reward actions within a certain range, so as to realize the optimization of production goals under real-time scheduling. The effectiveness of the proposed real-time scheduling method is verified by comparing with other common rule-based scheduling methods in the manufacturing environment.
At present, manufacturing models are characterized by multi-variety, small batch, and diversification. It is insufficient to use traditional scheduling methods for production management with high performance. A real-time production scheduling system based on reinforcement learning (RL) is suggested in an effort to address the aforementioned issues. A brand-new manufacturing neural network is created to learn the state-action values for production scheduling in real time using high-dimensional data as the input. The detailed setup of network inputs, neural network, action, and reward are also designed. Then, a policy-based reinforcement learning algorithm is proposed to achieve the optimum objective. Finally, By contrasting the proposed scheduling strategy with rule-based approaches in a smart manufacturing environment, its efficacy is demonstrated. according to experimental data, the suggested algorithm can successfully improve performance in the dynamic job-shop environment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.