Deep Reinforcement Learning (DRL) has been included into the production system for multiple objectives, including control, scheduling, and maintenance planning. Maintenance must be planned sensibly and economically in order to preserve the usable life of the production systems while not sacrificing productivity and so minimising costs and losses. In this work a hybrid simulation-based and DRL approach is employed to develop an agent that can autonomously determine when to do preventative maintenance by considering the failure probability at a particular instant and the length of time since the last maintenance operation has been performed. The novelty of this approach is the configuration of the DRL setting, in particular the reward function. Results are promising comparing the approach with a heuristic from the literature, as they show that the frequency of machine failures is dramatically reduced.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.