Overlap has been identified in previous works as a significant obstacle to automated diagnosis using data mining algorithms, since it makes it impossible to discern how each machine influences product quality. Several solutions that handle overlap have been proposed, but the final result is a list of potential overlapped root causes. The goal of this paper is to develop a solution resilient to overlap that can determine the true root cause from a list of possible root causes, when possible, and determine the conditions in which it is possible to identify the root causes. This allows for a better understanding of overlap, and enables the development of a fully automatic root cause analysis for manufacturing. To do so, we propose an automatic root cause analysis approach that uses causal inference and do calculus to determine the true root cause. The proposed approach was validated on simulated and real case-study data, and allowed for an estimation of the effect of a product passing through a certain machine while disregarding the effect of overlap, in certain conditions. The results were on par with the state-of-the-art solutions capable of handling overlap. The contributions of this paper are a graphical definition of overlap, the identification of the conditions in which is possible to overcome the effect of overlap, and a solution that can present a single true root cause when such conditions are met.
Automatic Root Cause Analysis solutions aid analysts in finding problems’ root causes by using automatic data analysis. When trying to locate the root cause of a problem in a manufacturing process, an issue-denominated overlap can occur. Overlap can impede automated diagnosis using algorithms, as the data make it impossible to discern the influence of each machine on the quality of products. This paper proposes a new measure of overlap based on an information theory concept called Positive Mutual Information. This new measure allows for a more detailed analysis. A new approach is developed for automatically finding the root causes of problems when overlap occurs. A visualization that depicts overlapped locations is also proposed to ease practitioners’ analysis. The proposed solution is validated in simulated and real case-study data. Compared to previous solutions, the proposed approach improves the capacity to pinpoint a problem’s root causes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.