2016 IEEE International Conference on Big Data (Big Data) 2016
DOI: 10.1109/bigdata.2016.7840825
|View full text |Cite
|
Sign up to set email alerts
|

Predicting rare failure events using classification trees on large scale manufacturing data with complex interactions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(15 citation statements)
references
References 2 publications
0
15
0
Order By: Relevance
“…Hebert [72] Experimental results show that certain tree-based classifiers, such as RF and XGBoost, are better learners than logistic regression Rio et al [46] The results of this study indicate that the classification performance of sampling methods, such as ROS, RUS, and SMOTE, is strongly related to the number of mappers used within a MapReduce environment (Apache Hadoop). The type of Data-Level sampling technique does not appear to matter Baughman et al [74] The research work discusses the gamification of the DeepQA system for real-world use.…”
Section: Algorithm-level Approachesmentioning
confidence: 93%
See 2 more Smart Citations
“…Hebert [72] Experimental results show that certain tree-based classifiers, such as RF and XGBoost, are better learners than logistic regression Rio et al [46] The results of this study indicate that the classification performance of sampling methods, such as ROS, RUS, and SMOTE, is strongly related to the number of mappers used within a MapReduce environment (Apache Hadoop). The type of Data-Level sampling technique does not appear to matter Baughman et al [74] The research work discusses the gamification of the DeepQA system for real-world use.…”
Section: Algorithm-level Approachesmentioning
confidence: 93%
“…An imbalanced class environment can also be created by rare failure events within large-scale manufacturing operations. Hebert [72] separately compared RF and XGBoost (two tree-based classification methods) with logistic regression to research this issue.…”
Section: Algorithm-level Methods For Class Imbalance In Big Datamentioning
confidence: 99%
See 1 more Smart Citation
“…Three different types of data are available: numeric, categorical and time based data which are described as being both scaled and anonymised. The categorical data is extremely sparse and is excluded for this research as it was for the research in [10], [14]. Initially, this research focuses only on the numerical data as preliminary data exploration found it to be the most influential, therefore the time/date variables were not within the scope of our research for this paper.…”
Section: Manufacturing Case Studymentioning
confidence: 99%
“…Rare event predictions and creation of fault prognostic systems using alternative methods have been covered in [9], [10]. Wang et al has demonstrated the usage of BNs for modelling fault detection in a semiconductor manufacturing process [9], where both discrete and continuous variables representing relationships within a manufacturing dataset were deemed to be highly correlated.…”
Section: Introductionmentioning
confidence: 99%