2021
DOI: 10.1109/access.2021.3059221
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm

Abstract: K-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monitoring data of rolling bearing. Therefore, a novel fault diagnosis method of rolling bearing using Spark-based parallel ant colony optimization (ACO)-K-Means clustering algorithm is proposed. Firstly, a Spark-based th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 36 publications
(52 reference statements)
0
12
0
Order By: Relevance
“…Therefore, after the initial cleaning of the original data extracted by the SCADA system, the Z-score method is needed to scale the data proportionally, and the data of the same parameter is processed into the interval (−1, 1) with the mean u = 0 and the variance σ = 1. The Equation (19) is the mathematical expression of the Z-score, and the variance calculation is shown in Equation (20). During the optimization of the Extreme Random Forest parameters using the improved grey wolf optimization, the above two parameters (Table 1) are set into a twodimensional vector X(t + 1) (n_estimators, min_samples_leaf).…”
Section: Data Preprocessingmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, after the initial cleaning of the original data extracted by the SCADA system, the Z-score method is needed to scale the data proportionally, and the data of the same parameter is processed into the interval (−1, 1) with the mean u = 0 and the variance σ = 1. The Equation (19) is the mathematical expression of the Z-score, and the variance calculation is shown in Equation (20). During the optimization of the Extreme Random Forest parameters using the improved grey wolf optimization, the above two parameters (Table 1) are set into a twodimensional vector X(t + 1) (n_estimators, min_samples_leaf).…”
Section: Data Preprocessingmentioning
confidence: 99%
“…L.J. Wan [19] proposed a high-efficiency rolling bearing FD method based on Spark and improved random forest, which can increase the detection speed and obtain a higher accuracy. Ma, S.L.…”
Section: Introductionmentioning
confidence: 99%
“…proposed a fault feature extraction method utilizing time-varying envelope filtering, effectively extracting bearing cage fault signals that are often submerged in high levels of noise. Following feature extraction, the extracted features are utilized for fault classification using various methods such as support vector machines [16], Random Forest [17], and k-means [18].…”
Section: Introductionmentioning
confidence: 99%
“…Monitoring and information collection improve real-time analysis and intelligent diagnosis capabilities [ 19 , 39 ]. With the rapid development of cloud computing technology, various high-reliability and high-scalability, big data processing systems such as Hadoop, Spark, and Storm have emerged, providing favorable tools for the centralized processing of large-scale power equipment monitoring data [ 40 , 41 , 42 ]. Although these emerging computing models all offer a unified programming interface and shield more low-level details than traditional parallel computing programming models, how can they be introduced into the data processing of the power equipment monitoring center?…”
Section: Introductionmentioning
confidence: 99%