2021
DOI: 10.1002/cpe.6316
|View full text |Cite
|
Sign up to set email alerts
|

Big data analytics for identifying electricity theft using machine learning approaches in microgrids for smart communities

Abstract: Electricity theft (ET) causes major revenue loss in power utilities. It reduces the quality of supply, raises production cost, causes legal consumers to pay the higher cost, and impacts the economy as a whole. In this article, we use the State Grid Corporation of China (SGCC) dataset, which contains electricity consumption data of 1035 days for two classes: normal and fraudulent. In this work, ET detection model is proposed that consists of four steps: interpolation, data balancing, feature extraction, and cla… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 25 publications
(12 citation statements)
references
References 44 publications
0
8
0
Order By: Relevance
“…The results achieved 61% accuracy, 96% recall, and 64% f-measure. Arif et al [21] proposed a detection model on a State Grid Corporation of China (SGCC) dataset [22]. They solve the imbalanced data using a hybrid resampling technique named synthetic minority oversampling technique with a near miss.…”
Section: Machine Learning Solutionsmentioning
confidence: 99%
“…The results achieved 61% accuracy, 96% recall, and 64% f-measure. Arif et al [21] proposed a detection model on a State Grid Corporation of China (SGCC) dataset [22]. They solve the imbalanced data using a hybrid resampling technique named synthetic minority oversampling technique with a near miss.…”
Section: Machine Learning Solutionsmentioning
confidence: 99%
“…In [31], the authors considered the imbalance data problem, high dimensionality issue, and classification. SMOTE and near miss (NM) are used in combination to resolve the data imbalance issue.…”
Section: Related Workmentioning
confidence: 99%
“…In our scenario, 0 means that the consumer is honest while 1 means that the consumer is dishonest or theft. This technique is also used by many researchers in the literature for ETD, which is a binary classification problem [12], [31], [43], etc.…”
Section: ) Logistic Regressionmentioning
confidence: 99%
“…Because minority class representation in the test sample is restricted or sometimes it doesn't appear, the overall accuracy may still be high, but the memory of the minority class is low. This shows that a model fails to recognize the class of minorities [10]. Because many machine learning problems are inherently unbalanced, this problem may happen most of the time [11].…”
Section: Introductionmentioning
confidence: 99%