2015
DOI: 10.1177/0143624415573215
|View full text |Cite
|
Sign up to set email alerts
|

A case study to examine the imputation of missing data to improve clustering analysis of building electrical demand

Abstract: Building performance data are widely used for daily operation, improving building efficiency, identifying and diagnosing performance problems, and commissioning. In this study, the authors explore the use of missing data imputation and clustering on an electrical demand dataset. The objective was to compare four approaches of data imputation and clustering analysis. Results of this study suggest that using multiple imputation to fill in missing data prior to performing clustering analysis results in more infor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 10 publications
0
3
0
Order By: Relevance
“…Different machine learning approaches have been employed for forecasting and backcasting missing demand data due to improper database management systems [36]. The application of clustering and missing data imputation were examined for a large electricitydemand dataset [37]. The mechanisms of missing data are often categorized into the following three types and considered accordingly for imputation: missing completely at random (MCAR), missing at random (MAR), and not missing at random (NMAR) [38].…”
Section: Review Of the State-of-the-art Researchmentioning
confidence: 99%
“…Different machine learning approaches have been employed for forecasting and backcasting missing demand data due to improper database management systems [36]. The application of clustering and missing data imputation were examined for a large electricitydemand dataset [37]. The mechanisms of missing data are often categorized into the following three types and considered accordingly for imputation: missing completely at random (MCAR), missing at random (MAR), and not missing at random (NMAR) [38].…”
Section: Review Of the State-of-the-art Researchmentioning
confidence: 99%
“…Inman et al [ 14 ] explored the use of missing data imputation and clustering on building electricity consumption data. The objective was to compare two data imputation methods: Amelia multiple imputation and cubic spline imputation.…”
Section: Introductionmentioning
confidence: 99%
“…Kornelsen and Coulibaly [34] demonstrated data-driven approaches against conventional infilling techniques (i.e., the statistical and interpolation infilling approaches) for the imputation of missing values in a distributed soil moisture dataset. Inman et al [35] compared two imputation approaches (the cubic spline and multiple imputations) and two clustering techniques (autocorrelation-based fuzzy clustering and wavelet-based clustering) on the electrical demand data of a commercial building. Nelwamondo et al [36] developed the expectation maximization (EM) algorithm, which was combined with the auto-associative neural network and genetic algorithm (GA), to solve the problem of missing data imputation.…”
Section: Introductionmentioning
confidence: 99%