2021
DOI: 10.3390/app11178065
|View full text |Cite
|
Sign up to set email alerts
|

Quantification of the Information Loss Resulting from Temporal Aggregation of Wind Turbine Operating Data

Abstract: SCADA operating data are more and more used across the wind energy domain, both as a basis for power output prediction and turbine health status monitoring. Current industry practice to work with this data is by aggregating the signals at coarse resolution of typically 10-min averages, in order to reduce data transmission and storage costs. However, aggregation, i.e., downsampling, induces an inevitable loss of information and is one of the main causes of skepticism towards the use of SCADA operating data to m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 35 publications
(42 reference statements)
0
3
0
Order By: Relevance
“…The cross-correlation coefficient between each pair among these three measures is one; thus, only T BAX−BOX1 was retained to describe the battery box temperature. Second, the 10-min average erases any significant information from measures varying at a very high frequency, such as the electrical frequency and voltage (both neglected in the model) [37]. Thirdly, some variables are too scarce, i.e., their time series present many missing or nonnumerical entries [38].…”
Section: Scada Databasementioning
confidence: 99%
“…The cross-correlation coefficient between each pair among these three measures is one; thus, only T BAX−BOX1 was retained to describe the battery box temperature. Second, the 10-min average erases any significant information from measures varying at a very high frequency, such as the electrical frequency and voltage (both neglected in the model) [37]. Thirdly, some variables are too scarce, i.e., their time series present many missing or nonnumerical entries [38].…”
Section: Scada Databasementioning
confidence: 99%
“…Many engineering tools and financial models rely on 10 minutes (IEC, 2017) or hourly data. However, such an industry-standard practice comes with significant information loss (Beretta et al, 2021). The 10-minute resource data is inadequate for sizing and dispatch of integrated storage (Poudel et al, 2021), and the power system simulation study requires data at a resolution of 1 second or better.…”
Section: Introductionmentioning
confidence: 99%
“…Second, Ulmer et al, 10 who apply convolutional neural networks for failure detection, mention that the 10-min averaging process naturally leads to a loss of information. This effect is specifically studied in Beretta et al 27 Some researchers have tried to avoid these problems by using simulated high frequency data 24,28 while the industry uses additionally installed vibration sensors to increase monitoring quality. Stetco et al 29 provide a review on approaches using such Condition Monitoring Systems (CMS).…”
mentioning
confidence: 99%