Advances in Data Mining Knowledge Discovery and Applications 2012
DOI: 10.5772/50787
|View full text |Cite
|
Sign up to set email alerts
|

Selecting Representative Data Sets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(53 citation statements)
references
References 103 publications
0
52
0
1
Order By: Relevance
“…In [13], the authors evaluated a pool of potential input variables to predict tower top acceleration signal with a wrapper algorithm [22], which includes the predictor model to search for the variables that reduce prediction error (a posteriori approach). They found wind speed, tower acceleration and wind direction relevant.…”
Section: Methods and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In [13], the authors evaluated a pool of potential input variables to predict tower top acceleration signal with a wrapper algorithm [22], which includes the predictor model to search for the variables that reduce prediction error (a posteriori approach). They found wind speed, tower acceleration and wind direction relevant.…”
Section: Methods and Resultsmentioning
confidence: 99%
“…For example, were we in posession of abnormalities in our data set, we could have also used a classification algorithm to mark records into either one of two classes (normal, abnormal). In such a case, care must been taken to account for the imbalance of instances in one of the classes [22], and as a consequence, we would have had to select other metrics, such as the F-measure as recommended by the investigation of fault diagnosis in gearboxes [33].…”
Section: Comparing Both Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…The development sample is used to develop the model (learning and estimating parameters of the model), while the validation sample is used to evaluate the model and for fi nal model selection. In a later phase of model development, a third type of sample -the testing sample(s) -can be used for assessing the predictive performance of the model [Borovicka et al, 2012]. If the same dataset would be used for the development, validation and calibration, the estimation of the predictive ability of the model would be overly optimistic.…”
Section: Introductionmentioning
confidence: 99%
“…Here the complete data set is split into a learning (70 %) and a test (30 %) set (Borovicka et al 2012). In this case there is no need for an evaluation set which is needed for some algorithms and in general accounts for around 10 % of the whole data set.…”
Section: By Creating Accumulated State Vectors Combining Individual mentioning
confidence: 99%