2020
DOI: 10.3390/app10238481
|View full text |Cite
|
Sign up to set email alerts
|

Decomposition Methods for Machine Learning with Small, Incomplete or Noisy Datasets

Abstract: In many machine learning applications, measurements are sometimes incomplete or noisy resulting in missing features. In other cases, and for different reasons, the datasets are originally small, and therefore, more data samples are required to derive useful supervised or unsupervised classification methods. Correct handling of incomplete, noisy or small datasets in machine learning is a fundamental and classic challenge. In this article, we provide a unified review of recently proposed methods based on signal … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 56 publications
0
10
0
Order By: Relevance
“…These interpolated values were tracked, and never exceeded 3% of the full data. For future studies having data sets with a larger percentage of missing data, it is advisable to apply a data decomposition method to fill missing values [43]. The meteorological data were obtained from the Visual Crossing Weather application program interface [36], using weather station GK2 with coordinates (28°31 12.0 N 77°15 00.0 E).…”
Section: Methods and Experimental Designmentioning
confidence: 99%
“…These interpolated values were tracked, and never exceeded 3% of the full data. For future studies having data sets with a larger percentage of missing data, it is advisable to apply a data decomposition method to fill missing values [43]. The meteorological data were obtained from the Visual Crossing Weather application program interface [36], using weather station GK2 with coordinates (28°31 12.0 N 77°15 00.0 E).…”
Section: Methods and Experimental Designmentioning
confidence: 99%
“…Four contributions proposed general methods for machine learning with low-quality datasets. In [1], the authors provided a unified review of decomposition methods, which includes linear decomposition, low-rank matrix/tensor factorization, sparse matrix/tensor decomposition and empirical mode decomposition (EMD) models. This paper illustrates the ability of these decomposition models to impute missing features, denoising and to artificially generate additional data samples (data augmentation) with examples to the brain-computer interface (BCI) and epileptic EEG analysis, among others.…”
Section: Methodological Articlesmentioning
confidence: 99%
“…Three papers addressed different problems or diseases in Neuroscience. For example, in [1], Caiafa et al (Argentina-Spain-Japan) reviewed recent approaches to deal with incomplete or noisy measurements by applying signal decomposition methods and showed their usefulness in epileptic intracranial electroencephalogram (iEEG) signals classification, among other applications. Finding epileptic focus with iEEG is usually difficult mainly because available datasets labeled by expert medical doctors are scarce.…”
Section: Medical Applicationsmentioning
confidence: 99%
See 1 more Smart Citation
“…The deep learning technology is notable for its impressive performance and generalization capability, but the number of effective samples in the medical imaging dataset is usually small, leading to performance degradation. The training model needs large amount of data to avoid overfitting (Caiafa et al, 2020 ). However, obtaining enough MRI data is not easy.…”
Section: Introductionmentioning
confidence: 99%