2021
DOI: 10.1007/s10618-021-00791-3
|View full text |Cite
|
Sign up to set email alerts
|

Matrix sketching for supervised classification with imbalanced classes

Abstract: The presence of imbalanced classes is more and more common in practical applications and it is known to heavily compromise the learning process. In this paper we propose a new method aimed at addressing this issue in binary supervised classification. Re-balancing the class sizes has turned out to be a fruitful strategy to overcome this problem. Our proposal performs re-balancing through matrix sketching. Matrix sketching is a recently developed data compression technique that is characterized by the property o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 38 publications
(35 reference statements)
0
4
0
Order By: Relevance
“…Numerous research endeavors have been undertaken to derive a concise and updatable representation from observed data, tailored for streaming analysis (Coleman and Shrivastava 2020;Kowshik et al 2021;Jin et al 2021;Chen et al 2022;Diakonikolas et al 2022;Makarychev, Manoj, and Ovsiankin 2022). Among these approaches, matrix sketching has emerged as a viable solution, finding ap-plication in diverse streaming scenarios such as anomaly detection (Huang and Kasiviswanathan 2015), clustering (Yoo, Huang, and Kasiviswanathan 2016), visualization (Fujiwara et al 2019) and non-time series classifications (Falcone, Anderlucci, and Montanari 2022). A matrix sketch is a reducedsize representation of an input matrix that effectively retains its essential characteristics.…”
Section: Motivations Related Work and Notationsmentioning
confidence: 99%
“…Numerous research endeavors have been undertaken to derive a concise and updatable representation from observed data, tailored for streaming analysis (Coleman and Shrivastava 2020;Kowshik et al 2021;Jin et al 2021;Chen et al 2022;Diakonikolas et al 2022;Makarychev, Manoj, and Ovsiankin 2022). Among these approaches, matrix sketching has emerged as a viable solution, finding ap-plication in diverse streaming scenarios such as anomaly detection (Huang and Kasiviswanathan 2015), clustering (Yoo, Huang, and Kasiviswanathan 2016), visualization (Fujiwara et al 2019) and non-time series classifications (Falcone, Anderlucci, and Montanari 2022). A matrix sketch is a reducedsize representation of an input matrix that effectively retains its essential characteristics.…”
Section: Motivations Related Work and Notationsmentioning
confidence: 99%
“…Sketching algorithms have been proposed for key linear statistical methods such as low rank matrix approximation, principal components analysis, linear discriminant analysis and ordinary least squares regression (Mahoney 2011;Woodruff 2014;Erichson et al 2016;Falcone et al 2021). Sketching has also been investigated for Bayesian posterior approximation (Bardenet and Maillard 2015;Geppert et al 2017).…”
Section: Sketching Algorithmsmentioning
confidence: 99%
“…The bounds in Table 1 only give qualitative guidance about the embedding probability. Users will benefit from more prescriptive results in order to choose the sketch size k, and the type of sketch for applications (Grellmann et al 2016;Geppert et al 2017;Ahfock et al 2020;Falcone et al 2021).…”
Section: Sketching Algorithmsmentioning
confidence: 99%
“…The third column refers to the necessary sketch size k to obtain an -subspace embedding for an arbitrary n × d source dataset with at least probability (1 − δ). (Mahoney, 2011;Woodruff, 2014;Erichson et al, 2016;Falcone et al, 2021). Sketching has also been investigated for Bayesian posterior approximation (Bardenet and Maillard, 2015;Geppert et al, 2017).…”
Section: Sketching Algorithmsmentioning
confidence: 99%