New Advances in Machine Learning 2010
DOI: 10.5772/9382
|View full text |Cite
|
Sign up to set email alerts
|

Data Mining with Skewed Data

Abstract: In this chapter, we explore difficulties one often encounters when applying machine learning techniques to real-world data, which frequently show skewness properties. A typical example from industry where skewed data is an intrinsic problem is fraud detection in finance data. In the following we provide examples, where appropriate, to facilitate the understanding of data mining of skewed data. The topics explored include but are not limited to:

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…This algorithm identifies different sets of predictors and different interactions between them for different subgroups in a dataset [54]. Being a nonparametric algorithm [55], it spares researchers from making distributional and metric assumptions about the data, especially since collected data are often correlated and skewed [56]. Its ease of representation and interpretation allows the researcher to represent results in a non-statistically intensive and comprehensive framework for novices in data science [57].…”
Section: Yearmentioning
confidence: 99%
“…This algorithm identifies different sets of predictors and different interactions between them for different subgroups in a dataset [54]. Being a nonparametric algorithm [55], it spares researchers from making distributional and metric assumptions about the data, especially since collected data are often correlated and skewed [56]. Its ease of representation and interpretation allows the researcher to represent results in a non-statistically intensive and comprehensive framework for novices in data science [57].…”
Section: Yearmentioning
confidence: 99%