2022
DOI: 10.1007/s10489-022-03395-6
|View full text |Cite
|
Sign up to set email alerts
|

A novel intrinsic measure of data separability

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…The DSI 71 for each class was computed as: where C i and C represent class i and the number of classes, respectively, and s i is the Kolmogorov-Smirnov (KS) distance for class i defined as: Here, and represent the intra-class (ICD) and between-class (BCD) distance sets for class i , respectively. The DSI serves as a normalized indicator of data separability (i.e., DSI ∈ (0,1)).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The DSI 71 for each class was computed as: where C i and C represent class i and the number of classes, respectively, and s i is the Kolmogorov-Smirnov (KS) distance for class i defined as: Here, and represent the intra-class (ICD) and between-class (BCD) distance sets for class i , respectively. The DSI serves as a normalized indicator of data separability (i.e., DSI ∈ (0,1)).…”
Section: Methodsmentioning
confidence: 99%
“…To evaluate the distinguishability of CVs, we employed two metrics: the Gini Impurity Score (GIS), which relies on local information (i.e., neighbors of each data point) to examine overlaps between data distributions, and the Distance-based Separability Index (DSI) 71 , which provides global information by assessing both the overall data distributions and the distributions of pairwise distances between data points. To implement the GIS, we overlayed a rectangular grid onto a 2D space of CVs (obtained from one of the above DR analyses).…”
Section: Measures Of Class Separationmentioning
confidence: 99%
See 1 more Smart Citation
“…(Wasserman, 2018). In contrast, measures of data separability such as the distance-based separability index (Guan and Loew, 2021) quantify the separability of datasets in a single scalar value. However, both approaches only contribute to the first aspect of inferring the topological structure, i.e.…”
Section: Scope Of the Studymentioning
confidence: 99%
“…The provided degree of formality is exhausted in the definition of distributional and structural outliers in precise mathematical terms and thus serves the need for precise terminology but does not overload the work with more formalism than we think necessary to contribute to the overall scope of the study. Readers interested in more rigorous mathematical approaches to infer structures in data may, for example, consult Mordohai and Medioni (2010) for dimensionality estimation and manifold learning based on tensor voting, Niyogi et al (2011) for a topological perspective on unsupervised learning, Guan and Loew (2021) of a distance‐based measure of class separability, and Kandanaarachchi and Hyndman (2020) for outlier detection in tabular data based on dimensionality reduction.…”
Section: Introductionmentioning
confidence: 99%