2022
DOI: 10.1016/j.patter.2022.100589
|View full text |Cite
|
Sign up to set email alerts
|

DADApy: Distance-based analysis of data-manifolds in Python

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 43 publications
0
6
0
Order By: Relevance
“…We first employed standard information imbalance 15 to find pairwise informative relationships between features. By this approach we were able to identify features with plain correlation, such as hematocrit (EHCT) and hemoglobin (EHB), as well as asymmetric correlations in which one feature holds more information about the other than v.v.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We first employed standard information imbalance 15 to find pairwise informative relationships between features. By this approach we were able to identify features with plain correlation, such as hematocrit (EHCT) and hemoglobin (EHB), as well as asymmetric correlations in which one feature holds more information about the other than v.v.…”
Section: Discussionmentioning
confidence: 99%
“…12 . We computed the information imbalance ∆ between each pair of features using the implementation in the Python package DADApy 15 . ∆(A → B) is close to zero if feature A predicts feature B well.…”
Section: Information Imbalance Between Input Featuresmentioning
confidence: 99%
“…The nonlinear approaches are geometrical methods based on nearest neighbor (NN) statistics (Facco et al, 2017; Denti et al, 2022), implemented in Glielmo et al (2022). These methods are meant to be applied in a range of scales: at each scale, determined by the neighbors’ rank and/or by the number of data points that enter the calculation, they return the estimated number of “soft” directions, that is, the number of directions in which the features of the dataset change remarkably, as opposed to “noise” directions characterized by small variations.…”
Section: Analyses Of the Relationships Between Measures And Between T...mentioning
confidence: 99%
“…However, one can only find it on GitHub repositories, coded in Python and C++. Note that Python versions of the TWO-NN estimator have also been implemented in the recent scikit-dimension and DADApy packages (Bac et al 2021;Glielmo et al 2022). Moreover, DADApy contains routines dedicated to GRIDE.…”
Section: B Intrinsic and Other Packagesmentioning
confidence: 99%
“…Among the various options, the package Rdimtools (You 2022a) stands out, implementing 150 different algorithms, 17 of which are exclusively dedicated to ID estimation (You 2022b). Finally, it is worth mentioning that there are also Python (Van Rossum et al 2011) packages implementing different methods for ID estimation: two prominent examples are scikit-learn (Bac et al 2021) and DADApy (Glielmo et al 2022). See Appendix B for more details.…”
Section: Introductionmentioning
confidence: 99%