2021
DOI: 10.1186/s40537-021-00428-8
|View full text |Cite
|
Sign up to set email alerts
|

An alternative approach to dimension reduction for pareto distributed data: a case study

Abstract: Deep learning models are tools for data analysis suitable for approximating (non-linear) relationships among variables for the best prediction of an outcome. While these models can be used to answer many important questions, their utility is still harshly criticized, being extremely challenging to identify which data descriptors are the most adequate to represent a given specific phenomenon of interest. With a recent experience in the development of a deep learning model designed to detect failures in mechanic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(22 citation statements)
references
References 35 publications
0
22
0
Order By: Relevance
“…Data quality affects algorithm performance. The imbalance and bias in the data distribution could weaken the capability of algorithms (Roccetti et al, 2019;Roccetti et al, 2021;Strickland 2022). Effective image preprocessing methods should be developed to improve image quality and ensure the reliability of the dataset.…”
Section: Image Preprocessingmentioning
confidence: 99%
“…Data quality affects algorithm performance. The imbalance and bias in the data distribution could weaken the capability of algorithms (Roccetti et al, 2019;Roccetti et al, 2021;Strickland 2022). Effective image preprocessing methods should be developed to improve image quality and ensure the reliability of the dataset.…”
Section: Image Preprocessingmentioning
confidence: 99%
“…Currently, several machine learning approaches and problem solutions in numerous fields have been studied [ 23 , 35 , 42 , 43 , 50 ]. Several real-time applications have been proposed using transfer learning approaches such as Natural Language Processing (NLP), Automatic Speech Recognition (ASR), and to solve computer vision problems.…”
Section: Discussionmentioning
confidence: 99%
“…The intuition behind FAD curves is inspired by counterfactual explanations -which describes how the prediction of a model changes when the input is perturbed (Wachter et al, 2018) -and the Pareto principle -which states that for many situations, approximately 80% of the outcome is due to 20% of causes (the "vital few") (Pareto, 1964;Roccetti et al, 2021). If a feature attribution method accurately ranks the most important features for a certain prediction and the Pareto principle holds true, then cumulatively dropping the most important features in descending order should yield a smaller and smaller decrease in model performance for that prediction.…”
Section: Fad Curvementioning
confidence: 99%