2021
DOI: 10.3390/app112411854
|View full text |Cite
|
Sign up to set email alerts
|

Towards a More Reliable Interpretation of Machine Learning Outputs for Safety-Critical Systems Using Feature Importance Fusion

Abstract: When machine learning supports decision-making in safety-critical systems, it is important to verify and understand the reasons why a particular output is produced. Although feature importance calculation approaches assist in interpretation, there is a lack of consensus regarding how features’ importance is quantified, which makes the explanations offered for the outcomes mostly unreliable. A possible solution to address the lack of agreement is to combine the results from multiple feature importance quantifie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(3 citation statements)
references
References 39 publications
0
2
0
1
Order By: Relevance
“…If it is set too high, anomalies will be missed, and if it is set too low, the rate of false positives will become high. Typically used methodologies for thresholding are Area Under Curve Percentage (AUCP) [16], Median Absolute Deviation (MAD) [17], Modified Thompson Tau Test (MTT) [18], Variational Autoencoders (VAE) [19], Z-Score [20] or Clustering-based techniques [21].…”
Section: Thresholdingmentioning
confidence: 99%
“…If it is set too high, anomalies will be missed, and if it is set too low, the rate of false positives will become high. Typically used methodologies for thresholding are Area Under Curve Percentage (AUCP) [16], Median Absolute Deviation (MAD) [17], Modified Thompson Tau Test (MTT) [18], Variational Autoencoders (VAE) [19], Z-Score [20] or Clustering-based techniques [21].…”
Section: Thresholdingmentioning
confidence: 99%
“…The purpose of the PFI is to calculate how much the performance measure of the model has decreased by randomly extracting the features from the data set. The amount of increase in the RMSE (Ibrahim and Jafari 2019) or MAE (Rengasamy, Rothwell, and Figueredo 2021) values can be determined by the effects of the used features in the model on the classification. The bigger the change, the more important that feature is.…”
Section: Permutation Feature Importance (Pfi)mentioning
confidence: 99%
“…Considerada uma das mais importantes fases na construção de um modelo de aprendizado de máquina, técnicas de seleção de variáveis vem sendo destaques em várias literaturas (Chen et al, 2020;Rengasamy et al, 2021), pois ao selecionar as variáveis que realmente são relevantes para serem aplicados ao modelo, ocorrerá uma interpretação melhor de como cada uma delas afetam as predições. Embora a identificação e seleção de variáveis possa ser feita de forma empírica (por meio do conhecimento de especialistas, popularidade na literatura e sucesso preditivo em pesquisas anteriores), tendo a oportunidade de melhorar ainda mais o presente modelo com recursos correlacionados e não redundantes, além de diminuir a complexidade, facilitar a compreensão e ajuda a melhorar o desempenho das métricas em exatidão, precisão e recuperação.…”
Section: Seleção De Variáveisunclassified