2021
DOI: 10.1371/journal.pone.0258658
|View full text |Cite
|
Sign up to set email alerts
|

Hollow-tree super: A directional and scalable approach for feature importance in boosted tree models

Abstract: Purpose Current limitations in methodologies used throughout machine-learning to investigate feature importance in boosted tree modelling prevent the effective scaling to datasets with a large number of features, particularly when one is investigating both the magnitude and directionality of various features on the classification into a positive or negative class. This manuscript presents a novel methodology, “Hollow-tree Super” (HOTS), designed to resolve and visualize feature importance in boosted tree model… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 19 publications
0
10
0
Order By: Relevance
“…Ultimately, ML tools are capable of taking highly dimensional data and quickly making accurate decisions in highly time-critical medical scenarios, a feat that humans may never physically nor cognitively be capable of performing ( 54 ). However, if we could explain the various decisions being executed by a certain model and the specific features being analyzed to produce a certain outcome ( 24 ), physicians can better interpret these results based on logic and previous knowledge. Then, healthcare providers may not only be able to better trust these algorithms, but providers may also continually improve the model's performance when the system presents an error that is likely based on a specific wrong answer possibly being executed in a portion of a decision tree.…”
Section: Method: How Does the Tech Approach Play Out?mentioning
confidence: 99%
See 2 more Smart Citations
“…Ultimately, ML tools are capable of taking highly dimensional data and quickly making accurate decisions in highly time-critical medical scenarios, a feat that humans may never physically nor cognitively be capable of performing ( 54 ). However, if we could explain the various decisions being executed by a certain model and the specific features being analyzed to produce a certain outcome ( 24 ), physicians can better interpret these results based on logic and previous knowledge. Then, healthcare providers may not only be able to better trust these algorithms, but providers may also continually improve the model's performance when the system presents an error that is likely based on a specific wrong answer possibly being executed in a portion of a decision tree.…”
Section: Method: How Does the Tech Approach Play Out?mentioning
confidence: 99%
“…Movement toward white-box, also called glass-box, models provides a solution to address concerns of explainability. These models can often be seen with linear ( 58 ) and decision-tree based models ( 24 ), although a number of other applications are increasingly being developed ( 53 ). In fact, DL based networks make up the majority of the highly sought after radiological-AI applications for the medical field ( 1 ), such as the systems that can diagnose brain cancer during surgery.…”
Section: Method: How Does the Tech Approach Play Out?mentioning
confidence: 99%
See 1 more Smart Citation
“…The black box problem in machine learning generally limits the ability to utilize machine learning techniques in clinical practice, as there is generally a need-to-know which parts of the brain contribute to a given pathology. In order to address this, we used a boosted trees approach, called Hollow-tree Super (HoTS) ( Doyen et al, 2021 ), to determine which features of each machine learning model, in this case the functional connectivity among brain regions, were contributing most to the model’s prediction of performance on each neuropsychological test. Performance in each test was classified by a tertile split, with the upper and lower tertiles classed as poor and good performance, respectively.…”
Section: Methodsmentioning
confidence: 99%
“…In this study, we performed functional connectivity-based analysis and utilized a recently described machine learning approach ( Doyen et al, 2021 ) to identify commonalities and differences in brain regions across neuropsychological domains in a cohort of MCI and AD patients, and age-matched cognitively unimpaired subjects. We sought to explore patterns among these regions to identify potential markers which may be used in future studies to develop better disease classifiers.…”
Section: Introductionmentioning
confidence: 99%