2023
DOI: 10.1016/j.ecolmodel.2023.110353
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of machine learning and statistical species distribution models: Quantifying overfitting supports model interpretation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 42 publications
0
2
0
Order By: Relevance
“…Generalized additive models (GAM) is a regression technique that applies smooth functions to assess the effect of predictors on response variables. Compared to other species distribution modelling (SDM) algorithms, machine learning algorithms are suitable for predicting complex relationships (Ramampiandra et al., 2023) with RF and BRT known to provide high prediction performances, while GAM is a highly flexible regression approach suitable for modelling data that do not require interactive terms (Elith et al., 2006). RF was applied using the ‘randomforest’ r package (Liaw & Wiener, 2002), BRT using the ‘gbm’ r package (Greenwell et al., 2019) and GAM using the ‘mgcv’ r package (Wood, 2017).…”
Section: Methodsmentioning
confidence: 99%
“…Generalized additive models (GAM) is a regression technique that applies smooth functions to assess the effect of predictors on response variables. Compared to other species distribution modelling (SDM) algorithms, machine learning algorithms are suitable for predicting complex relationships (Ramampiandra et al., 2023) with RF and BRT known to provide high prediction performances, while GAM is a highly flexible regression approach suitable for modelling data that do not require interactive terms (Elith et al., 2006). RF was applied using the ‘randomforest’ r package (Liaw & Wiener, 2002), BRT using the ‘gbm’ r package (Greenwell et al., 2019) and GAM using the ‘mgcv’ r package (Wood, 2017).…”
Section: Methodsmentioning
confidence: 99%
“…The application of TAVAC will allow for robust model interpretation in deep learning models leveraging commonly available H&E-stained images-especially those digitized through modern digital pathology techniquesfor high-resolution tumor status or subtype prediction. Overfitting affects model interpretability (Ramampiandra & Chollet, 2023). TAVAC is the first algorithm focused on the generalization of deep learning model interpretation.…”
Section: Application To Two Independent Breast Cancer Pathological Im...mentioning
confidence: 99%
“…At the same time, the generalization of deep learning model interpretation through attention visualization has been understudied. Ramampiandra & Chollet previously commented that overfitted models impede the interpretation of machine learning models (Ramampiandra & Chollet, 2023), yet no solution is proposed to solve the problem. Model interpretation for ViT models is carried out through analysis of attention layers, which are the weight on the input representing the focus of the model (Dosovitskiy, et al, 2020;Abnar & Zuidema, 2020).…”
mentioning
confidence: 99%
“…When a statistical model is overly intricate and catches noise in the data rather than underlying patterns, overfitting occurs. As a result, new data may not generalize well, and model interpretability may suffer [9]. Predictive models are created using machine learning methods, including decision trees [10], random forests [11], neural networks [12], and support vector machines [13].…”
Section: Introductionmentioning
confidence: 99%