2022
DOI: 10.31234/osf.io/89snd
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Best Practices in Supervised Machine Learning: A Tutorial for Psychologists

Abstract: Supervised machine learning (ML) is becoming an influential research method in psychology and other social sciences. However, theoretical ML concepts and predictive modeling techniques are not yet widely taught in psychology programs. This tutorial is intended to provide a low-barrier, non-technical entrance to supervised ML for psychologists in four consecutive modules. After introducing the basic idea of supervised ML, Module I covers performance evaluation of ML models with resampling methods (performance m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 58 publications
0
10
0
Order By: Relevance
“…Individual decision trees recursively split the feature space (rules to distinguish classes) with the goal to separate the different classes of the criterion (drop out vs. remain in our case). For a detailed description of how individual decision trees operate and translate to a random forest see Pargent, Schoedel & Stachl 80 .…”
Section: Methodsmentioning
confidence: 99%
“…Individual decision trees recursively split the feature space (rules to distinguish classes) with the goal to separate the different classes of the criterion (drop out vs. remain in our case). For a detailed description of how individual decision trees operate and translate to a random forest see Pargent, Schoedel & Stachl 80 .…”
Section: Methodsmentioning
confidence: 99%
“…The development of such models requires diagnostic, meaning sensitive and privacy protected, information about individuals. Hence there are many challenges and professional requirements that need to be met for safe and ethical handling and development of such models (Pargent, Schoedel, & Stachl, 2022).…”
Section: Intentionally Modelling Vulnerabilitymentioning
confidence: 99%
“…The basic approach to detect overfitting in ML is to split the sample data into two parts: a training set whose observations are used to train the algorithm and a test set whose observations are predicted to estimate the performance of the trained algorithm on new, unseen data (Yarkoni & Westfall, 2017). Since going into detail about the terminology and foundations of ML would be beyond the scope of this paper, we refer newcomers to Pargent et al (2022) or Yarkoni and Westfall (2017) for introductions more tailored to psychologists; for a special focus on personality research and assessment, see Stachl et al (2020); for a special focus on clinical psychology, see Dwyer et al (2018).…”
Section: In Psychological Sciencementioning
confidence: 99%