“…Many statistical learning algorithms are now available in statistical software like R and Python, and it is not possible to give a complete overview here (see e.g., Hao and Ho, 2019 , for a Python overview). However, we do want to point to some of the most popular choices that have been applied to classifying answers to open-ended questions: these include tree-based methods like random forests and boosting (Schonlau and Couper, 2016 ; Kern et al, 2019 ; Schierholz and Schonlau, 2021 ), support vector machines (SVM) (Joachims, 2001 ; Bullington et al, 2007 ; He and Schonlau, 2020 , 2021 ; Khanday et al, 2021 ), multinomial regression (Schierholz and Schonlau, 2021 ) and naïve Bayes classifiers (Severin et al, 2017 ; Paudel et al, 2018 ).…”