2022
DOI: 10.1163/23641177-bja10055

Random Forest Analysis of Factors Predicting Science Achievement Groups: Focusing on Science Activities and Learning in School

Abstract: This study explored science-related variables that have an impact on the prediction of science achievement groups by applying the educational data mining (EDM) method of random forest analysis to extract factors associated with students categorized in three different achievement groups (high, moderate, and low) in the Korean data from the 2015 Programme for International Student Assessment (PISA). The 57 variables of science activities and learning in school collected from PISA questionnaires for students …

Cited by 2 publications (2 citation statements)
References 29 publications
“…In recent years, machine learning methods have gained traction for analyzing extensive educational assessment data, estimating the influence of individual factors, and constructing predictive models [40]. Among these techniques, the RF approach has been proposed to assess the significance of individual factors in predicting outcomes [18,19,21].…”
Section: Research Aims
confidence: 99%
“…Within the present study, 20% of the samples were designated for validation data; tree numbers were fixed to 500, 1000, and 2000 trees; each tree drew on 50% of the training data. Two variables (the square root of the total number of variables, i.e., √5 ≈ 2) were randomly selected for node splitting in each decision tree generated from bootstrapped datasets [40,53]. Therefore, n(Train) = 17,609; n(Test) = 4402.…”
Section: Random Forest
confidence: 99%
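The random forest configuration quoted above (an 80/20 train/validation split, 500/1000/2000 trees, each tree trained on a 50% bootstrap sample, and √p ≈ 2 randomly selected variables per node split) can be sketched as follows. This is not the authors' code: the synthetic data, variable names, and use of scikit-learn are illustrative assumptions.

```python
# Minimal sketch (assumed scikit-learn implementation, synthetic data)
# of the random forest setup described in the citing paper.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))            # 5 predictors, so sqrt(5) ~ 2 per split
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # illustrative binary outcome

# 20% of the samples designated for validation data
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=0)

for n_trees in (500, 1000, 2000):         # tree numbers fixed in the cited setup
    rf = RandomForestClassifier(
        n_estimators=n_trees,
        bootstrap=True,
        max_samples=0.5,                  # each tree draws on 50% of training data
        max_features="sqrt",              # sqrt(p) variables tried per node split
        random_state=0,
    )
    rf.fit(X_train, y_train)
    print(n_trees, round(rf.score(X_test, y_test), 3))
```

With `max_features="sqrt"` and five predictors, each split considers ⌊√5⌋ = 2 candidate variables, matching the quoted description; `max_samples=0.5` restricts each bootstrap draw to half of the training data.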