A comparison of univariate probit and logit models using simulation

Alsoruji, Abeer H.; Binhimd, Sulafah; Elaal, Mervat K. Abd

doi:10.12988/ams.2018.818

Cited by 2 publications

(2 citation statements)

References 6 publications

(2 reference statements)


“…Given that the dependent variables (late-life career participation and health-related working capacity) were binary indicators, the discrete choice model was applied to examine the relationships of variables. The logit model with n regressors (x_1, x_2, ..., x_n) performs better than the probit model with n regressors in the case of larger sample size, because when the sample size increases, the probability of observes in tail increases too (Alsoruji et al. 2018). Thus, the logit model is used for the second regression (health-related working capacity) with the larger valid sample size.…”

Section: Methods (mentioning)

confidence: 99%

Chronic patients as retirement-aged workers: the impact of employment-based health insurance and chronic conditions on health-related working capacity and late-life career participation

Yuan, Fang, Li et al. 2022. Eur J Ageing


Retirement-aged workers with chronic conditions are increasingly engaged in late-life careers in the policy context of delayed retirement initiative. However, it remains uncertain as to how chronic conditions and employment-based social health insurance interact to affect health-related working capacity and late career participation in this group of people. Using data from the China Health and Retirement Longitudinal Study (CHARLS) and the discrete choice model, this study finds that chronic conditions are negatively associated with health-related working capacity (– 0.400, p < 0.01) and late-life career participation (– 0.170, p < 0.01). Employment-based health insurance is positively associated with health-related working capacity of retirement-aged workers (0.432, p < 0.01), but is negatively associated with their late-life career participation (– 1.027, p < 0.01). Moreover, employment-based health insurance could weaken the negative associations between chronic conditions and health-related working capacity (interaction = 0.285, p < 0.05) and late-life career participation (interaction = 0.251, p < 0.05). More fine-grained policies for delayed retirement are needed to focus on the long-neglected health of retirement-aged workers with chronic conditions.
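The citation statement above ties the choice of logit over probit to observations in the tails. As a minimal sketch of why the two link functions differ there, the following compares two-sided tail probabilities of the logistic and normal distributions. Rescaling the logistic to unit variance, the thresholds `k`, and the function names are assumptions made for illustration; none of them come from Alsoruji et al. (2018).

```python
import math

# A standard logistic variable has variance pi^2 / 3; dividing by this
# standard deviation puts it on the same scale as a standard normal.
# (Assumption: unit-variance rescaling is used only for this comparison.)
LOGISTIC_SD = math.pi / math.sqrt(3.0)


def logistic_cdf(z):
    """CDF of the standard logistic distribution (the logit link)."""
    return 1.0 / (1.0 + math.exp(-z))


def normal_cdf(z):
    """CDF of the standard normal distribution (the probit link)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def tail_probabilities(k):
    """Return P(|Z| > k) for a unit-variance logistic and a standard normal."""
    logit_tail = 2.0 * (1.0 - logistic_cdf(k * LOGISTIC_SD))
    probit_tail = 2.0 * (1.0 - normal_cdf(k))
    return logit_tail, probit_tail


# Hypothetical thresholds, in standard deviations from the mean.
for k in (1.0, 2.0, 3.0, 4.0):
    logit_tail, probit_tail = tail_probabilities(k)
    print(f"k={k:.0f}  logistic={logit_tail:.5f}  normal={probit_tail:.5f}")
```

Beyond roughly two standard deviations the logistic tail is the heavier of the two, and the expected number of such extreme observations in a sample of size n is n times the tail probability, so it grows with n. This is one possible reading of the quoted argument, not a reproduction of the cited simulation.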


“…In a similar but more recent study by Cakmakyapan and Goktas (2013) is that executed by Alsoruji et al. (2018), the researchers conducted a simulation to compare the probit and logit models under various sample sizes, dependent-independent variables' correlation coefficients, and latent response variable cut points. In the simulation, the regressand is influenced by three covariates from the standard multivariate normal distribution.…”

Section: Statistics-based Studies (mentioning)

confidence: 99%

Simulation-Based Assessment of Classification Methods: Statistical Models vs. Machine Learning Algorithms

Beram, El-Kotory 2024. The Egyptian Statistical Journal


Current studies evaluated the effectiveness of categorization techniques primarily using real datasets with unreported or unknown statistical features. This simulation-based study aims to compare the performance of statistical models (logistic regression, probit regression, and discriminant analysis) with machine learning algorithms (support vector machines, classification and regression trees, and k-nearest neighbors) to comprehensively understand their suitability for classification tasks. Although simulated datasets are used to control their statistical characteristics, the Pima Indian Diabetes real dataset is used to verify the study findings. The outcomes of this study have the potential to guide practitioners and researchers in selecting the most appropriate modeling technique for their specific needs, ultimately enhancing the accuracy and reliability of classification outcomes across various domains. The results revealed that the two statistical models, probit and logit, outperformed in most simulation scenarios. Markedly, the well-grounded, theory-based models of the logit regression and the probit regression models yielded the most accurate predictions in 78.5% and 83.6% of the simulated scenarios, respectively. Interestingly, the performance of the probit model was the best when the binary response variable was balanced (τ=0.50) and when it was too imbalanced (τ=0.90). Notably, the resulting performance metrics of the real dataset refer to the logit, followed by the probit, being the best-predicting models, which resembles the outcome of the simulation study.

…in the case of binary response variables. Although different categorical response models exist, the most commonly applied are the logistic regression, the probit regression, and the Discriminant Analysis (DA).

The term ML was first introduced by Arthur Samuel in 1959 (Arthur, 1959). ML is the field of study that trains computers/systems to operate independently and improve with experience. Accordingly, ML algorithms construct a model based on sample data (training data) to make predictions or decisions. Furthermore, ML utilizes notions from various disciplines: statistics, mathematics, philosophy, computational complexity, and artificial intelligence. Markedly, interest in applying contemporary ML techniques as alternatives to statistical methods is widely increasing (Lynam et al., 2020). For that, colossal improvement has been achieved by ML methods concerning the simple binary discrimination problem that qualitative response models can target.

Further, it was claimed that the successful use of ML in several fields indicates promising applications in other fields. However, the advantages and superiority of ML-based classification methods compared with more traditional statistical ones need to be assessed, validated, and verified in all fields of application (Côté et al., 2022). With that in mind, such alternative ML algorithms include Decision Trees (DTs), Support Vector Machines (SVMs), K-Nearest Neighbors (KNN), Random Forest (RF), Gaussian Process (G...
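The simulation design summarized in the citation statement for this entry (three covariates from a standard multivariate normal distribution, a latent response, and varying cut points and sample sizes) can be sketched roughly as follows. This covers the data-generating step only. The coefficient values, the error distributions, the seed, and all function names are hypothetical, and the sketch does not vary the correlation between the response and the covariates as the cited study reportedly did.

```python
import math
import random


def simulate_binary_response(n, betas=(1.0, 1.0, 1.0), cut_point=0.0,
                             error="logistic", seed=0):
    """Draw n pairs of (covariates, binary response) from a latent-variable model.

    All default values are hypothetical illustration choices.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        # Standard multivariate normal: independent N(0, 1) covariates.
        x = [rng.gauss(0.0, 1.0) for _ in betas]
        if error == "logistic":
            # Inverse-CDF draw from the standard logistic distribution.
            u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
            e = math.log(u / (1.0 - u))
        else:
            # Standard normal error, matching a probit data-generating process.
            e = rng.gauss(0.0, 1.0)
        latent = sum(b * xi for b, xi in zip(betas, x)) + e
        # The observed response is 1 when the latent value exceeds the cut point.
        data.append((x, 1 if latent > cut_point else 0))
    return data


def share_of_ones(data):
    """Proportion of observations with response equal to 1."""
    return sum(y for _, y in data) / len(data)


# Hypothetical grid of sample sizes and cut points.
for n in (50, 500, 5000):
    for cut in (0.0, 1.0, 2.0):
        sample = simulate_binary_response(n, cut_point=cut)
        print(f"n={n:5d}  cut={cut:.1f}  share of ones={share_of_ones(sample):.3f}")
```

Raising the cut point makes the binary response more imbalanced. A full comparison would then fit both a logit and a probit model to each simulated sample and compare fit statistics, which is omitted here.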
