2020
DOI: 10.21203/rs.3.rs-33216/v3
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Predictors of Outpatients’ No-Show: Big Data Analytics using Apache Spark

Abstract: Outpatients who fail to attend their appointments have a negative impact on the healthcare outcome. Thus, healthcare organizations facing new opportunities, one of them is to improve the quality of healthcare. The main challenges is predictive analysis using techniques capable of handle the huge data generated. We propose a big data framework for identifying subject outpatients’ no-show via feature engineering and machine learning (MLlib) in the Spark platform. This study evaluates the performance of five mach… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 4 publications
1
8
0
Order By: Relevance
“…14 This finding provides an additional important feature to consider in the development of prediction models for missed appointments, as it represents a previously unassessed predictor in such models. [43][44][45] Because of the cross-sectional nature of our data, we are unable to draw temporal relationships between being a portal user and no-shows. An area of future research could be evaluating differences in no shows between appointments made through the portal compared to other means.…”
Section: Discussionmentioning
confidence: 99%
“…14 This finding provides an additional important feature to consider in the development of prediction models for missed appointments, as it represents a previously unassessed predictor in such models. [43][44][45] Because of the cross-sectional nature of our data, we are unable to draw temporal relationships between being a portal user and no-shows. An area of future research could be evaluating differences in no shows between appointments made through the portal compared to other means.…”
Section: Discussionmentioning
confidence: 99%
“…They build predictive models with machine learning algorithm JRip [32] and Hoeffding tree algorithm [33]. In [31], data used is provided by their national health center of authors' country. Five algorithms, random forest, gradient boosting, logistic regression, SVM, and multilayer perceptron, were used [34].…”
Section: Related Workmentioning
confidence: 99%
“…It was observed that 58% of the reviewed papers [15, were based on BD predictive analytics, 18% BD prescriptive analytics [103][104][105] , 11% BD descriptive analytics [15, , whiles 9% (A+B) [100][101][102][103][104][105] and 5% (B+C) [106][107][108] . Few studies in BDA used prescriptive analytics (see Table A1 in Appendix); this can be attributed to fact that big data prescriptive analytics is in its early stage.…”
Section: Methodsmentioning
confidence: 99%
“…Additionally, it was observed that some studies [15,69,73,87,89,93,95,98,101,122] adopted ensemble learning methods, like the random forest, boosting, and bagging, to enhance the power of EL techniques in BDA. ML algorithm's hybridisation is an excellent technique to compensate for the weakness in the individual algorithm [9] .…”
Section: Big Data Platforms Tool In Bdamentioning
confidence: 99%
See 1 more Smart Citation