One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient’s electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient’s heart. The problem is that the symptoms do not always appear and the physician may be mistaken in the diagnosis. Therefore, patients need continuous monitoring through real-time ECG analysis to detect arrhythmias in a timely manner and prevent an eventual incident that threatens the patient’s life. In this research, we used the Structured Streaming module built top on the open-source Apache Spark platform for the first time to implement a machine learning pipeline for real-time cardiac arrhythmias detection and evaluate the impact of using this new module on classification performance metrics and the rate of delay in arrhythmia detection. The ECG data collected from the MIT/BIH database for the detection of three class labels: normal beats, RBBB, and atrial fibrillation arrhythmias. We also developed three decision trees, random forest, and logistic regression multiclass classifiers for data classification where the random forest classifier showed better performance in classification than the other two classifiers. The results show previous results in performance metrics of the classification model and a significant decrease in pipeline runtime by using more class labels compared to previous studies.
The world today faces a new challenge that is unprecedented in the last 100 years. The emergence of a new coronavirus has led to a human catastrophe. The new coronavirus is the cause of the Covid-19 disease, which kills many people in the world every day. Scientists in various sciences have been looking for solutions to this problem so far. In addition to general vaccination, maintaining social distance and hygienic principles are the most well-known strategies to prevent Covid-19 infection. In this research, we have tried to examine the symptoms of Covid-19 cases through different supervised machine learning methods. We solved the class imbalance problem using the SMOTE up-sampling method and then developed some classification models to predict the recovery or death of patients. Besides, we implemented a rule-based technique to identify important symptoms that affect patients' fate and calculate the range of values in these features that lead to recovery or death of patients. Our results showed that the random forest model with 94% accuracy, 95.2% sensitivity, 92.7% specification, 93.2% precision, and 94.2% F-score outperforms state-of-the-art classification models. Finally, we identified the ten most significant rules in the data set. The rules state that different combinations of 6 features in certain ranges of their values lead to patients' recovery with 90% confidence. In conclusion, the classification results in this study show better performance than recent researches. Besides, help physicians consider other important factors in improving health services to different groups of Covid-19 patients.
The emergence of the new coronavirus in late 2019 further highlighted the human need for solutions to explore various aspects of deadly pandemics. Providing these solutions will enable humans to be more prepared for dealing with possible future pandemics. In addition, it helps governments implement strategies to tackle and control infectious diseases similar to COVID-19 faster than ever before. In this article, we used the social network analysis (SNA) method to identify high-risk areas of the new coronavirus in Iran. First, we developed the mobility network through the transfer of passengers (edges) between the provinces (nodes) of Iran and then evaluated the in-degree and page rank centralities of the network. Next, we developed 2 Poisson regression (PR) models to predict high-risk areas of the disease in different populations (moderator) using the mobility network centralities (independent variables) and the number of patients (dependent variable). The P-value of .001 for both prediction models confirmed a meaningful interaction between our variables. Besides, the PR models revealed that in higher populations, with the increase of network centralities, the number of patients increases at a higher rate than in lower populations, and vice versa. In conclusion, our method helps governments impose more restrictions on high-risk areas to handle the COVID-19 outbreak and provides a viable solution for accelerating operations against future pandemics similar to the coronavirus.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.