In recent years, heart disease is one of the leading cause of death for both women and men. So, heart disease prediction is considered as a significant part in the clinical data analysis. Standard data mining techniques like Support Vector Machine (SVM), Naïve Bayes and other machine learning techniques used in the earlier research for heart disease prediction. These methods are not sufficient for effective heart disease prediction due to insufficient test data. In this research, Bi-directional Long Short Term Memory with Conditional Random Field (BiLSTM-CRF) has been proposed to increase the efficiency of heart disease prediction. The input medical data were analyzed in a bidirectional manner for effective analysis, and CRF provided the linear relationship between the features. The BiLSTM-CRF method has been tested on the Cleveland dataset to analyze the performance and compared with existing methods. The results showed that the proposed BiLSTM-CRF outperformed the existing methods in heart disease prediction. The average accuracy of the proposed BiLSTM-CRF is 90.04%, which is higher than the existing methods.
Heart disease is a serious disease that causes sudden death among 80% of the people around the world. The traditional models performed predictive analytics using machine learning techniques to make a better decision. For better decision making in heart disease prediction, the big data analysis shows the great opportunities to predict the future health status from health parameters and provide best outcomes. However, the traditional decision making models had traffic data or contained noise and uncertainty was unpredictable as the data ambiguity emerged. In order to overcome such an issue, the big data is used to ensure the medical service which is mostly needed in a timely manner and for accurate diagnosis. The pre-processing of the medical data acquired from Cleveland heart disease UCI datasets has a vast number of attributes which helps to predict the heart disease. The data are contaminated with the noise and some of the data are missing, so the pre-processing using Min max Normalization is performed to remove contaminated noise acquired in the data which is taken from the UCI repository dataset. The proposed Fuzzy Deep Convolution Network (FDCN) permits the input features for fuzzification process that uses transformed features. The fuzzification process eliminates the redundant or irrelevant fuzzified features and overcomes the system complexity problems. The proposed FDCN obtains accuracy of 95.56 % and 92 % of F-score shows better results when compared with the existing KNN-DT, Naive Bayes, and Random Forest algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.