Xinglong Yuan scite author profile

The outcomes of hypertension refer to the death or serious complications (such as myocardial infarction or stroke) that may occur in patients with hypertension. The outcomes of hypertension are very concerning for patients and doctors, and are ideally avoided. However, there is no satisfactory method for predicting the outcomes of hypertension. Therefore, this paper proposes a prediction method for outcomes based on physical examination indicators of hypertension patients. In this work, we divide the patients’ outcome prediction into two steps. The first step is to extract the key features from the patients’ many physical examination indicators. The second step is to use the key features extracted from the first step to predict the patients’ outcomes. To this end, we propose a model combining recursive feature elimination with a cross-validation method and classification algorithm. In the first step, we use the recursive feature elimination algorithm to rank the importance of all features, and then extract the optimal features subset using cross-validation. In the second step, we use four classification algorithms (support vector machine (SVM), C4.5 decision tree, random forest (RF), and extreme gradient boosting (XGBoost)) to accurately predict patient outcomes by using their optimal features subset. The selected model prediction performance evaluation metrics are accuracy, F1 measure, and area under receiver operating characteristic curve. The 10-fold cross-validation shows that C4.5, RF, and XGBoost can achieve very good prediction results with a small number of features, and the classifier after recursive feature elimination with cross-validation feature selection has better prediction performance. Among the four classifiers, XGBoost has the best prediction performance, and its accuracy, F1, and area under receiver operating characteristic curve (AUC) values are 94.36%, 0.875, and 0.927, respectively, using the optimal features subset. This article’s prediction of hypertension outcomes contributes to the in-depth study of hypertension complications and has strong practical significance.

show abstract

Value of a Machine Learning Approach for Predicting Clinical Outcomes in Young Patients With Hypertension

Yuan

Wang

et al. 2020

Hypertension

View full text Add to dashboard Cite

Risk stratification of young patients with hypertension remains challenging. Generally, machine learning (ML) is considered a promising alternative to traditional methods for clinical predictions because it is capable of processing large amounts of complex data. We, therefore, explored the feasibility of an ML approach for predicting outcomes in young patients with hypertension and compared its performance with that of approaches now commonly used in clinical practice. Baseline clinical data and a composite end point—comprising all-cause death, acute myocardial infarction, coronary artery revascularization, new-onset heart failure, new-onset atrial fibrillation/atrial flutter, sustained ventricular tachycardia/ventricular fibrillation, peripheral artery revascularization, new-onset stroke, end-stage renal disease—were evaluated in 508 young patients with hypertension (30.83±6.17 years) who had been treated at a tertiary hospital. Construction of the ML model, which consisted of recursive feature elimination, extreme gradient boosting, and 10-fold cross-validation, was performed at the 33-month follow-up evaluation, and the model’s performance was compared with that of the Cox regression and recalibrated Framingham Risk Score models. An 11-variable combination was considered most valuable for predicting outcomes using the ML approach. The C statistic for identifying patients with composite end points was 0.757 (95% CI, 0.660–0.854) for the ML model, whereas for Cox regression model and the recalibrated Framingham Risk Score model it was 0.723 (95% CI, 0.636–0.810) and 0.529 (95% CI, 0.403–0.655). The ML approach was comparable with Cox regression for determining the clinical prognosis of young patients with hypertension and was better than that of the recalibrated Framingham Risk Score model.

show abstract

Sequential Pattern Mining Algorithm Based on Text Data: Taking the Fault Text Records as an Example

et al. 2018

View full text Add to dashboard Cite

Sequential pattern mining (SPM) is an effective and important method for analyzing time series. This paper proposed a SPM algorithm to mine fault sequential patterns in text data. Because the structure of text data is poor and there are many different forms of text expression for the same concept, the traditional SPM algorithm cannot be directly applied to text data. The proposed algorithm is designed to solve this problem. First, this study measured the similarity of fault text data and classified similar faults into one class. Next, this paper proposed a new text similarity measurement model based on the word embedding distance. Compared with the classic text similarity measurement method, this model can achieve good results in short text classification. Then, on the basis of fault classification, this paper proposed the SPM algorithm with an event window, which is a time soft constraint for obtaining a certain number of sequential patterns according to needs. Finally, this study used the fault text records of a certain aircraft as experimental data for mining fault sequential patterns. Experiment showed that this algorithm can effectively mine sequential patterns in text data. The proposed algorithm can be widely applied to text time series data in many fields such as industry, business, finance and so on.

show abstract

Decision-Making Method based on Mixed Integer Linear Programming and Rough Set: A Case Study of Diesel Engine Quality and Assembly Clearance Data

Chang

Yuan

et al. 2019

Sustainability

View full text Add to dashboard Cite

The purpose of this paper is to establish a decision-making system for assembly clearance parameters and machine quality level by analyzing the data of assembly clearance parameters of diesel engine. Accordingly, we present an extension of the rough set theory based on mixed-integer linear programming (MILP) for rough set-based classification (MILP-FRST). Traditional rough set theory has two shortcomings. First, it is sensitive to noise data, resulting in a low accuracy of decision systems based on rough sets. Second, in the classification problem based on rough sets, the attributes cannot be automatically determined. MILP-FRST has the advantages of MILP in resisting noisy data and has the ability to select attributes flexibly and automatically. In order to prove the validity and advantages of the proposed model, we used the machine quality data and assembly clearance data of 29 diesel engines of a certain type to validate the proposed model. Experiments show that the proposed decision-making method based on MILP-FRST model can accurately determine the quality level of the whole machine according to the assembly clearance parameters.

show abstract

Improvement of K-means clustering algorithm based on MIP optimization

Chang

Yuan

Zhou

2018

J. Phys.: Conf. Ser.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xinglong Yuan

A Machine-Learning-Based Prediction Method for Hypertension Outcomes Based on Medical Data

Value of a Machine Learning Approach for Predicting Clinical Outcomes in Young Patients With Hypertension

Sequential Pattern Mining Algorithm Based on Text Data: Taking the Fault Text Records as an Example

Decision-Making Method based on Mixed Integer Linear Programming and Rough Set: A Case Study of Diesel Engine Quality and Assembly Clearance Data

Improvement of K-means clustering algorithm based on MIP optimization

Contact Info

Product

Resources

About