Wei Ying Tan scite author profile

This study looked at novel data sources for cardiovascular risk prediction including detailed lifestyle questionnaire and continuous blood pressure monitoring, using ensemble machine learning algorithms (MLAs). The reference conventional risk score compared against was the Framingham Risk Score (FRS). The outcome variables were low or high risk based on calcium score 0 or calcium score 100 and above. Ensemble MLAs were built based on naive bayes, random forest and support vector classifier for low risk and generalized linear regression, support vector regressor and stochastic gradient descent regressor for high risk categories. MLAs were trained on 600 Southeast Asians aged 21 to 69 years free of cardiovascular disease. All MLAs outperformed the FRS for low and high-risk categories. MLA based on lifestyle questionnaire only achieved AUC of 0.715 (95% CI 0.681, 0.750) and 0.710 (95% CI 0.653, 0.766) for low and high risk respectively. Combining all groups of risk factors (lifestyle survey questionnaires, clinical blood tests, 24-h ambulatory blood pressure and heart rate monitoring) along with feature selection, prediction of low and high CVD risk groups were further enhanced to 0.791 (95% CI 0.759, 0.822) and 0.790 (95% CI 0.745, 0.836). Besides conventional predictors, self-reported physical activity, average daily heart rate, awake blood pressure variability and percentage time in diastolic hypertension were important contributors to CVD risk classification.

show abstract

Patient similarity analytics for explainable clinical risk prediction

Fang

Tan

et al. 2021

BMC Med Inform Decis Mak

View full text Add to dashboard Cite

Background Clinical risk prediction models (CRPMs) use patient characteristics to estimate the probability of having or developing a particular disease and/or outcome. While CRPMs are gaining in popularity, they have yet to be widely adopted in clinical practice. The lack of explainability and interpretability has limited their utility. Explainability is the extent of which a model’s prediction process can be described. Interpretability is the degree to which a user can understand the predictions made by a model. Methods The study aimed to demonstrate utility of patient similarity analytics in developing an explainable and interpretable CRPM. Data was extracted from the electronic medical records of patients with type-2 diabetes mellitus, hypertension and dyslipidaemia in a Singapore public primary care clinic. We used modified K-nearest neighbour which incorporated expert input, to develop a patient similarity model on this real-world training dataset (n = 7,041) and validated it on a testing dataset (n = 3,018). The results were compared using logistic regression, random forest (RF) and support vector machine (SVM) models from the same dataset. The patient similarity model was then implemented in a prototype system to demonstrate the identification, explainability and interpretability of similar patients and the prediction process. Results The patient similarity model (AUROC = 0.718) was comparable to the logistic regression (AUROC = 0.695), RF (AUROC = 0.764) and SVM models (AUROC = 0.766). We packaged the patient similarity model in a prototype web application. A proof of concept demonstrated how the application provided both quantitative and qualitative information, in the form of patient narratives. This information was used to better inform and influence clinical decision-making, such as getting a patient to agree to start insulin therapy. Conclusions Patient similarity analytics is a feasible approach to develop an explainable and interpretable CRPM. While the approach is generalizable, it can be used to develop locally relevant information, based on the database it searches. Ultimately, such an approach can generate a more informative CRPMs which can be deployed as part of clinical decision support tools to better facilitate shared decision-making in clinical practice.

show abstract

A Machine Learning Approach for Early Diagnosis of Cognitive Impairment Using Population-Based Data

Tan

Hargreaves

Chen

et al. 2023

JAD

View full text Add to dashboard Cite

Background: The major mechanisms of dementia and cognitive impairment are vascular and neurodegenerative processes. Early diagnosis of cognitive impairment can facilitate timely interventions to mitigate progression. Objective: This study aims to develop a reliable machine learning (ML) model using socio-demographics, vascular risk factors, and structural neuroimaging markers for early diagnosis of cognitive impairment in a multi-ethnic Asian population. Methods: The study consisted of 911 participants from the Epidemiology of Dementia in Singapore study (aged 60– 88 years, 49.6% male). Three ML classifiers, logistic regression, support vector machine, and gradient boosting machine, were developed. Prediction results of independent classifiers were combined in a final ensemble model. Model performances were evaluated on test data using F1 score and area under the receiver operating curve (AUC) methods. Post modelling, SHapely Additive exPlanation (SHAP) was applied on the prediction results to identify the predictors that contribute most to the cognitive impairment prediction. Findings: The final ensemble model achieved a F1 score and AUC of 0.87 and 0.80 respectively. Accuracy (0.83), sensitivity (0.86), specificity (0.74) and predictive values (positive 0.88 negative 0.72) of the ensemble model were higher compared to the independent classifiers. Age, ethnicity, highest education attainment and neuroimaging markers were identified as important predictors of cognitive impairment. Conclusion: This study demonstrates the feasibility of using ML tools to integrate multiple domains of data for reliable diagnosis of early cognitive impairment. The ML model uses easy-to-obtain variables and is scalable for screening individuals with a high risk of developing dementia in a population-based setting.

show abstract

The effect of oral diabetes medications on glycated haemoglobin (HbA1c) in Asians in primary care: a retrospective cohort real-world data study

Fang

Gao

Tan

et al. 2022

BMC Med

View full text Add to dashboard Cite

Background Clinical trials have demonstrated that initiating oral anti-diabetic drugs (OADs) significantly reduce glycated hemoglobin (HbA1c) levels. However, variability in lifestyle modifications and OAD adherence impact on their actual effect on glycemic control. Furthermore, evidence on dose adjustments and discontinuation of OAD on HbA1c is lacking. This study aims to use real-world data to determine the effect of OAD initiation, up-titration, down-titration, and discontinuation on HbA1c levels, among Asian patients managed in primary care. Methods A retrospective cohort study over a 5-year period, from Jan 2015 to Dec 2019 was conducted on a cohort of multi-ethnic adult Asian patients with clinical diagnosis of type 2 diabetes mellitus (T2DM) managed by a network of primary care clinics in Singapore. Nine OADs from five different classes (biguanides, sulphonyurea, dipeptidyl peptidase-4 [DPP-4] inhibitors, sodium-glucose cotransporter-2 [SGLT-2] inhibitors, and alpha-glucosidase inhibitors) were evaluated. Patients were grouped into “No OAD”, “Non-titrators,” and “Titrators” cohorts based on prescribing patterns. For the “Titrators” cohort, the various OAD titrations were identified. Subsequently, a descriptive analysis of HbA1c values before and after each titration was performed to compute a mean difference for each unique titration identified. Results Among the cohort of 57,910 patients, 43,338 of them had at least one OAD titration, with a total of 76,990 pairs of HbA1c values associated with an OAD titration. There were a total of 206 unique OAD titrations. Overall, initiation of OADs resulted in a reduction of HbA1c by 3 to 12 mmol/mol (0.3 to 1.1%), respectively. These results were slightly lower than those reported in clinical trials of 6 to 14 mmol/mol (0.5 to 1.25%). The change of HbA1c levels due to up-titration, down-titration, and discontinuation were −1 to −8 mmol/mol (−0.1 to −0.7%), +1 to 7 mmol/mol (+0.1 to +0.6%), and +2 to 11 mmol/mol (+0.2 to +1.0%), respectively. The HbA1c lowering effect of initiating newer OADs, namely DPP-4 inhibitors and SGLT-2 inhibitors was 8 to 11 mmol/mol (0.7 to 0.9%) and 7 to 11 mmol/mol (0.6 to 1.0%), respectively. Conclusion The real-world data on Asians with T2DM in this study show that the magnitudes of OAD initiation and dose titration are marginally lower than the results from clinical trials. During shared decision-making in selecting treatment options, the results enable physicians to communicate realistic expectation of the effect of oral medications on the glycemic control of their patients in primary care.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wei Ying Tan

Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction

Patient similarity analytics for explainable clinical risk prediction

A Machine Learning Approach for Early Diagnosis of Cognitive Impairment Using Population-Based Data

The effect of oral diabetes medications on glycated haemoglobin (HbA1c) in Asians in primary care: a retrospective cohort real-world data study

Contact Info

Product

Resources

About