James Feng scite author profile

Supplementary key words bioinformatic analysis • diagnostic tools • dyslipidemias • genetic testing • human genetics • next-generation sequencing • triglycerides • lipoprotein lipase Elevations in fasting plasma triglyceride (TG) levels are diagnosed as hypertriglyceridemia (HTG). TG levels 10 mmol/l (885 mg/dl) are classified as severe HTG (1) and are seen in 1 in 600 individuals (2). As a relatively common form of dyslipidemia with serious health complications that include pancreatitis (2, 3), there is a focus on identifying and understanding factors that can increase susceptibility to or cause severe HTG.A combination of rare single-nucleotide variants and common SNPs can contribute permissively or causally toward the presentation of this complex disease (3). The monogenic form of severe HTG, also referred to as familial chylomicronemia syndrome (FCS), is caused by bi-allelic variants disrupting canonical genes involved in TG metabolism, such as LPL, APOC2, APOA5, glycosylphosphatidylinositolanchored HDL-binding protein 1 (GPIHBP1), or lipase Abstract Severe hypertriglyceridemia (HTG) is a relatively common form of dyslipidemia with a complex pathophysiology and serious health complications. HTG can develop in the presence of rare genetic factors disrupting genes involved in the triglyceride (TG) metabolic pathway, including large-scale copy-number variants (CNVs). Improvements in next-generation sequencing technologies and bioinformatic analyses have better allowed assessment of CNVs as possible causes of or contributors to severe HTG. We screened targeted sequencing data of 632 patients with severe HTG and identified partial deletions of the LPL gene, encoding the central enzyme involved in the metabolism of TG-rich lipoproteins, in four individuals (0.63%). We confirmed the genomic breakpoints in each patient with Sanger sequencing. Three patients carried an identical heterozygous deletion spanning the 5′ untranslated region (UTR) to LPL exon 2, and one patient carried a heterozygous deletion spanning the 5′UTR to LPL exon 1. All four heterozygous CNV carriers were determined to have multifactorial severe HTG. The predicted null nature of our identified LPL deletions may contribute to relatively higher TG levels and a more severe clinical phenotype than other forms of genetic variation associated with the disease, particularly in the polygenic state. The identification of novel CNVs in patients with severe HTG suggests that methods for CNV detection should be included in the diagnostic workup and molecular genetic

show abstract

Clinical and bacteriological characteristics associated with clustering of multidrug-resistant tuberculosis

Feng

Jarlsberg

Salcedo³

et al. 2017

int j tuberc lung dis

View full text Add to dashboard Cite

show abstract

Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines

et al. 2021

View full text Add to dashboard Cite

Background As per the 2017 WHO fact sheet, Coronary Artery Disease (CAD) is the primary cause of death in the world, and accounts for 31% of total fatalities. The unprecedented 17.6 million deaths caused by CAD in 2016 underscores the urgent need to facilitate proactive and accelerated pre-emptive diagnosis. The innovative and emerging Machine Learning (ML) techniques can be leveraged to facilitate early detection of CAD which is a crucial factor in saving lives. The standard techniques like angiography, that provide reliable evidence are invasive and typically expensive and risky. In contrast, ML model generated diagnosis is non-invasive, fast, accurate and affordable. Therefore, ML algorithms can be used as a supplement or precursor to the conventional methods. This research demonstrates the implementation and comparative analysis of K Nearest Neighbor (k-NN) and Random Forest ML algorithms to achieve a targeted “At Risk” CAD classification using an emerging set of 35 cytokine biomarkers that are strongly indicative predictive variables that can be potential targets for therapy. To ensure better generalizability, mechanisms such as data balancing, repeated k-fold cross validation for hyperparameter tuning, were integrated within the models. To determine the separability efficacy of “At Risk” CAD versus Control achieved by the models, Area under Receiver Operating Characteristic (AUROC) metric is used which discriminates the classes by exhibiting tradeoff between the false positive and true positive rates. Results A total of 2 classifiers were developed, both built using 35 cytokine predictive features. The best AUROC score of .99 with a 95% Confidence Interval (CI) (.982,.999) was achieved by the Random Forest classifier using 35 cytokine biomarkers. The second-best AUROC score of .954 with a 95% Confidence Interval (.929,.979) was achieved by the k-NN model using 35 cytokines. A p-value of less than 7.481e-10 obtained by an independent t-test validated that Random Forest classifier was significantly better than the k-NN classifier with regards to the AUROC score. Presently, as large-scale efforts are gaining momentum to enable early, fast, reliable, affordable, and accessible detection of individuals at risk for CAD, the application of powerful ML algorithms can be leveraged as a supplement to conventional methods such as angiography. Early detection can be further improved by incorporating 65 novel and sensitive cytokine biomarkers. Investigation of the emerging role of cytokines in CAD can materially enhance the detection of risk and the discovery of mechanisms of disease that can lead to new therapeutic modalities.

show abstract

Machine Learning and Statistical Approaches for Classification of Risk of Coronary Artery Disease using Plasma Cytokines.

Saharan

Nagar

Creasy

et al. 2020

Preprint

View full text Add to dashboard Cite

BackgroundAs per the 2017 WHO fact sheet, Coronary Artery Disease (CAD) is the primary cause of death in the world, and accounts for 31% of total fatalities. The unprecedented 17.6 million deaths caused by CAD in 2016 underscores the urgent need to facilitate proactive and accelerated pre-emptive diagnosis. The innovative and emerging Machine Learning (ML) techniques can be leveraged to facilitate early detection of CAD which is a crucial factor in saving lives. The standard techniques like angiography, that provide reliable evidence are invasive and typically very expensive and risky. In contrast, ML model generated diagnosis is non-invasive, fast, accurate and affordable. Therefore, it can be used as a supplement or precursor to the conventional methods. This research demonstrates the implementation of K Nearest Neighbor (k-NN) and Random Forest ML algorithms to achieve a targeted “At Risk” CAD classification using an emerging set of 35 cytokine biomarkers that are strongly indicative predictive variables that can be potential targets for therapy. To ensure better generalizability, mechanisms such as data balancing, k-fold cross validation for hyperparameter tuning, feature selection via feature importance identification were integrated within the models.ResultsA total of 5 classifiers were developed, with two built using 35 cytokine predictive features and three built using a subset of cytokines, selected by variable importance techniques namely Random Forest, ReliefF and Boruta. The best Area under Receiver Operating Characteristic (AUROC) based accuracy of .99 was achieved by the Random Forest classifier with 35 cytokine biomarkers. The second-best AUROC accuracy was achieved by the k-NN model using cytokines selected by the Random Forest variable importance selection mechanism.ConclusionsPresently, as large-scale efforts are gaining momentum to enable early, fast, reliable, affordable, and accessible detection of individuals at risk for CAD, the application of powerful ML algorithms can be leveraged as a supplement to the conventional treatments such as angiography. The early detection can be further improved by incorporating 65 novel and sensitive cytokines biomarkers. Investigation of the emerging role of cytokines in CAD can materially enhance the detection of risk and the discovery of mechanisms of disease that can lead to new therapeutic modalities.

show abstract

Machine Learning and Statistical Approaches for Classification of Risk of Coronary Artery Disease using Plasma Cytokines.

Saharan

Nagar

Creasy

et al. 2021

Preprint

View full text Add to dashboard Cite

Background As per the 2017 WHO fact sheet, Coronary Artery Disease (CAD) is the primary cause of death in the world, and accounts for 31% of total fatalities. The unprecedented 17.6 million deaths caused by CAD in 2016 underscores the urgent need to facilitate proactive and accelerated pre-emptive diagnosis. The innovative and emerging Machine Learning (ML) techniques can be leveraged to facilitate early detection of CAD which is a crucial factor in saving lives. The standard techniques like angiography, that provide reliable evidence are invasive and typically expensive and risky. In contrast, ML model generated diagnosis is non-invasive, fast, accurate and affordable. Therefore, ML algorithms can be used as a supplement or precursor to the conventional methods. This research demonstrates the implementation and comparative analysis of K Nearest Neighbor (k-NN) and Random Forest ML algorithms to achieve a targeted “At Risk” CAD classification using an emerging set of 35 cytokine biomarkers that are strongly indicative predictive variables that can be potential targets for therapy. To ensure better generalizability, mechanisms such as data balancing, repeated k-fold cross validation for hyperparameter tuning, were integrated within the models. To determine the separability efficacy of “At Risk” CAD versus Control achieved by the models, Area under Receiver Operating Characteristic (AUROC) metric is used which discriminates the classes by exhibiting tradeoff between the false positive and true positive rates.Results A total of 2 classifiers were developed, both built using 35 cytokine predictive features. The best AUROC score of .99 with a 95% Confidence Interval(CI) (.982,.999) was achieved by the Random Forest classifier using 35 cytokine biomarkers. The second-best AUROC score of .954 with a 95% Confidence Interval (.929,.979) was achieved by the k-NN model using 35 cytokines. A p-value of less than 7.481e-10 obtained by an independent t-test validated that Random Forest classifier was significantly better than the k-NN classifier with regards to the AUROC score.Presently, as large-scale efforts are gaining momentum to enable early, fast, reliable, affordable, and accessible detection of individuals at risk for CAD, the application of powerful ML algorithms can be leveraged as a supplement to conventional methods such as angiography. Early detection can be further improved by incorporating 65 novel and sensitive cytokine biomarkers. Investigation of the emerging role of cytokines in CAD can materially enhance the detection of risk and the discovery of mechanisms of disease that can lead to new therapeutic modalities.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

James Feng

Partial LPL deletions: rare copy-number variants contributing towards severe hypertriglyceridemia

Clinical and bacteriological characteristics associated with clustering of multidrug-resistant tuberculosis

Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines

Machine Learning and Statistical Approaches for Classification of Risk of Coronary Artery Disease using Plasma Cytokines.

Machine Learning and Statistical Approaches for Classification of Risk of Coronary Artery Disease using Plasma Cytokines.

Contact Info

Product

Resources

About