Dacosta Yeboah scite author profile

We present a framework for an explainable and statistically validated ensemble clustering model applied to Traumatic Brain Injury (TBI). The objective of our analysis is to identify patient injury severity subgroups and key phenotypes that delineate these subgroups using varied clinical and computed tomography data. Explainable and statistically-validated models are essential because a datadriven identification of subgroups is an inherently multidisciplinary undertaking. In our case, this procedure yielded six distinct patient subgroups with respect to mechanism of injury, severity of presentation, anatomy, psychometric, and functional outcome. This framework for ensemble cluster analysis fully integrates statistical methods at several stages of analysis to enhance the quality and the explainability of results. This methodology is applicable to other clinical data sets that exhibit significant heterogeneity as well as other diverse data science applications in biomedicine and elsewhere.

show abstract

A PheWAS Model of Autism Spectrum Disorder

Matta

Dobrino

Howard

et al. 2021

View full text Add to dashboard Cite

Heterogeneity in Blood Biomarker Trajectories After Mild TBI Revealed by Unsupervised Learning

Bui

Yeboah

Steinmeister

et al. 2022

IEEE/ACM Trans. Comput. Biol. and Bioinf.

View full text Add to dashboard Cite

Statistical Comparative Analysis and Evaluation of Validation Indices for Clustering Optimization

Nguyen

Viehman

Yeboah

et al. 2020

View full text Add to dashboard Cite

Connecting phenotype to genotype: PheWAS-inspired analysis of autism spectrum disorder

Matta

Dobrino

Yeboah

et al. 2022

Front. Hum. Neurosci.

View full text Add to dashboard Cite

Autism Spectrum Disorder (ASD) is extremely heterogeneous clinically and genetically. There is a pressing need for a better understanding of the heterogeneity of ASD based on scientifically rigorous approaches centered on systematic evaluation of the clinical and research utility of both phenotype and genotype markers. This paper presents a holistic PheWAS-inspired method to identify meaningful associations between ASD phenotypes and genotypes. We generate two types of phenotype-phenotype (p-p) graphs: a direct graph that utilizes only phenotype data, and an indirect graph that incorporates genotype as well as phenotype data. We introduce a novel methodology for fusing the direct and indirect p-p networks in which the genotype data is incorporated into the phenotype data in varying degrees. The hypothesis is that the heterogeneity of ASD can be distinguished by clustering the p-p graph. The obtained graphs are clustered using network-oriented clustering techniques, and results are evaluated. The most promising clusterings are subsequently analyzed for biological and domain-based relevance. Clusters obtained delineated different aspects of ASD, including differentiating ASD-specific symptoms, cognitive, adaptive, language and communication functions, and behavioral problems. Some of the important genes associated with the clusters have previous known associations to ASD. We found that clusters based on integrated genetic and phenotype data were more effective at identifying relevant genes than clusters constructed from phenotype information alone. These genes included five with suggestive evidence of ASD association and one known to be a strong candidate.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dacosta Yeboah

An Explainable and Statistically Validated Ensemble Clustering Model Applied to the Identification of Traumatic Brain Injury Subgroups

A PheWAS Model of Autism Spectrum Disorder

Heterogeneity in Blood Biomarker Trajectories After Mild TBI Revealed by Unsupervised Learning

Statistical Comparative Analysis and Evaluation of Validation Indices for Clustering Optimization

Connecting phenotype to genotype: PheWAS-inspired analysis of autism spectrum disorder

Contact Info

Product

Resources

About