A performance metric is a tool to measure the correctness of a trained Machine Learning (ML) model. Numerous performance metrics have been developed for classification problems making it overwhelming to select the appropriate one since each of them represents a particular aspect of the model. Furthermore, selection of a performance metric becomes harder for problems with imbalanced and/or small datasets. Therefore, in clinical studies where datasets are frequently imbalanced and, in situations when the prevalence of a disease is low or the collection of patient samples is difficult, deciding on a suitable metric for performance evaluation of an ML model becomes quite challenging. The most common approach to address this problem is measuring multiple metrics and compare them to identify the best-performing ML model. However, comparison of multiple metrics is laborious and prone to user preference bias. Furthermore, evaluation metrics are also required by ML model optimization techniques such as hyperparameter tuning, where we train many models, each with different parameters, and compare their performances to identify the best-performing parameters. In such situations, it becomes almost impossible to assess different models by comparing multiple metrics. Here, we propose a new metric called Machine Learning Cumulative Performance Score (MLcps) as a Python package for classification problems. MLcps combines multiple pre-computed performance metrics into one metric that conserves the essence of all pre-computed metrics for a particular model. We tested MLcps on 4 different publicly available biological datasets and the results reveal that it provides a comprehensive picture of overall model robustness. MLcps is available at https://pypi.org/project/MLcps/ and cases of use are available at https://mybinder.org/v2/gh/FunctionalUrology/MLcps.git/main.
Background: Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance. Results: To address these challenges, we have developed a novel tool called Machine Learning Made Easy (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating four essential functionalities, namely Data Exploration, AutoML, CustomML, and Visualization, MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on six distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations. Conclusion: MLme serves as a valuable resource for leveraging machine learning (ML) to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.
Focal Cortical Dysplasia (FCD) is a frequent cause of drug-resistant focal epilepsy in children and young adults. The international FCD classifications of 2011 and 2022 have identified several clinico-pathological subtypes, either occurring isolated, i.e., FCD ILAE Type 1 or 2, or in association with a principal cortical lesion, i.e., FCD Type 3. Here, we addressed the DNA methylation signature of a previously described new subtype of FCD 3D occurring in the occipital lobe of very young children and microscopically defined by neuronal cell loss in cortical layer 4. We studied the DNA methylation profile using 850 K BeadChip arrays in a retrospective cohort of 104 patients with FCD 1 A, 2 A, 2B, 3D, TLE without FCD, and 16 postmortem specimens without neurological disorders as controls, operated in China or Germany. DNA was extracted from formalin-fixed paraffin-embedded tissue blocks with microscopically confirmed lesions, and DNA methylation profiles were bioinformatically analyzed with a recently developed deep learning algorithm. Our results revealed a distinct position of FCD 3D in the DNA methylation map of common FCD subtypes, also different from non-FCD epilepsy surgery controls or non-epileptic postmortem controls. Within the FCD 3D cohort, the DNA methylation signature separated three histopathology subtypes, i.e., glial scarring around porencephalic cysts, loss of layer 4, and Rasmussen encephalitis. Differential methylation in FCD 3D with loss of layer 4 mapped explicitly to biological pathways related to neurodegeneration, biogenesis of the extracellular matrix (ECM) components, axon guidance, and regulation of the actin cytoskeleton. Our data suggest that DNA methylation signatures in cortical malformations are not only of diagnostic value but also phenotypically relevant, providing the molecular underpinnings of structural and histopathological features associated with epilepsy. Further studies will be necessary to confirm these results and clarify their functional relevance and epileptogenic potential in these difficult-to-treat children.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.