Daniel Walke scite author profile

Taxonomic and functional characterization of microbial communities from diverse environments such as the human gut or biogas plants by multi-omics methods plays an ever more important role. Researchers assign all identified genes, transcripts, or proteins to biological pathways to better understand the function of single species and microbial communities. However, due to the versality of microbial metabolism and a still-increasing number of newly biological pathways, linkage to standard pathway maps such as the KEGG central carbon metabolism is often problematic. We successfully implemented and validated a new user-friendly, stand-alone web application, the MPA_Pathway_Tool. It consists of two parts, called ‘Pathway-Creator’ and ‘Pathway-Calculator’. The ‘Pathway-Creator’ enables an easy set-up of user-defined pathways with specific taxonomic constraints. The ‘Pathway-Calculator’ automatically maps microbial community data from multiple measurements on selected pathways and visualizes the results. The MPA_Pathway_Tool is implemented in Java and ReactJS.

show abstract

Decision tree learning in Neo4j on homogeneous and unconnected graph nodes from biological and clinical datasets

Mondal

Ahmed

et al. 2023

BMC Med Inform Decis Mak

View full text Add to dashboard Cite

Background Graph databases enable efficient storage of heterogeneous, highly-interlinked data, such as clinical data. Subsequently, researchers can extract relevant features from these datasets and apply machine learning for diagnosis, biomarker discovery, or understanding pathogenesis. Methods To facilitate machine learning and save time for extracting data from the graph database, we developed and optimized Decision Tree Plug-in (DTP) containing 24 procedures to generate and evaluate decision trees directly in the graph database Neo4j on homogeneous and unconnected nodes. Results Creation of the decision tree for three clinical datasets directly in the graph database from the nodes required between 0.059 and 0.099 s, while calculating the decision tree with the same algorithm in Java from CSV files took 0.085–0.112 s. Furthermore, our approach was faster than the standard decision tree implementations in R (0.62 s) and equal to Python (0.08 s), also using CSV files as input for small datasets. In addition, we have explored the strengths of DTP by evaluating a large dataset (approx. 250,000 instances) to predict patients with diabetes and compared the performance against algorithms generated by state-of-the-art packages in R and Python. By doing so, we have been able to show competitive results on the performance of Neo4j, in terms of quality of predictions as well as time efficiency. Furthermore, we could show that high body-mass index and high blood pressure are the main risk factors for diabetes. Conclusion Overall, our work shows that integrating machine learning into graph databases saves time for additional processes as well as external memory, and could be applied to a variety of use cases, including clinical applications. This provides user with the advantages of high scalability, visualization and complex querying.

show abstract

MPA_Pathway_Tool: User-friendly, automatic assignment of microbial community data on metabolic pathways

Walke

Schallert

Ramesh

et al. 2021

Preprint

View full text Add to dashboard Cite

MotivationTaxonomic and functional characterization of microbial communities from diverse environments such as the human gut or biogas plants by multi-omics methods plays an ever more important role. Researchers assign all identified genes, transcripts, or proteins to biological pathways to better understand the function of single species and microbial communities. However, due to the versatility of microbial metabolism and a still increasing number of new biological pathways, linkage to standard pathway maps such as the KEGG (Kyoto Encyclopedia of Genes and Genomes) central carbon metabolism is often problematic.ResultsWe successfully implemented and validated a new user-friendly, stand-alone web application, the MPA_Pathway_Tool. It consists of two parts, called ‘Pathway-Creator’ and ‘Pathway-Calculator’. The ‘Pathway-Creator’ enables an easy setup of user-defined pathways with specific taxonomic constraints. The ‘Pathway-Calculator’ automatically maps microbial community data from multiple measurements on selected pathways and visualizes the results.Availability and ImplementationThe MPA_Pathway_Tool is implemented in Java and ReactJS. It is freely available on http://mpa-pathwaymapper.ovgu.de/. Further documentation and the complete source code are available on GitHub (https://github.com/danielwalke/MPA_Pathway_Tool).Contactdaniel.walke@ovgu.de, mailto:heyer@mpi-magdeburg.mpg.deheyer@mpi-magdeburg.mpg.deSupplementary InformationAdditional files and images are available at MDPI online.Highlightsuser-friendly generation of pathways, re-using of existent metabolic pathways, automated mapping of data

show abstract

The importance of graph databases and graph learning for clinical applications

Walke

Micheel

Schallert

et al. 2023

View full text Add to dashboard Cite

The increasing amount and complexity of clinical data require an appropriate way of storing and analyzing those data. Traditional approaches use a tabular structure (relational databases) for storing data and thereby complicate storing and retrieving interlinked data from the clinical domain. Graph databases provide a great solution for this by storing data in a graph as nodes (vertices) that are connected by edges (links). The underlying graph structure can be used for the subsequent data analysis (graph learning). Graph learning consists of two parts: graph representation learning and graph analytics. Graph representation learning aims to reduce high-dimensional input graphs to low-dimensional representations. Then, graph analytics uses the obtained representations for analytical tasks like visualization, classification, link prediction and clustering which can be used to solve domain-specific problems. In this survey, we review current state-of-the-art graph database management systems, graph learning algorithms and a variety of graph applications in the clinical domain. Furthermore, we provide a comprehensive use case for a clearer understanding of complex graph learning algorithms. Graphical abstract

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Daniel Walke

MPA_Pathway_Tool: User-Friendly, Automatic Assignment of Microbial Community Data on Metabolic Pathways

Decision tree learning in Neo4j on homogeneous and unconnected graph nodes from biological and clinical datasets

MPA_Pathway_Tool: User-friendly, automatic assignment of microbial community data on metabolic pathways

The importance of graph databases and graph learning for clinical applications

Contact Info

Product

Resources

About