The advance of high-throughput RNA-Sequencing techniques enables researchers to analyze the complete gene activity in particular cells. From the insights of such analyses, researchers can identify disease-specific expression profiles, thus understand complex diseases like cancer, and eventually develop effective measures for diagnosis and treatment. The high dimensionality of gene expression data poses challenges to its computational analysis, which is addressed with measures of gene selection. Traditional gene selection approaches base their findings on statistical analyses of the actual expression levels, which implies several drawbacks when it comes to accurately identifying the underlying biological processes. In turn, integrative approaches include curated information on biological processes from external knowledge bases during gene selection, which promises to lead to better interpretability and improved predictive performance. Our work compares the performance of traditional and integrative gene selection approaches. Moreover, we propose a straightforward approach to integrate external knowledge with traditional gene selection approaches. We introduce a framework enabling the automatic external knowledge integration, gene selection, and evaluation. Evaluation results prove our framework to be a useful tool for evaluation and show that integration of external knowledge improves overall analysis results.
Gene expression data provide the expression levels of tens of thousands of genes from several hundred samples. These data are analyzed to detect biomarkers that can be of prognostic or diagnostic use. Traditionally, biomarker detection for gene expression data is the task of gene selection. The vast number of genes is reduced to a few relevant ones that achieve the best performance for the respective use case. Traditional approaches select genes based on their statistical significance in the data set. This results in issues of robustness, redundancy and true biological relevance of the selected genes. Integrative analyses typically address these shortcomings by integrating multiple data artifacts from the same objects, e.g. gene expression and methylation data. When only gene expression data are available, integrative analyses instead use curated information on biological processes from public knowledge bases. With knowledge bases providing an ever-increasing amount of curated biological knowledge, such prior knowledge approaches become more powerful. This paper provides a thorough overview on the status quo of biomarker detection on gene expression data with prior biological knowledge. We discuss current shortcomings of traditional approaches, review recent external knowledge bases, provide a classification and qualitative comparison of existing prior knowledge approaches and discuss open challenges for this kind of gene selection.
Background: Since the beginning of the Ebola outbreak in West Africa in 2014, more than 11,000 people died. For outbreaks of infectious diseases like this, the rapid implementation of control measures is a crucial factor for containment. In West African countries, outbreak surveillance is a paper-based process with significant delays in forwarding outbreak information, which affects the ability to react adequately to situational changes. Our objective therefore was to develop a tool that improves data collection, situation assessment, and coordination of response measures in outbreak surveillance processes for a better containment. Methods:We have developed the Surveillance and Outbreak Response Management System (SORMAS) based on findings from Nigeria's 2014 Ebola outbreak. We conducted a thorough requirements engineering and defined personas and processes. We also defined a data schema with specific variables to measure in outbreak situations. We designed our system to be a cloud application that consists of interfaces for both mobile devices and desktop computers to support all stakeholders in the process. In the field, health workers collect data on the outbreak situation via mobile applications and directly transmit it to control centers. At the control centers, health workers access SORMAS via desktop computers, receive instant updates on critical situations, react immediately on emergencies, and coordinate the implementation of control measures with SORMAS. Results:We have tested SORMAS in multiple workshops and a field study in July 2015.Results from workshops confirmed derived requirements and implemented features, but also led to further iterations on the systems regarding usability. Results from the field study are currently under assessment. General feedback showed high enthusiasm about the system and stressed its benefits for an effective outbreak containment of infectious diseases.Conclusions: SORMAS is a software tool to support health workers in efficiently handling outbreak situations of infectious diseases, such as Ebola. Our tool enables a bi-directional exchange of situational data between individual stakeholders in outbreak Perscheid et al. Ebola Outbreak Containment With SORMAS containment. This allows instant and seamless collection of data from the field and its instantaneous analysis in operational centers. By that, SORMAS accelerates the implementation of control measures, which is crucial for a successful outbreak containment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.