This paper addresses the development of predictive models for distinguishing pre-symptomatic infections from uninfected individuals. Our machine learning experiments are conducted on publicly available challenge studies that collected whole-blood transcriptomics data from individuals infected with HRV, RSV, H1N1, and H3N2. We address the problem of identifying discriminatory biomarkers between controls and eventual shedders in the first 32 h post-infection. Our exploratory analysis shows that the most discriminatory biomarkers exhibit a strong dependence on time over the course of the human response to infection. We visualize the feature sets to provide evidence of the rapid evolution of the gene expression profiles. To quantify this observation, we partition the data in the first 32 h into four equal time windows of 8 h each and identify all discriminatory biomarkers using sparsity-promoting classifiers and Iterated Feature Removal. We then perform a comparative machine learning classification analysis using linear support vector machines, artificial neural networks and Centroid-Encoder. We present a range of experiments on different groupings of the diseases to demonstrate the robustness of the resulting models.
Given an infected host, estimating the time that has elapsed since initial exposure to the pathogen is an important problem in public health. In this paper we use longitudinal gene expression data from human challenge studies of viral respiratory illnesses for building predictive models to estimate the time elapsed since onset of respiratory infection. We apply sparsity driven machine learning to this time-stamped gene expression data to model the time of exposure by a pathogen and subsequent infection accompanied by the onset of the host immune response. These predictive models exploit the fact that the host gene expression profile evolves in time and its characteristic temporal signature can be effectively modeled using a small number of features. Predicting the time of exposure to infection to be in first 48 h after exposure produces BSR in the range of 80–90% on sequestered test data. A variety of machine learning experiments provide evidence that models developed on one virus can be used to predict exposure time for other viruses, e.g., H1N1, H3N2, and HRV. The interferon $$\alpha /\beta $$ α / β signaling pathway appears to play a central role in keeping time from onset of infection. Successful prediction of the time of exposure to a pathogen has potential ramifications for patient treatment and contact tracing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.