Gene signatures derived from transcriptomic data using machine Background: learning methods have shown promise for biodosimetry testing. These signatures may not be sufficiently robust for large scale testing, as their performance has not been adequately validated on external, independent datasets. The present study develops human and murine signatures with biochemically-inspired machine learning that are strictly validated using k-fold and traditional approaches.Gene Expression Omnibus (GEO) datasets of exposed human and Methods: murine lymphocytes were preprocessed via nearest neighbor imputation and expression of genes implicated in the literature to be responsive to radiation exposure (n=998) were then ranked by Minimum Redundancy Maximum Relevance (mRMR). Optimal signatures were derived by backward, complete, and forward sequential feature selection using Support Vector Machines (SVM), and validated using k-fold or traditional validation on independent datasets.The best human signatures we derived exhibit k-fold , and ) when validated over 85 samples. Some human ENO1 PPM1D signatures are specific enough to differentiate between chemotherapy and radiotherapy. Certain multi-class murine signatures have sufficient granularity in dose estimation to inform eligibility for cytokine therapy (assuming these signatures could be translated to humans). We compiled a list of the most frequently appearing genes in the top 20 human and mouse signatures. More frequently appearing genes among an ensemble of signatures may indicate greater impact of these genes on the performance of individual signatures. Several genes in the signatures we derived are present in previously proposed signatures.Gene signatures for ionizing radiation exposure derived by Conclusions: 2018, 7:233 Last updated: 20 MAR 2019 Gene signatures for ionizing radiation exposure derived by Conclusions: machine learning have low error rates in externally validated, independent datasets, and exhibit high specificity and granularity for dose estimation.