BACKGROUND
Although interventions exist to reduce violent crime, optimal implementation requires accurate targeting. We report the results of an attempt to develop an actuarial model using machine learning methods to predict future violent crimes among U.S. Army soldiers.
METHODS
A consolidated administrative database for all 975,057 soldiers in the U.S. Army in 2004-2009 was created in the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). 5,771 of these soldiers committed a first founded major physical violent crime (murder-manslaughter, kidnapping, aggravated arson, aggravated assault, robbery) over that time period. Temporally prior administrative records measuring socio-demographic, Army career, criminal justice, medical/pharmacy, and contextual variables were used to build an actuarial model for these crimes separately among men and women using machine learning methods (cross-validated stepwise regression; random forests; penalized regressions). The model was then validated in an independent 2011-2013 sample.
RESULTS
Key predictors were indicators of disadvantaged social/socio-economic status, early career stage, prior crime, and mental disorder treatment. Area under the receiver operating characteristic curve was .80-.82 in 2004-2009 and .77 in a 2011-2013 validation sample. 36.2-33.1% (male-female) of all administratively-recorded crimes were committed by the 5% of soldiers having highest predicted risk in 2004-2009 and an even higher proportion (50.5%) in the 2011-2013 validation sample.
CONCLUSIONS
Although these results suggest that the models could be used to target soldiers at high risk of violent crime perpetration for preventive interventions, final implementation decisions would require further validation and weighing of predicted effectiveness against intervention costs and competing risks.