Repository logo

Using machine learning to forecast domestic homicide via police data and super learning

Published version

Repository DOI

Change log


Verrey, Jacob 
Ariel, Barak 
Harinam, Vincent 
Dillon, Luke 


jats:titleAbstract</jats:title>jats:pWe explore the feasibility of using machine learning on a police dataset to forecast domestic homicides. Existing forecasting instruments based on ordinary statistical instruments focus on non-fatal revictimization, produce outputs with limited predictive validity, or both. We implement a “super learner,” a machine learning paradigm that incorporates roughly a dozen machine learning models to increase the recall and AUC of forecasting using any one model. We purposely incorporate police records only, rather than multiple data sources, to illustrate the practice utility of the super learner, as additional datasets are often unavailable due to confidentiality considerations. Using London Metropolitan Police Service data, our model outperforms all extant domestic homicide forecasting tools: the super learner detects 77.64% of homicides, with a precision score of 18.61% and a 71.04% Area Under the Curve (AUC), which, collectively and severely, are assessed as “excellent.” Implications for theory, research, and practice are discussed.</jats:p>



Journal Title

Scientific Reports

Conference Name

Journal ISSN


Volume Title



Springer Science and Business Media LLC