2019
DOI: 10.2139/ssrn.3330651
|View full text |Cite
|
Sign up to set email alerts
|

What Predicts Corruption?

Abstract: Using rich micro-data from Brazil, we show that multiple machine learning models display high levels of performance in predicting municipality-level corruption in public spending. Measures of private sector activity, financial development, and human capital are the strongest predictors of corruption, while public sector and political features play a secondary role. Our findings have implications for the design and cost-effectiveness of various anti-corruption policies.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
17
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(18 citation statements)
references
References 55 publications
1
17
0
Order By: Relevance
“…The possibility of establishing models that can be used to quantify corruption provides an opportunity not only to describe the phenomenon more precisely, but also to predict it. However, the problem of prediction is not rigorously addressed within the vast academic literature about corruption, and we know little of the aspects that may help predict these acts in a precise manner (Colonnelli, Gallego & Prem, 2019). For example, the statistical models used in diverse studies (Olken & Pande, 2012) lack the capacity of prediction due to the scope of the data used, the spatial and temporal scales of analysis, the fields of operation, and the high dependency on correlations, making nearly impossible to establish causalities for different events (Riccardi & Sarno, 2014).…”
Section: Predicting Corruptionmentioning
confidence: 99%
See 2 more Smart Citations
“…The possibility of establishing models that can be used to quantify corruption provides an opportunity not only to describe the phenomenon more precisely, but also to predict it. However, the problem of prediction is not rigorously addressed within the vast academic literature about corruption, and we know little of the aspects that may help predict these acts in a precise manner (Colonnelli, Gallego & Prem, 2019). For example, the statistical models used in diverse studies (Olken & Pande, 2012) lack the capacity of prediction due to the scope of the data used, the spatial and temporal scales of analysis, the fields of operation, and the high dependency on correlations, making nearly impossible to establish causalities for different events (Riccardi & Sarno, 2014).…”
Section: Predicting Corruptionmentioning
confidence: 99%
“…As it is, conclusions and proposals derived from such models should be considered reservedly within their respective contexts. Nevertheless, with current computing capabilities and trends towards the digitalization of information in many public sectors, the ideal and promising approaches to predict corruption risk seem to be those supported by livemonitoring systems powered by artificial intelligence (Fazekas & Kocsis, 2017;López-Iturriaga & Sanz, 2018;Colonnelli et al, 2019). Such systems may provide a wide range of real-time preventive capabilities (Byers, 2017;Altshuler & Pentland, 2018).…”
Section: Predicting Corruptionmentioning
confidence: 99%
See 1 more Smart Citation
“…In line with previous research (Gallego et al, 2019;Colonnelli et al, 2020), we construct an objective measure based on observable characteristics and a machine learning approach. This measure allows us to the disentangle the effects of corruption on public procurement in the midst of an emergency.…”
Section: Introductionmentioning
confidence: 99%
“…Since the CGU construct these measures only starting in 2006, we compute the number of irregularities in the 2003-2005 audits using the manually constructed data inColonnelli et al (2020b). 41 Specifically, following the approach ofColonnelli et al (2020a), we estimate a LASSO model using a set of 147 municipality characteristics and a ten-fold cross-validation to find the parameter that best fits the data. We then standardize the predicted probability by subtracting its mean and dividing by its standard deviation.42 Using numbers from the empirical distribution of the standardized share in the data, we can compute that municipalities with predicted levels of detected corruption in the bottom 10th percentile of the distribution experience a negative effect on total firms of 4.3%.…”
mentioning
confidence: 99%