Earlier on twitter I shared some scatter plots comparing homicide rates by percentage of each state black.

gun_hom_by_pct_state_black

homicides_by_pct_black

Since some people believe this might somehow only be a state-level issue, I thought I’d just quickly plot the data at a county level.


Including even the smallest counties, a number with zero homicides, the correlation isn’t quite as strong (r=0.58):

county_hom_by_pct_black_all_counties

However, if I filter to only include counties with 100K plus people, it tightens up (r=0.69).

county_hom_by_pct_black_100k

@500k+ (r=0.73)county_hom_by_pct_black_500k

@ 1M+ (r=0.76)county_hom_by_pct_black_1M

Even in relatively modestly sized counties (50-200K people), the correlation coefficient is 0.58.county_hom_by_pct_black_50_to_200k

I am, of course, not the first person to take this sort of approach.  Ron Unz analyzed this in 2013 at a city level, on an annual basis, and for a variety of other crimes, but my approach is somewhat different (CDC stats vs FBI UCR; multi-year average; and scatterplots vs reporting correlation coefficients).  My results are generally consistent with his with respect to blacks and homicide rates (somewhat weaker correlation, but that’s not surprising given counties vs city differences).

However, he is wrong about hispanics though.  There is a weak correlation (r=0.05) without any controls, but their demographic footprint and relatively low homicide rate (as compared to blacks) masks the signal a bit.  Similar confounds also exist for asians.

hom_splom

I quickly modeled the percentage black, hispanic, asian, and white to suss out the relative impacts of various groups at once.

The coefficients are approximately:

Intercept: 18.8

Black: 7.4

Hispanic: -25.2

White: -34.3

Asian: -62.0

…which implies that larger hispanic populations are correlated with higher homicide rates and larger asian populations are correlated with smaller homicide rates.  This is generally consistent with criminal justice statistics in broad strokes (e.g., victims, perpetrators, imprisonment rates, etc).

If I plot the predicted values (r=0.73):

actual_vs_pred_hom_100K_black_hispanic_asian_white

No doubt I could improve this and take this analysis further, but that’s enough for now. 🙂