Research papers from policy school

Today I went digging through my hard drive for a research paper I wrote in policy school that used structural equations modeling to analyze the 2004 United States presidential election.  Sadly, it seems I lost the final version and only have a rough draft.  I did, however, find another research paper on population changes in Wayne and Oakland counties (roughly, Detroit and its wealthier suburbs.)

WayneAndOakland.doc

You might find it interesting if you like Detroit or pretty choropleths:choro

Some mixed, sort-of tooting my own horn: I independently discovered one of the most important urban trends in the United States – the dispersal of poor, urban blacks to inner ring suburbs, which in many ways laid the ground for the recent conflict in Ferguson.  Of course, by that time, the professionals had already discovered it.

I also found, in retrospect, absolutely no evidence for gentrification in Detroit from 1990-2000; at the time I worded my conclusion a bit more weakly, probably because no one I talked to wanted to hear this conclusion.  Fortunately, I’m coming to care less what others think in my old age.  Also in retrospect, the most likely cause of the changes I observed is the Third Great Migration.

I guess I’ll take a stab at explaining the lost structural equations modeling paper as well.

Continue reading

Sanders versus Clinton supporters in the American National Election Studies data

Were it not for Trump, the great drama of the 2016 election would have been the primary contest between Hillary Clinton and Bernie Sanders.  Sanders generally fit the mold of a “leftist protest candidate”, but was far more successful than previous such candidates have been.  In this post, I will examine the 2016 American National Election Studies data, hoping to find clues that explain why.

Continue reading

Bayesian analysis of Slate Star Codex survey data.

[Epistemic status: I’m teaching myself Bayesian analysis out of an O’Reilly-esque programming book; I haven’t yet mustered myself to crack the intimidating Andrew Gelman tome on my shelf. I beg you, correct me if I have screwed this up.]

Scott Alexander posted his survey data results several months ago, and recently has been posting some interesting things about how different groups perceive optical illusions.

As part of my quest to finally understand the differences between Bayesian analysis and frequentist analysis, I downloaded his data and poked at it with PyMC, again modeling my analyses after those in chapter 2 of Bayesian Methods for Hackers, by Cameron Davidson-Pilon (the A/B testing example and the Challenger example.)

Continue reading

The same thing, but with different priors

A couple of days ago I posted a Bayesian re-analysis of the data from a paper on prenatal progesterone exposure and sexual orientation.  For that analysis, I used uniform priors for both exposed and unexposed subjects – that is, I assumed we pretty much don’t know anything about how common non-heterosexuality is, and that the effects of progesterone exposure could be anywhere from infinity to nothing.  These priors didn’t seem very realistic, but the results I got seem fairly intuitive, given the data and outside figures on how common non-heterosexuality is.

Continue reading

Machine learning and racial bias

Warning: Second-order contrarianism.
You may remember this Propublica article from about a year ago, arguing that the COMPAS scores, a machine-learning algorithm that predicts risk of criminal recidivism, is racially biased.  Their methodology was a bit strange and they made their data available openly, so I had been intending to reanalyze using more straightforward methods.  Fortunately, several people have already done this, sparing me the effort; several of them found that according to commonly accepted standards, the COMPAS algorithm is not racially biased.  The Washington Post also published an article, saying the question is complicated. Continue reading

Obama/Trump voters in the National Election Studies data, Part 4

Interpretations:

I conducted three analyses and found that:

  1. There were a surprisingly large number of Obama/Trump voters, and with some exceptions they matched the media portrait – white, working class, low income, older, and Midwestern.
  2. These voters probably chose Trump based on racially loaded policy issues; in particular, “law and order” issues related to fear of criminal violence.
  3.  These voters held economically populist views on a number of issues, even if those views did not drive their votes in the 2016 election.  However, their economic views on most issues were at least slightly to the right of the views of Obama/Clinton voters.

Continue reading