A lesson from R-fortunes: not all science is good science.


“It is becoming apparent that you do not know how to use the results from either system. The progress of science would be safer if you get some advice from a person that knows what they are doing.”

— David Winsemius (in response to a user that obtained different linear regression results in R and SPSS and wanted to know which one to use)      R-help (July 2011)

I can always count on my fortunes R package for a good laugh (especially at the expense of SPSS users). This quote, however, raises an interesting point about the misuse of statistics.

First, let me digress. Before undergraduate-level coursework in psychology, I didn’t know much about the way people acted. After a few undergraduate classes, I knew everything about the inner workings of the mind. I knew that priming people with stereotypically older words reduced their walking speed (Bargh, Chen, & Burrows, 1996), that the Implicit Association Test (IAT; Greenwald et al., 2002) measured meaningful unconscious attitudes, that narcissism was associated with using more first-person pronouns (Raskin & Shaw, 1988), etc. It wasn’t until several years of graduate school, advanced statistical training, some reading in meta-research, and a visit from the replication police that I realized a) that findings are never as clear-cut as they seem and b) that all of these findings have been called into question (priming: Doyen, Klein, Pichon, & Cleeremans, 2012; pronouns: Carey et al., 2015; IAT: Blanton et al., 2009). Further reading reveals p-hacking (Simonsohn, Nelson, & Simmons, 2014), incredibility indices (Schimmack, 2012), and the claim that half of all published findings may be false (Ioannidis, 2005).

I hope this digression illustrates that a little knowledge and a false sense of understanding can be dangerous. A novice statistician who keeps running participants until his or her hypotheses reach statistical significance might not realize that he or she has just raised the Type I error rate to 20%, despite nominally testing at p < .05 (Sherman, 2014). Yet those findings get published.
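
To see why testing repeatedly and stopping at the first significant result inflates the error rate, here is a minimal simulation sketch in Python, using only the standard library. Everything specific in it is my own illustrative assumption rather than anything taken from Sherman (2014): a two-sided z-test with a known standard deviation of 1, batches of 10 participants, and at most 10 looks at the data. The true effect is zero in every simulated study, so any "significant" result is a false positive.

```python
import random
from statistics import NormalDist


def p_value(sample):
    """Two-sided z-test of mean = 0, assuming a known SD of 1."""
    n = len(sample)
    z = (sum(sample) / n) * n ** 0.5
    return 2 * (1 - NormalDist().cdf(abs(z)))


def false_positive_rate(looks, batch=10, sims=2000, alpha=0.05, seed=1):
    """Share of null-effect studies that end up declared significant.

    looks=1 is a fixed-sample design with a single test.
    looks>1 tests after every batch and stops as soon as p < alpha,
    which is the optional-stopping habit described above.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(sims):
        sample = []
        for _ in range(looks):
            # Collect another batch of participants; the true mean is 0.
            sample.extend(rng.gauss(0, 1) for _ in range(batch))
            if p_value(sample) < alpha:
                hits += 1
                break  # stop collecting once the result is "significant"
    return hits / sims


fixed = false_positive_rate(looks=1)
peeking = false_positive_rate(looks=10)
print(f"single test: {fixed:.3f}, test after every batch: {peeking:.3f}")
```

The single-test design should land near the nominal .05, while the peeking design should come out well above it, even though each individual test uses p < .05. The exact figures depend on the batch size, the number of looks, and the random seed.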

This brings me back to the original (humorous) quote from my R-fortunes package. Misuse and misunderstanding of analyses are among the reasons that so many findings across scientific disciplines do not replicate (Freedman, Cockburn, & Simcoe, 2015). I think the ‘take away’ from this ‘fortune’ (and blog post) is that statistics are often misused and abused, sometimes knowingly and other times unwittingly. The scientific process is slow and self-correcting, but not perfect. Published papers are not necessarily error-free. Interpret analyses cautiously. Interpret the research of others cautiously. Most importantly, use R, not SPSS.


