A Tutorial on Hunting Statistical Significance by Chasing N.

Accepted version
Peer-reviewed

Type

Article

Abstract

There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially inflates Type I error, resulting in a much larger proportion of false positive findings than the usually expected 5%. In order to build better intuition for avoiding, detecting and criticizing some typical problems, here I systematically illustrate the large impact of some easy-to-implement, and therefore perhaps frequent, data-dredging techniques on boosting false positive findings. I illustrate several forms of two special cases of data dredging. First, researchers may violate the data-collection stopping rules of null hypothesis significance testing by repeatedly checking for statistical significance with various numbers of participants. Second, researchers may group participants post hoc along potential but unplanned independent grouping variables. The first approach 'hacks' the number of participants in studies; the second 'hacks' the number of variables in the analysis. I demonstrate the high rate of false positive findings generated by these techniques with data drawn from true null distributions. I also illustrate that it is extremely easy to introduce strong bias into data through very mild selection and re-testing. Similar, usually undocumented data-dredging steps can easily lead to 20-50% or more false positives.
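The first technique described in the abstract (repeatedly testing for significance while adding participants) can be illustrated with a small simulation. The sketch below is not the author's code; it is a minimal, assumed setup using a one-sample t-test on data drawn from a true null distribution, where a researcher "peeks" at the p-value after every batch of participants and stops as soon as p < .05. All batch sizes and simulation counts are illustrative choices.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

def peeking_false_positive_rate(n_sims=2000, n_start=10, n_max=100,
                                step=10, alpha=0.05):
    """Estimate the Type I error rate when a one-sample t-test against 0
    is re-run after each new batch of participants (optional stopping).
    Data come from a standard normal, so the null is true by construction
    and every 'significant' result is a false positive."""
    false_positives = 0
    for _ in range(n_sims):
        data = rng.standard_normal(n_max)  # true null: population mean is 0
        for n in range(n_start, n_max + 1, step):
            p = stats.ttest_1samp(data[:n], 0.0).pvalue
            if p < alpha:  # researcher stops collecting and reports the result
                false_positives += 1
                break
    return false_positives / n_sims

print(peeking_false_positive_rate())
```

With these settings the estimated false positive rate lands well above the nominal 5%, consistent with the abstract's point that undocumented repeated checking can multiply the error rate several-fold. Testing only once at a fixed, pre-registered N keeps the rate at the nominal alpha.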

Keywords

N-hacking, Type I error, bias and data dredging, false positive error, null hypothesis significance testing (NHST), p-hacking, replication crisis

Journal Title

Frontiers in Psychology

Journal ISSN

1664-1078

Volume Title

7

Publisher

Frontiers Media

Sponsorship

DS is supported by the James S. McDonnell Foundation.