USP: an independence test that improves on Pearson's chi-squared and the $G$-test
View / Open Files
ISSN
1364-5021
Publisher
The Royal Society
Type
Article
This Version
AM
Metadata
Show full item recordCitation
Berrett, T. B., & Samworth, R. USP: an independence test that improves on Pearson's chi-squared and the $G$-test. https://doi.org/10.17863/CAM.78074
Abstract
We present the U-Statistic Permutation (USP) test of independence in the context of discrete data displayed in a contingency table. Either Pearson's chi-squared test of independence, or the G-test, are typically used for this task, but we argue that these tests have serious deficiencies, both in terms of their inability to control the size of the test, and their power properties. By contrast, the USP test is guaranteed to control the size of the test at the nominal level for all sample sizes, has no issues with small (or zero) cell counts, and is able to detect distributions that violate independence in only a minimal way. The test statistic is derived from a U-statistic estimator of a natural population measure of dependence, and we prove that this is the unique minimum variance unbiased estimator of this population quantity. The practical utility of the USP test is demonstrated on both simulated data, where its power can be dramatically greater than those of Pearson's test, the G-test and Fisher's exact test, and on real data. The USP test is implemented in the R package USP.
Keywords
stat.ME, stat.ME, math.ST, stat.AP, stat.ML, stat.TH, 62H17, 62H20, 62F03, 62F05, 62E20
Sponsorship
LANCASTER UNIVERSITY (FB EPSRC) (EP/N031938/1)
EPSRC (EP/P031447/1)
European Commission Horizon 2020 (H2020) ERC (101019498)
Embargo Lift Date
2024-11-12
Identifiers
This record's DOI: https://doi.org/10.17863/CAM.78074
This record's URL: https://www.repository.cam.ac.uk/handle/1810/330630
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk