Repository logo

The effect of rare variants on inflation of the test statistics in case-control analyses.

Change log


Pirie, Ailith 
Lush, Michael 
Pharoah, Paul DP 


BACKGROUND: The detection of bias due to cryptic population structure is an important step in the evaluation of findings of genetic association studies. The standard method of measuring this bias in a genetic association study is to compare the observed median association test statistic to the expected median test statistic. This ratio is inflated in the presence of cryptic population structure. However, inflation may also be caused by the properties of the association test itself particularly in the analysis of rare variants. We compared the properties of the three most commonly used association tests: the likelihood ratio test, the Wald test and the score test when testing rare variants for association using simulated data. RESULTS: We found evidence of inflation in the median test statistics of the likelihood ratio and score tests for tests of variants with less than 20 heterozygotes across the sample, regardless of the total sample size. The test statistics for the Wald test were under-inflated at the median for variants below the same minor allele frequency. CONCLUSIONS: In a genetic association study, if a substantial proportion of the genetic variants tested have rare minor allele frequencies, the properties of the association test may mask the presence or absence of bias due to population structure. The use of either the likelihood ratio test or the score test is likely to lead to inflation in the median test statistic in the absence of population structure. In contrast, the use of the Wald test is likely to result in under-inflation of the median test statistic which may mask the presence of population structure.



Case-Control Studies, Computer Simulation, Data Interpretation, Statistical, Gene Frequency, Genetic Association Studies, Genetic Markers, Genetic Predisposition to Disease, Genetic Variation, Heterozygote, Humans, Models, Genetic, Models, Statistical

Journal Title

BMC Bioinformatics

Conference Name

Journal ISSN


Volume Title



Springer Science and Business Media LLC
Medical Research Council (MR/L003120/1)
Cancer Research Uk (None)
Cancer Research Uk (None)
Cancer Research Uk (None)
Cancer Research Uk (None)
Cancer Research Uk (None)
British Heart Foundation (None)
This work was supported by a grant from Cancer Research UK (C490/A16561). AP is funded by a Medical Research Council studentship.