Repository logo
 

Bayesian model selection for multilevel models using integrated likelihoods.

Published version
Peer-reviewed

Change log

Abstract

Multilevel linear models allow flexible statistical modelling of complex data with different levels of stratification. Identifying the most appropriate model from the large set of possible candidates is a challenging problem. In the Bayesian setting, the standard approach is a comparison of models using the model evidence or the Bayes factor. Explicit expressions for these quantities are available for the simplest linear models with unrealistic priors, but in most cases, direct computation is impossible. In practice, Markov Chain Monte Carlo approaches are widely used, such as sequential Monte Carlo, but it is not always clear how well such techniques perform. We present a method for estimation of the log model evidence, by an intermediate marginalisation over non-variance parameters. This reduces the dimensionality of any Monte Carlo sampling algorithm, which in turn yields more consistent estimates. The aim of this paper is to show how this framework fits together and works in practice, particularly on data with hierarchical structure. We illustrate this method on simulated multilevel data and on a popular dataset containing levels of radon in homes in the US state of Minnesota.

Description

Acknowledgements: We would like to thank Dr Torben Sell (University of Edinburgh) for his insight and advice about challenges in MCMC sampling. A CC BY or equivalent licence is applied to the AAM arising from this submission.

Journal Title

PLoS One

Conference Name

Journal ISSN

1932-6203
1932-6203

Volume Title

Publisher

Public Library of Science (PLoS)

Rights and licensing

Except where otherwised noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/
Sponsorship
EPSRC (2089662)
Engineering and Physical Sciences Research Council (EP/S515334/1)