Variational Bayesian dropout: pitfalls and fixes
View / Open Files
Authors
Matthews, Alexander G de G
Ghahramani, Zoubin
Publication Date
2018Journal Title
Proceedings of Machine Learning Research
Conference Name
ICML 2018: 35th International Conference on Machine Learning
ISSN
2640-3498
Publisher
Proceedings of Machine Learning Research
Volume
80
Pages
2024-2033
Type
Conference Object
This Version
VoR
Metadata
Show full item recordCitation
Matthews, A. G. d. G., & Ghahramani, Z. (2018). Variational Bayesian dropout: pitfalls and fixes. Proceedings of Machine Learning Research, 80 2024-2033. https://doi.org/10.17863/CAM.28034
Abstract
Dropout, a stochastic regularisation technique for training of neural networks, has recently been reinterpreted as a specific type of approximate inference algorithm for Bayesian neural networks. The main contribution of the reinterpretation is in providing a theoretical framework useful for analysing and extending the algorithm. We show that the proposed framework suffers from several issues; from undefined or pathological behaviour of the true posterior related to use of improper priors, to an ill-defined variational objective due to singularity of the approximating distribution relative to the true posterior. Our analysis of the improper log uniform prior used in variational Gaussian dropout suggests the pathologies are generally irredeemable, and that the algorithm still works only because the variational formulation annuls some of the pathologies. To address the singularity issue, we proffer Quasi-KL (QKL) divergence, a new approximate inference objective for approximation of high-dimensional distributions. We show that motivations for variational Bernoulli dropout based on discretisation and noise have QKL as a limit. Properties of QKL are studied both theoretically and on a simple practical example which shows that the QKL-optimal approximation of a full rank Gaussian with a degenerate one naturally leads to the Principal Component Analysis solution.
Keywords
stat.ML, stat.ML, cs.LG
Sponsorship
Jiri Hron holds a Nokia CASE Studentship. Alexander Matthews and Zoubin Ghahramani acknowledge the support of EPSRC Grant EP/N014162/1 and EPSRC Grant EP/N510129/1 (The Alan Turing Institute).
Funder references
EPSRC (via University of Sheffield) (143103)
Alan Turing Institute (EP/N510129/1)
Identifiers
External DOI: https://doi.org/10.17863/CAM.28034
This record's URL: https://www.repository.cam.ac.uk/handle/1810/280670
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk