MuProp: Unbiased backpropagation for stochastic neural networks


Type

Article

Authors

Gu, Shixiang 
Levine, Sergey 
Sutskever, Ilya 
Mnih, Andriy 

Abstract

Deep neural networks are powerful parametric models that can be trained efficiently using the backpropagation algorithm. Stochastic neural networks combine the power of large parametric functions with that of graphical models, which makes it possible to learn very complex distributions. However, as backpropagation is not directly applicable to stochastic networks that include discrete sampling operations within their computational graph, training such networks remains difficult. We present MuProp, an unbiased gradient estimator for stochastic networks, designed to make this task easier. MuProp improves on the likelihood-ratio estimator by reducing its variance using a control variate based on the first-order Taylor expansion of a mean-field network. Crucially, unlike prior attempts at using backpropagation for training stochastic networks, the resulting estimator is unbiased and well behaved. Our experiments on structured output prediction and discrete latent variable modeling demonstrate that MuProp yields consistently good performance across a range of difficult tasks.
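For concreteness, the following is a minimal sketch of the estimator the abstract describes, assuming a single layer of Bernoulli units with per-unit logits theta and a quadratic downstream loss; the function names (muprop_grad, f, f_grad) and the loss are illustrative choices, not taken from the paper. The one-sample estimate is the likelihood-ratio term with a first-order Taylor expansion of the loss around the mean-field pass subtracted as a control variate, plus the analytic gradient of that control variate's expectation.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def f(z, target):
    # Illustrative differentiable loss downstream of the stochastic units.
    return np.sum((z - target) ** 2)

def f_grad(z, target):
    # Analytic gradient of f with respect to z.
    return 2.0 * (z - target)

def muprop_grad(theta, target):
    """One-sample MuProp estimate of d/dtheta E_{z ~ Bern(sigmoid(theta))}[f(z)]."""
    mu = sigmoid(theta)                            # Bernoulli means
    z = (rng.random(mu.shape) < mu).astype(float)  # discrete sample
    zbar = mu                                      # mean-field forward pass
    gbar = f_grad(zbar, target)                    # Taylor coefficient f'(zbar)
    # Score function: d/dtheta log Bern(z; sigmoid(theta)) = z - mu.
    score = z - mu
    # Likelihood-ratio term, with the first-order Taylor expansion of f
    # around zbar subtracted as a control variate.
    lr_term = (f(z, target) - f(zbar, target) - gbar @ (z - zbar)) * score
    # Analytic gradient of the control variate's expectation:
    # d/dtheta [f'(zbar)^T mu], holding f'(zbar) fixed; dmu/dtheta = mu*(1-mu).
    det_term = gbar * mu * (1.0 - mu)
    return lr_term + det_term

# Averaging many one-sample estimates approximates the exact gradient,
# illustrating that the estimator is unbiased.
theta = np.zeros(4)
target = np.array([1.0, 0.0, 1.0, 0.0])
print(np.mean([muprop_grad(theta, target) for _ in range(10000)], axis=0))
```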

Description

This is the final version of the article. It first appeared at the International Conference on Learning Representations and is available via http://arxiv.org/abs/1511.05176v3.

Journal Title

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Publisher

MIT Press

Sponsorship

ALTA; Jesus College Cambridge; Cambridge-Tübingen PhD Fellowship