Discriminative Bayesian Filtering Lends Momentum to the Stochastic Newton Method for Minimizing Log-Convex Functions

Burkhart, Michael

Discriminative Bayesian Filtering Lends Momentum to the Stochastic Newton Method for Minimizing Log-Convex Functions

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/337413

Repository DOI

https://doi.org/10.17863/CAM.84825

Files

Accepted version (260.64 KB) (Embargoed until: 2025-05-23)

Type

Article

Authors

Burkhart, Michael

https://orcid.org/0000-0002-2772-5840

Abstract

To minimize the average of a set of log-convex functions, the stochastic Newton method iteratively updates its estimate using subsampled versions of the full objective's gradient and Hessian. We contextualize this optimization problem as sequential Bayesian inference on a latent state-space model with a discriminatively-specified observation process. Applying Bayesian filtering then yields a novel optimization algorithm that considers the entire history of gradients and Hessians when forming an update. We establish matrix-based conditions under which the effect of older observations diminishes over time, in a manner analogous to Polyak's heavy ball momentum. We illustrate various aspects of our approach with an example and review other relevant innovations for the stochastic Newton method.

Journal Title

Optimization Letters

Journal ISSN

1862-4472

Publisher

Springer

Rights

Publisher's own licence

Collections

Cambridge University Research Outputs