Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control
View / Open Files
Authors
August, Moritz
Hernández-Lobato, José Miguel
Publication Date
2018Journal Title
ISC High Performance 2018: High Performance Computing
Conference Name
International Conference on High Performance Computing 2018
ISSN
0302-9743
ISBN
978-3-030-02465-9
Publisher
https://link.springer.com/chapter/10.1007/978-3-030-02465-9_43#aboutcontent
Volume
LNCS, volume 11203
Pages
591-693
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
August, M., & Hernández-Lobato, J. M. (2018). Taking gradients through experiments: LSTMs and memory proximal policy
optimization for black-box quantum control. ISC High Performance 2018: High Performance Computing, LNCS, volume 11203 591-693. https://doi.org/10.1007/978-3-030-02465-9_43
Abstract
In this work we introduce the application of black-box quantum control as an
interesting rein- forcement learning problem to the machine learning community.
We analyze the structure of the reinforcement learning problems arising in
quantum physics and argue that agents parameterized by long short-term memory
(LSTM) networks trained via stochastic policy gradients yield a general method
to solving them. In this context we introduce a variant of the proximal policy
optimization (PPO) algorithm called the memory proximal policy optimization
(MPPO) which is based on this analysis. We then show how it can be applied to
specific learning tasks and present results of nu- merical experiments showing
that our method achieves state-of-the-art results for several learning tasks in
quantum control with discrete and continouous control parameters.
Keywords
cs.LG, cs.LG, quant-ph
Identifiers
External DOI: https://doi.org/10.1007/978-3-030-02465-9_43
This record's URL: https://www.repository.cam.ac.uk/handle/1810/288024
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk