Interpolated policy gradient: Merging on-policy and off-policy gradient estimation for deep reinforcement learning
View / Open Files
Publication Date
2017-01-01Journal Title
Advances in Neural Information Processing Systems
ISSN
1049-5258
Volume
2017-December
Pages
3847-3856
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
Gu, S., Lillicrap, T., Ghahramani, Z., Turner, R., Schölkopf, B., & Levine, S. (2017). Interpolated policy gradient: Merging on-policy and off-policy gradient estimation for deep reinforcement learning. Advances in Neural Information Processing Systems, 2017-December 3847-3856. https://doi.org/10.17863/CAM.21291
Sponsorship
EPSRC (EP/J012300/1)
EPSRC (via University of Sheffield) (EP/N014162/1)
Identifiers
This record's DOI: https://doi.org/10.17863/CAM.21291
This record's URL: https://www.repository.cam.ac.uk/handle/1810/274192
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved