The optimal Reward Baseline for Gradient-Based Reinforcement Learning
Date
2001
Authors
Weaver, L
Tao, Nigel
Journal Title
Journal ISSN
Volume Title
Publisher
Morgan Kauffman Publishers
Abstract
Description
Keywords
Citation
Collections
Source
Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference (2001)
Type
Conference paper