The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

Collections: ANU Research Publications
Date published: 2001
Type: Conference paper
Source: Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference (2001)


There are no files associated with this item.

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated: 19 May 2020 / Responsible Officer: University Librarian / Page Contact: Library Systems & Web Coordinator