Skip navigation
Skip navigation

Browsing by Author Weaver, L

Or enter first few letters:  
Showing results 4 to 4 of 4

The optimal Reward Baseline for Gradient-Based Reinforcement Learning

Author(s)Weaver, L; Tao, Nigel
TypeConference paper
Date Published2001
Date CreatedAugust 2 2001

Updated:  12 April 2016/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator