Skip navigation
Skip navigation

Browsing by Author Weaver, L

Or enter first few letters:  
Showing results 3 to 4 of 4

STD(λ): learning state differences with TD(λ)

Author(s)Weaver, L; Baxter, Jonathan
TypeConference paper
Date Published2001
Date CreatedJuly 14 2001

The optimal Reward Baseline for Gradient-Based Reinforcement Learning

Author(s)Weaver, L; Tao, Nigel
TypeConference paper
Date Published2001
Date CreatedAugust 2 2001

Updated:  12 April 2016/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator