Browsing by Author Weaver, L
Showing results 3 to 4 of 4
STD(λ): learning state differences with TD(λ)
Author(s) | Weaver, L; Baxter, Jonathan |
---|---|
Type | Conference paper |
Date Published | 2001 |
Date Created | July 14 2001 |
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Author(s) | Weaver, L; Tao, Nigel |
---|---|
Type | Conference paper |
Date Published | 2001 |
Date Created | August 2 2001 |