Browsing by Author Weaver, L
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Author(s) | Baxter, Jon; Bartlett, Peter; Weaver, L |
---|---|
Type | Journal article |
Date Published | 2001 |
Date Created | - |
Learning to Play Chess Using Temporal Differences
Author(s) | Baxter, Jon; Tridgell, A; Weaver, L |
---|---|
Type | Journal article |
Date Published | 2000 |
Date Created | - |
STD(λ): learning state differences with TD(λ)
Author(s) | Weaver, L; Baxter, Jonathan |
---|---|
Type | Conference paper |
Date Published | 2001 |
Date Created | July 14 2001 |
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Author(s) | Weaver, L; Tao, Nigel |
---|---|
Type | Conference paper |
Date Published | 2001 |
Date Created | August 2 2001 |