Skip navigation
Skip navigation
Open Research will be down for maintenance between 8:00 and 8:15 am on Tuesday, December 1 2020.

Browsing by Author Weaver, L

Or enter first few letters:  
Showing results 1 to 4 of 4

Experiments with Infinite-Horizon, Policy-Gradient Estimation

Author(s)Baxter, Jon; Bartlett, Peter; Weaver, L
TypeJournal article
Date Published2001
Date Created-

Learning to Play Chess Using Temporal Differences

Author(s)Baxter, Jon; Tridgell, A; Weaver, L
TypeJournal article
Date Published2000
Date Created-

STD(λ): learning state differences with TD(λ)

Author(s)Weaver, L; Baxter, Jonathan
TypeConference paper
Date Published2001
Date CreatedJuly 14 2001

The optimal Reward Baseline for Gradient-Based Reinforcement Learning

Author(s)Weaver, L; Tao, Nigel
TypeConference paper
Date Published2001
Date CreatedAugust 2 2001
  • previous
  • 1
  • next

Updated:  12 April 2016/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator