Browsing by Author Baxter, Jon
Boosting Algorithms as Gradient Descent
Author(s) | Mason, Llew; Bartlett, Peter; Baxter, Jon, et al. |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | November 29 1999 |
Direct Gradient-based Reinforcement Learning
Author(s) | Baxter, Jon; Bartlett, Peter |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | May 28 2000 |
Estimation and Approximation Bounds for Gradient-based Reinforcement Learning
Author(s) | Bartlett, Peter; Baxter, Jon |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | June 28 2000 |
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Author(s) | Baxter, Jon; Bartlett, Peter; Weaver, L |
---|---|
Type | Journal article |
Date Published | 2001 |
Date Created | - |
Functional Gradient Techniques for Combining Hypotheses
Author(s) | Mason, Llew; Baxter, Jon; Bartlett, Peter, et al. |
---|---|
Type | Book chapter |
Date Published | 2000 |
Date Created | - |
General Matrix-matrix Multiplication using SIMD Features of the PIII
Author(s) | Aberdeen, Douglas; Baxter, Jon |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | August 29 2000 |
Improved Generalization Through Explicit Optimization of Margins
Author(s) | Mason, Llew; Bartlett, Peter; Baxter, Jon |
---|---|
Type | Journal article |
Date Published | 2000 |
Date Created | - |
Learning to Play Chess Using Temporal Differences
Author(s) | Baxter, Jon; Tridgell, A; Weaver, L |
---|---|
Type | Journal article |
Date Published | 2000 |
Date Created | - |
Reinforcement Learning in POMDPs via Direct Gradient Ascent
Author(s) | Baxter, Jon; Bartlett, Peter |
---|---|
Type | Book chapter |
Date Published | 2000 |
Date Created | - |
Stochastic Optimisation of Controlled Partially Observable Markov Decision Processes
Author(s) | Bartlett, Peter; Baxter, Jon |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | December 12 2000 |
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Author(s) | Greensmith, Evan; Bartlett, Peter; Baxter, Jon |
---|---|
Type | Conference paper |
Date Published | 2002 |
Date Created | September 2 2002 |
Voting Methods for Data Segmentation
Author(s) | Bartlett, Peter; Baxter, Jon |
---|---|
Type | Conference paper |
Date Published | 2000 |
Date Created | December 19 2000 |