Skip navigation
Skip navigation

The optimal Reward Baseline for Gradient-Based Reinforcement Learning

dc.contributor.authorWeaver, L
dc.contributor.authorTao, Nigel
dc.coverage.spatialSeattle USA
dc.date.accessioned2015-12-10T23:11:05Z
dc.date.available2015-12-10T23:11:05Z
dc.date.createdAugust 2 2001
dc.identifier.isbn1558608001
dc.identifier.urihttp://hdl.handle.net/1885/63665
dc.publisherMorgan Kauffman Publishers
dc.relation.ispartofseriesConference on Uncertainty in Artificial Intelligence (UAI 2001)
dc.sourceUncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference (2001)
dc.titleThe optimal Reward Baseline for Gradient-Based Reinforcement Learning
dc.typeConference paper
local.description.notesImported from ARIES
local.description.refereedYes
dc.date.issued2001
local.identifier.absfor080110 - Simulation and Modelling
local.identifier.ariespublicationMigratedxPub834
local.type.statusPublished Version
local.contributor.affiliationWeaver, L, College of Engineering and Computer Science, ANU
local.contributor.affiliationTao, Nigel, College of Engineering and Computer Science, ANU
local.bibliographicCitation.startpage538
local.bibliographicCitation.lastpage545
dc.date.updated2015-12-10T09:19:50Z
CollectionsANU Research Publications

Download

There are no files associated with this item.


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator