The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

dc.contributor.author	Weaver, L
dc.contributor.author	Tao, Nigel
dc.coverage.spatial	Seattle USA
dc.date.accessioned	2015-12-10T23:11:05Z
dc.date.available	2015-12-10T23:11:05Z
dc.date.created	August 2 2001
dc.identifier.isbn	1558608001
dc.identifier.uri	http://hdl.handle.net/1885/63665
dc.publisher	Morgan Kaufmann Publishers
dc.relation.ispartofseries	Conference on Uncertainty in Artificial Intelligence (UAI 2001)
dc.source	Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference (2001)
dc.title	The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
dc.type	Conference paper
local.description.notes	Imported from ARIES
local.description.refereed	Yes
dc.date.issued	2001
local.identifier.absfor	080110 - Simulation and Modelling
local.identifier.ariespublication	MigratedxPub834
local.type.status	Published Version
local.contributor.affiliation	Weaver, L, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Tao, Nigel, College of Engineering and Computer Science, ANU
local.bibliographicCitation.startpage	538
local.bibliographicCitation.lastpage	545
dc.date.updated	2015-12-10T09:19:50Z
Collections	ANU Research Publications

There are no files associated with this item.