Variance reduction techniques for gradient estimates in reinforcement learning

dc.contributor.author	Greensmith, Evan
dc.contributor.author	Bartlett, Peter
dc.contributor.author	Baxter, Jon
dc.coverage.spatial	Cambridge USA
dc.date.accessioned	2015-12-13T22:30:51Z
dc.date.available	2015-12-13T22:30:51Z
dc.date.created	September 2 2002
dc.date.issued	2002
dc.date.updated	2015-12-11T08:57:01Z
dc.identifier.isbn	0262042088
dc.identifier.uri	http://hdl.handle.net/1885/75024
dc.publisher	MIT Press
dc.relation.ispartofseries	Conference on Advances in Neural Information Processing Systems (NIPS 2002)
dc.source	Advances in Neural Information Processing Systems 14
dc.title	Variance reduction techniques for gradient estimates in reinforcement learning
dc.type	Conference paper
local.bibliographicCitation.lastpage	1514
local.bibliographicCitation.startpage	1507
local.contributor.affiliation	Greensmith, Evan, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Bartlett, Peter, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Baxter, Jon, College of Engineering and Computer Science, ANU
local.contributor.authoruid	Greensmith, Evan, u4005284
local.contributor.authoruid	Bartlett, Peter, u9301805
local.contributor.authoruid	Baxter, Jon, u9612464
local.description.notes	Imported from ARIES
local.description.refereed	Yes
local.identifier.absfor	080109 - Pattern Recognition and Data Mining
local.identifier.ariespublication	MigratedxPub4429
local.type.status	Published Version
