Variance reduction techniques for gradient estimates in reinforcement learning

dc.contributor.authorGreensmith, Evan
dc.contributor.authorBartlett, Peter
dc.contributor.authorBaxter, Jon
dc.coverage.spatialCambridge USA
dc.date.accessioned2015-12-13T22:30:51Z
dc.date.available2015-12-13T22:30:51Z
dc.date.createdSeptember 2 2002
dc.date.issued2002
dc.date.updated2015-12-11T08:57:01Z
dc.identifier.isbn0262042088
dc.identifier.urihttp://hdl.handle.net/1885/75024
dc.publisherMIT Press
dc.relation.ispartofseriesConference on Advances in Neural Information Processing Systems (NIPS 2002)
dc.sourceAdvances in Neural Information Processing Systems 14
dc.titleVariance reduction techniques for gradient estimates in reinforcement learning
dc.typeConference paper
local.bibliographicCitation.lastpage1514
local.bibliographicCitation.startpage1507
local.contributor.affiliationGreensmith, Evan, College of Engineering and Computer Science, ANU
local.contributor.affiliationBartlett, Peter, College of Engineering and Computer Science, ANU
local.contributor.affiliationBaxter, Jon, College of Engineering and Computer Science, ANU
local.contributor.authoruidGreensmith, Evan, u4005284
local.contributor.authoruidBartlett, Peter, u9301805
local.contributor.authoruidBaxter, Jon, u9612464
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.absfor080109 - Pattern Recognition and Data Mining
local.identifier.ariespublicationMigratedxPub4429
local.type.statusPublished Version

Downloads