Variance reduction techniques for gradient estimates in reinforcement learning
| dc.contributor.author | Greensmith, Evan | |
| dc.contributor.author | Bartlett, Peter | |
| dc.contributor.author | Baxter, Jon | |
| dc.coverage.spatial | Cambridge USA | |
| dc.date.accessioned | 2015-12-13T22:30:51Z | |
| dc.date.available | 2015-12-13T22:30:51Z | |
| dc.date.created | 2002-09-02 | |
| dc.date.issued | 2002 | |
| dc.date.updated | 2015-12-11T08:57:01Z | |
| dc.identifier.isbn | 0262042088 | |
| dc.identifier.uri | http://hdl.handle.net/1885/75024 | |
| dc.publisher | MIT Press | |
| dc.relation.ispartofseries | Conference on Advances in Neural Information Processing Systems (NIPS 2002) | |
| dc.source | Advances in Neural Information Processing Systems 14 | |
| dc.title | Variance reduction techniques for gradient estimates in reinforcement learning | |
| dc.type | Conference paper | |
| local.bibliographicCitation.lastpage | 1514 | |
| local.bibliographicCitation.startpage | 1507 | |
| local.contributor.affiliation | Greensmith, Evan, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Bartlett, Peter, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Baxter, Jon, College of Engineering and Computer Science, ANU | |
| local.contributor.authoruid | Greensmith, Evan, u4005284 | |
| local.contributor.authoruid | Bartlett, Peter, u9301805 | |
| local.contributor.authoruid | Baxter, Jon, u9612464 | |
| local.description.notes | Imported from ARIES | |
| local.description.refereed | Yes | |
| local.identifier.absfor | 080109 - Pattern Recognition and Data Mining | |
| local.identifier.ariespublication | MigratedxPub4429 | |
| local.type.status | Published Version | |