Feature reinforcement learning in practice
-
Altmetric Citations
Nguyen, Phuong; Sunehag, Peter; Hutter, Marcus
Description
Following a recent surge in using history-based methods for resolving perceptual aliasing in reinforcement learning, we introduce an algorithm based on the feature reinforcement learning framework called ΦMDP [13]. To create a practical algorithm we devi
dc.contributor.author | Nguyen, Phuong | |
---|---|---|
dc.contributor.author | Sunehag, Peter | |
dc.contributor.author | Hutter, Marcus | |
dc.date.accessioned | 2015-12-10T23:32:29Z | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/1885/68857 | |
dc.description.abstract | Following a recent surge in using history-based methods for resolving perceptual aliasing in reinforcement learning, we introduce an algorithm based on the feature reinforcement learning framework called ΦMDP [13]. To create a practical algorithm we devi | |
dc.publisher | Springer | |
dc.rights | Copyright Information: © Springer-Verlag Berlin Heidelberg 2011. http://www.sherpa.ac.uk/romeo/issn/0302-9743/..."Author's post-print on any open access repository after 12 months after publication" from SHERPA/RoMEO site (as at 20/08/15) | |
dc.source | Lecture Notes in Computer Science (LNCS) | |
dc.subject | Keywords: Bayesian mixture; Empirical evaluations; Parallel tempering; Perceptual aliasing; Proposal distribution; Q-learning; Stochastic search; Algorithms; Forestry; Trees (mathematics); Reinforcement learning; Algorithms; Forestry; Performance; Reinforcement | |
dc.title | Feature reinforcement learning in practice | |
dc.type | Journal article | |
local.description.notes | Imported from ARIES | |
local.description.refereed | Yes | |
local.identifier.citationvolume | 7188 | |
dc.date.issued | 2012 | |
local.identifier.absfor | 080101 - Adaptive Agents and Intelligent Robotics | |
local.identifier.ariespublication | f5625xPUB1849 | |
local.type.status | Published Version | |
local.contributor.affiliation | Nguyen, Phuong, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Sunehag, Peter, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Hutter, Marcus, College of Engineering and Computer Science, ANU | |
local.description.embargo | 2037-12-31 | |
local.bibliographicCitation.startpage | 66 | |
local.bibliographicCitation.lastpage | 77 | |
local.identifier.doi | 10.1007/978-3-642-29946-9_10 | |
local.identifier.absseo | 970108 - Expanding Knowledge in the Information and Computing Sciences | |
dc.date.updated | 2016-02-24T08:51:20Z | |
local.identifier.scopusID | 2-s2.0-84861654915 | |
Collections | ANU Research Publications |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
01_Nguyen_Feature_reinforcement_learning_2012.pdf | 399.21 kB | Adobe PDF | Request a copy | |
02_Nguyen_Feature_reinforcement_learning_2012.pdf | 399.21 kB | Adobe PDF | Request a copy |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator