Skip navigation
Skip navigation

Feature reinforcement learning in practice

Nguyen, Phuong; Sunehag, Peter; Hutter, Marcus

Description

Following a recent surge in using history-based methods for resolving perceptual aliasing in reinforcement learning, we introduce an algorithm based on the feature reinforcement learning framework called ΦMDP [13]. To create a practical algorithm we devi

dc.contributor.authorNguyen, Phuong
dc.contributor.authorSunehag, Peter
dc.contributor.authorHutter, Marcus
dc.date.accessioned2015-12-10T23:32:29Z
dc.identifier.issn0302-9743
dc.identifier.urihttp://hdl.handle.net/1885/68857
dc.description.abstractFollowing a recent surge in using history-based methods for resolving perceptual aliasing in reinforcement learning, we introduce an algorithm based on the feature reinforcement learning framework called ΦMDP [13]. To create a practical algorithm we devi
dc.publisherSpringer
dc.rightsCopyright Information: © Springer-Verlag Berlin Heidelberg 2011. http://www.sherpa.ac.uk/romeo/issn/0302-9743/..."Author's post-print on any open access repository after 12 months after publication" from SHERPA/RoMEO site (as at 20/08/15)
dc.sourceLecture Notes in Computer Science (LNCS)
dc.subjectKeywords: Bayesian mixture; Empirical evaluations; Parallel tempering; Perceptual aliasing; Proposal distribution; Q-learning; Stochastic search; Algorithms; Forestry; Trees (mathematics); Reinforcement learning; Algorithms; Forestry; Performance; Reinforcement
dc.titleFeature reinforcement learning in practice
dc.typeJournal article
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.citationvolume7188
dc.date.issued2012
local.identifier.absfor080101 - Adaptive Agents and Intelligent Robotics
local.identifier.ariespublicationf5625xPUB1849
local.type.statusPublished Version
local.contributor.affiliationNguyen, Phuong, College of Engineering and Computer Science, ANU
local.contributor.affiliationSunehag, Peter, College of Engineering and Computer Science, ANU
local.contributor.affiliationHutter, Marcus, College of Engineering and Computer Science, ANU
local.description.embargo2037-12-31
local.bibliographicCitation.startpage66
local.bibliographicCitation.lastpage77
local.identifier.doi10.1007/978-3-642-29946-9_10
local.identifier.absseo970108 - Expanding Knowledge in the Information and Computing Sciences
dc.date.updated2016-02-24T08:51:20Z
local.identifier.scopusID2-s2.0-84861654915
CollectionsANU Research Publications

Download

File Description SizeFormat Image
01_Nguyen_Feature_reinforcement_learning_2012.pdf399.21 kBAdobe PDF    Request a copy
02_Nguyen_Feature_reinforcement_learning_2012.pdf399.21 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  20 July 2017/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator