Skip navigation
Skip navigation

Gradient based algorithms with loss functions and kernels for improved on-policy control

Robards, Matthew; Sunehag, Peter

Description

We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorithms with function approximation - one model based, and the other model free. These algorithms come with the possibility of having non-squared loss functions which is novel in reinforcement learning, and seems to come with empirical advantages. We further extend a previous gradient based algorithm to the case of full control, by using generalized policy iteration. Theoretical properties of these...[Show more]

dc.contributor.authorRobards, Matthew
dc.contributor.authorSunehag, Peter
dc.date.accessioned2015-12-10T23:32:32Z
dc.identifier.issn0302-9743
dc.identifier.urihttp://hdl.handle.net/1885/68876
dc.description.abstractWe introduce and empirically evaluate two novel online gradient-based reinforcement learning algorithms with function approximation - one model based, and the other model free. These algorithms come with the possibility of having non-squared loss functions which is novel in reinforcement learning, and seems to come with empirical advantages. We further extend a previous gradient based algorithm to the case of full control, by using generalized policy iteration. Theoretical properties of these algorithms are studied in a companion paper.
dc.publisherSpringer
dc.sourceLecture Notes in Computer Science (LNCS)
dc.subjectKeywords: Full control; Function approximation; Gradient based; Gradient based algorithm; Loss functions; Model free; Model-based OPC; Policy iteration; Learning algorithms; Reinforcement learning
dc.titleGradient based algorithms with loss functions and kernels for improved on-policy control
dc.typeJournal article
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.citationvolume7188
dc.date.issued2012
local.identifier.absfor080101 - Adaptive Agents and Intelligent Robotics
local.identifier.ariespublicationf5625xPUB1854
local.type.statusPublished Version
local.contributor.affiliationRobards, Matthew, College of Engineering and Computer Science, ANU
local.contributor.affiliationSunehag, Peter, College of Engineering and Computer Science, ANU
local.description.embargo2037-12-31
local.bibliographicCitation.startpage30
local.bibliographicCitation.lastpage41
local.identifier.doi10.1007/978-3-642-29946-9_7
local.identifier.absseo970108 - Expanding Knowledge in the Information and Computing Sciences
dc.date.updated2016-02-24T08:51:23Z
local.identifier.scopusID2-s2.0-84861701646
CollectionsANU Research Publications

Download

File Description SizeFormat Image
01_Robards_Gradient_based_algorithms_with_2012.pdf872.92 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator