Gradient based algorithms with loss functions and kernels for improved on-policy control

Robards, Matthew; Sunehag, Peter

Date

2012

Authors

Robards, Matthew

Sunehag, Peter

Publisher

Springer

Abstract

We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorithms with function approximation: one model-based and the other model-free. These algorithms allow non-squared loss functions, which is novel in reinforcement learning and appears to bring empirical advantages. We further extend a previous gradient-based algorithm to the case of full control by using generalized policy iteration. Theoretical properties of these algorithms are studied in a companion paper.

Keywords

Full control; Function approximation; Gradient based; Gradient based algorithm; Loss functions; Model free; Model-based OPC; Policy iteration; Learning algorithms; Reinforcement learning

URI

http://hdl.handle.net/1885/68876

Collections

ANU Research Publications

Source

Lecture Notes in Computer Science (LNCS)

Type

Journal article

DOI

10.1007/978-3-642-29946-9_7

Restricted until

2037-12-31

Downloads

01_Robards_Gradient_based_algorithms_with_2012.pdf (872.92 KB)
