Skip navigation
Skip navigation

A Monte-Carlo AIXI approximation

Veness, Joel; Ng, Kee Siong; Hutter, Marcus; Uther, William T.B.; Silver, David

Description

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it

CollectionsANU Research Publications
Date published: 2011
Type: Journal article
URI: http://hdl.handle.net/1885/68774
Source: Journal of Artificial Intelligence Research
DOI: 10.1613/jair.3125

Download

There are no files associated with this item.


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  20 July 2017/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator