A Monte-Carlo AIXI Approximation

Veness, Joel; Ng, Kee Siong; Hutter, Marcus; Uther, William; Silver, David

Description

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open...

Collections: ANU Research Publications
Date published: 2011-01
Type: Journal article
URI: http://hdl.handle.net/1885/14906
Source: Journal of Artificial Intelligence Research
DOI: 10.1613/jair.3125

Download

File: Veness et al A Monte Carlo AIXI Approximation 2011.pdf
Size: 682.41 kB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
