
Universal reinforcement learning algorithms: Survey and experiments

Aslanides, John; Leike, Jan; Hutter, Marcus

Description

Many state-of-the-art reinforcement learning (RL) algorithms typically assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms that make as few assumptions as possible about the environment. The universal Bayesian agent AIXI and a family of related URL algorithms have been developed in this setting. While numerous theoretical optimality results have been proven for these agents, there...

Collections: ANU Research Publications
Date published: 2017
Type: Conference paper
URI: http://hdl.handle.net/1885/205778
Source: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17)
DOI: 10.24963/ijcai.2017/194
Access Rights: Open Access

Download

File: 01_Aslanides_Universal_reinforcement_2017.pdf
Size: 1.9 MB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
