Skip navigation
Skip navigation

Axioms for rational reinforcement learning

Sunehag, Peter; Hutter, Marcus

Description

We provide a formal, simple and intuitive theory of rational decision making including sequential decisions that affect the environment. The theory has a geometric flavor, which makes the arguments easy to visualize and understand. Our theory is for complete decision makers, which means that they have a complete set of preferences. Our main result shows that a complete rational decision maker implicitly has a probabilistic model of the environment. We have a countable version of this result...[Show more]

CollectionsANU Research Publications
Date published: 2011-10
Type: Conference paper
URI: http://hdl.handle.net/1885/14813
DOI: 10.1007/978-3-642-24412-4_27

Download

File Description SizeFormat Image
Sunehag and Hutter Axioms for Rational Reinforcement Learning 2011.pdf125.2 kBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  12 November 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator