
Axioms for rational reinforcement learning

Sunehag, Peter; Hutter, Marcus


We provide a formal, simple and intuitive theory of rational decision making including sequential decisions that affect the environment. The theory has a geometric flavor, which makes the arguments easy to visualize and understand. Our theory is for complete decision makers, which means that they have a complete set of preferences. Our main result shows that a complete rational decision maker implicitly has a probabilistic model of the environment. We have a countable version of this result...

Collections: ANU Research Publications
Date published: 2011-10
Type: Conference paper
DOI: 10.1007/978-3-642-24412-4_27


File: Sunehag and Hutter Axioms for Rational Reinforcement Learning 2011.pdf
Size: 125.2 kB
Format: Adobe PDF

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
