
Axioms for Rational Reinforcement Learning

Sunehag, Peter; Hutter, Marcus


We provide a formal, simple and intuitive theory of rational decision making including sequential decisions that affect the environment. The theory has a geometric flavor, which makes the arguments easy to visualize and understand. Our theory is for complete decision makers, which means that they have a complete set of preferences. Our main result shows that a complete rational decision maker implicitly has a probabilistic model of the environment. We have a countable version of this result...

Collections: ANU Research Publications
Date published: 2011
Type: Conference paper
Source: Lecture Notes in Artificial Intelligence 6925
DOI: 10.1007/978-3-642-24412-4_27
Access Rights: Open Access


File: 01_Sunehag_Axioms_for_Rational_2011.pdf
Size: 237.84 kB
Format: Adobe PDF

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
