
Axioms for Rational Reinforcement Learning

Sunehag, Peter; Hutter, Marcus

Description

We provide a formal, simple and intuitive theory of rational decision making including sequential decisions that affect the environment. The theory has a geometric flavor, which makes the arguments easy to visualize and understand. Our theory is for complete decision makers, which means that they have a complete set of preferences. Our main result shows that a complete rational decision maker implicitly has a probabilistic model of the environment. We have a countable version of this result...

Collections: ANU Research Publications
Date published: 2011
Type: Conference paper
URI: http://hdl.handle.net/1885/37947
Source: Lecture Notes in Artificial Intelligence 6925
DOI: 10.1007/978-3-642-24412-4_27

Download

File: 01_Sunehag_Axioms_for_Rational_2011.pdf
Size: 237.84 kB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
