Axioms for Rational Reinforcement Learning
We provide a formal, simple and intuitive theory of rational decision making including sequential decisions that affect the environment. The theory has a geometric flavor, which makes the arguments easy to visualize and understand. Our theory is for complete decision makers, which means that they have a complete set of preferences. Our main result shows that a complete rational decision maker implicitly has a probabilistic model of the environment. We have a countable version of this result...
Collections: ANU Research Publications
Source: Lecture Notes in Artificial Intelligence 6925
File: 01_Sunehag_Axioms_for_Rational_2011.pdf (237.84 kB, Adobe PDF)