A Monte-Carlo AIXI approximation
This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it
|Collections||ANU Research Publications|
|Source:||Journal of Artificial Intelligence Research|
|Access Rights:||Open Access via publisher website|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.