Lamont, Sean; Aslanides, John; Leike, Jan; Hutter, Marcus
In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are no examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform (AIXIjs) the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on an agent's policy. Using this, we investigate how geometric, hyperbolic and power discounting affect...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.