Extreme State Aggregation beyond MDPs

Hutter, Marcus

doi:10.1007/978-3-319-11662-4_14

New Open Research Repository launching soon. This site will be offline from 5 to 9pm, Monday 20 May, while the new site goes live!

Extreme State Aggregation beyond MDPs

Download (204.89 kB)

link to publisher version

Altmetric Citations

Hutter, Marcus

Description

We consider a Reinforcement Learning setup without any (esp. MDP) assumptions on the environment. State aggregation and more generally feature reinforcement learning is concerned with mapping histories/raw-states to reduced/aggregated states. The idea behind both is that the resulting reduced process (approximately) forms a small stationary finite-state MDP, which can then be efficiently solved or learnt. We considerably generalize existing aggregation results by showing that even if the...[Show more] reduced process is not an MDP, the (q-)value functions and (optimal) policies of an associated MDP with same state-space size solve the original problem, as long as the solution can approximately be represented as a function of the reduced states. This implies an upper bound on the required state space size that holds uniformly for all RL problems. It may also explain why RL algorithms designed for MDPs sometimes perform well beyond MDPs.

Collections	ANU Research Publications
Date published:	2014-10
Type:	Conference paper
URI:	http://hdl.handle.net/1885/14699
Book Title:	Algorithmic Learning Theory: 25th International Conference, ALT 2014, Bled, Slovenia, October 8-10, 2014. Proceedings
DOI:	10.1007/978-3-319-11662-4_14
Access Rights:	Open Access

Download

File	Description	Size	Format	Image
Hutter Extreme State Aggregation 2014.pdf		204.89 kB	Adobe PDF

Show full item record

Extreme State Aggregation beyond MDPs

Altmetric Citations

Description

Download