Skip navigation
Skip navigation

Extreme State Aggregation beyond MDPs

Hutter, Marcus

Description

We consider a Reinforcement Learning setup without any (esp. MDP) assumptions on the environment. State aggregation and more generally feature reinforcement learning is concerned with mapping histories/raw-states to reduced/aggregated states. The idea behind both is that the resulting reduced process (approximately) forms a small stationary finite-state MDP, which can then be efficiently solved or learnt. We considerably generalize existing aggregation results by showing that even if the...[Show more]

CollectionsANU Research Publications
Date published: 2014-10
Type: Conference paper
URI: http://hdl.handle.net/1885/14699
DOI: 10.1007/978-3-319-11662-4_14

Download

File Description SizeFormat Image
Hutter Extreme State Aggregation 2014.pdf204.89 kBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  12 November 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator