Context tree maximizing reinforcement learning

Nguyen, Phuong; Sunehag, Peter; Hutter, Marcus

Description

Recent developments in reinforcement learning for non-Markovian problems witness a surge in history-based methods, among which we are particularly interested in two frameworks, ΦMDP and MC-AIXI-CTW. ΦMDP attempts to reduce the general RL problem, where

Collections: ANU Research Publications
Date published: 2012
Type: Conference paper
URI: http://hdl.handle.net/1885/68805
Source: Proceedings of the National Conference on Artificial Intelligence

Download

File: 01_Nguyen_Context_tree_maximizing_2012.pdf
Size: 501.87 kB
Format: Adobe PDF

