Context tree maximizing reinforcement learning
Nguyen, Phuong; Sunehag, Peter; Hutter, Marcus
Description
Recent developments in reinforcement learning for non-Markovian problems witness a surge in history-based methods, among which we are particularly interested in two frameworks, ΦMDP and MC-AIXI-CTW. ΦMDP attempts to reduce the general RL problem, where
Collections | ANU Research Publications |
---|---|
Date published: | 2012 |
Type: | Conference paper |
URI: | http://hdl.handle.net/1885/68805 |
Source: | Proceedings of the National Conference on Artificial Intelligence |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
01_Nguyen_Context_tree_maximizing_2012.pdf | 501.87 kB | Adobe PDF | Request a copy |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator