Exact Reduction of Huge Action Spaces in General Reinforcement Learning

Majeed, Sultan; Hutter, Marcus

Exact Reduction of Huge Action Spaces in General Reinforcement Learning

Date

2021

Authors

Majeed, Sultan

Hutter, Marcus

Publisher

The AAAI Press

Abstract

The reinforcement learning (RL) framework formalizes the notion of learning with interactions. Many real-world problems have large state-spaces and/or action-spaces such as in Go, StarCraft, protein folding, and robotics or are non-Markovian, which cause significant challenges to RL algorithms. In this work we address the large action-space problem by sequentializing actions, which can reduce the action-space size significantly, even down to two actions at the expense of an increased planning horizon. We provide explicit and exact constructions and equivalence proofs for all quantities of interest for arbitrary history-based processes. In the case of MDPs, this could help RL algorithms that bootstrap. In this work we show how action-binarization in the nonMDP case can significantly improve Extreme State Aggregation (ESA) bounds. ESA allows casting any (non-MDP, non-ergodic, history-based) RL problem into a fixed-sized non-Markovian state-space with the help of a surrogate Markovian process. On the upside, ESA enjoys similar optimality guarantees as Markovian models do. But a downside is that the size of the aggregated state-space becomes exponential in the size of the action-space. In this work, we patch this issue by binarizing the action-space. We provide an upper bound on the number of states of this binarized ESA that is logarithmic in the original action-space size, a double-exponential improvement.

URI

http://hdl.handle.net/1885/312404

Collections

ANU Research Publications

Type

Conference paper

Access Statement

Free Access via publisher website

DOI

10.48550/arXiv.2012.10200

Restricted until

2099-12-31

Downloads

File

Description

17074-Article Text-20568-1-2-20210518.pdf (168.27 KB)

Full item page

Cultural advice

Exact Reduction of Huge Action Spaces in General Reinforcement Learning

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads