Feature Markov Decision Processes
Abstract
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observations, actions, and rewards. On the other hand, reinforcement learning is well-developed for small finite state Markov Decision Processes (MDPs). So far it is
Description
Citation
Collections
Source
Advances in Intelligent Systems Research: Proceedings of the 2nd Conference on Artificial General Intelligence (AGI 2009)
Type
Book Title
Entity type
Access Statement
License Rights
Restricted until
2037-12-31