Skip navigation
Skip navigation

Feature Dynamic Bayesian Networks

Hutter, Marcus

Description

Feature Markov Decision Processes (MDPs) [Hut09] are well-suited for learning agents in general environments. Nevertheless, unstructured ()MDPs are limited to rela- tively simple environments. Structured MDPs like Dynamic Bayesian Networks (DBNs) are used for large-scale real- world problems. In this article I extend MDP to DBN. The primary contribution is to derive a cost criterion that al- lows to automatically extract the most relevant features from the environment, leading to the "best" DBN...[Show more]

dc.contributor.authorHutter, Marcus
dc.date.accessioned2015-08-26T05:32:20Z
dc.date.available2015-08-26T05:32:20Z
dc.identifier.isbn9789078677246
dc.identifier.urihttp://hdl.handle.net/1885/14961
dc.description.abstractFeature Markov Decision Processes (MDPs) [Hut09] are well-suited for learning agents in general environments. Nevertheless, unstructured ()MDPs are limited to rela- tively simple environments. Structured MDPs like Dynamic Bayesian Networks (DBNs) are used for large-scale real- world problems. In this article I extend MDP to DBN. The primary contribution is to derive a cost criterion that al- lows to automatically extract the most relevant features from the environment, leading to the "best" DBN representation. I discuss all building blocks required for a complete general learning algorithm.
dc.publisherAtlantis Press
dc.relation.ispartofArtificial general intelligence: proceedings of the second conference on Artificial General Intelligence, AGI 2009, Arlington, Virginia, USA, March 6-9, 2009
dc.rights© Atlantis Press and the Author(s). This article is distributed under the terms of the Creative Commons Attribution License, which permits non-commercial use, distribution and reproduction in any medium, provided the original work is properly cited.
dc.subjectReinforcement learning
dc.subjectdynamic Bayesian network
dc.subjectstructure learning
dc.subjectfeature learning
dc.subjectexplore-exploit
dc.titleFeature Dynamic Bayesian Networks
dc.typeConference paper
dc.date.issued2009-05
local.type.statusPublished Version
local.contributor.affiliationHutter, M., Research School of Computer Science, The Australian National University
local.bibliographicCitation.startpage1
local.bibliographicCitation.lastpage6
local.identifier.doi10.2991/agi.2009.6
CollectionsANU Research Publications

Download

File Description SizeFormat Image
Hutter Feature Dynamic 2009.pdf411.76 kBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator