RAO*: An algorithm for chance-constrained POMDP's
Authors
Santana, Pedro
Thiebaux, Sylvie
Williams, Brian C.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Abstract
Autonomous agents operating in partially observable stochastic environments often face the problem of optimizing expected performance while bounding the risk of violating safety constraints. Such problems can be modeled as chance-constrained POMDP’s (CCPOMDP’s). Our first contribution is a systematic derivation of execution risk in POMDP domains, which improves upon how chance constraints are handled in the constrained POMDP literature. Second, we present RAO*, a heuristic forward search algorithm producing optimal, deterministic, finite-horizon policies for CCPOMDP’s. In addition to the utility heuristic, RAO* leverages an admissible execution risk heuristic to quickly detect and prune overly risky policy branches. Third, we demonstrate the usefulness of RAO* in two challenging domains of practical interest: power supply restoration and autonomous science agents.
Source
30th AAAI Conference on Artificial Intelligence, AAAI 2016
Access Statement
Free Access via publisher website
Restricted until
2099-12-31