Heuristic search in dual space for Constrained stochastic shortest path problems
Trevizan, Felipe; Thiebaux, Sylvie; Santana, Pedro; Williams, Brian
Description
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resourcebounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming...[Show more]
dc.contributor.author | Trevizan, Felipe | |
---|---|---|
dc.contributor.author | Thiebaux, Sylvie | |
dc.contributor.author | Santana, Pedro | |
dc.contributor.author | Williams, Brian | |
dc.coverage.spatial | London, United Kingdom | |
dc.date.accessioned | 2018-11-30T01:19:42Z | |
dc.date.available | 2018-11-30T01:19:42Z | |
dc.date.created | June 12-17 2016 | |
dc.identifier.isbn | 9781577357575 | |
dc.identifier.uri | http://hdl.handle.net/1885/154160 | |
dc.description.abstract | We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resourcebounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming algorithms which explore the entire state space. In this paper, we present i-dual, which, to the best of our knowledge, is the first heuristic search algorithm for constrained SSPs. To concisely represent constraints and efficiently decide their violation, i-dual operates in the space of dual variables describing the policy occupation measures. It does so while retaining the ability to use standard value function heuristics computed by well-known methods. Our experiments on a suite of PPDDL problems augmented with constraints show that these features enable i-dual to achieve up to two orders of magnitude improvement in run-time and memory over linear programming algorithms | |
dc.format.mimetype | application/pdf | |
dc.publisher | Association for the Advancement of Artificial Intelligence (AAAI) | |
dc.relation.ispartofseries | International Conference on Automated Planning and Scheduling (ICAPS 2016) | |
dc.source | Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling (ICAPS 2016) | |
dc.source.uri | http://icaps16.icaps-conference.org/index.html | |
dc.title | Heuristic search in dual space for Constrained stochastic shortest path problems | |
dc.type | Conference paper | |
local.description.notes | Imported from ARIES | |
local.description.refereed | Yes | |
dc.date.issued | 2016 | |
local.identifier.absfor | 080101 - Adaptive Agents and Intelligent Robotics | |
local.identifier.ariespublication | U3488905xPUB25290 | |
local.type.status | Published Version | |
local.contributor.affiliation | Trevizan, Felipe, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Thiebaux, Sylvie, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Santana, Pedro, MERS Group, MIT | |
local.contributor.affiliation | Williams, Brian, MERS Group, MIT | |
local.bibliographicCitation.startpage | 326 | |
local.bibliographicCitation.lastpage | 334 | |
dc.date.updated | 2018-11-29T08:22:03Z | |
local.identifier.scopusID | 2-s2.0-84989811775 | |
dcterms.accessRights | Open Access | |
Collections | ANU Research Publications |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
01_Trevizan_Heuristic_search_in_dual_space_2016.pdf | 528.11 kB | Adobe PDF | ![]() |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator