Bounded approximate symbolic dynamic programming for hybrid MDPs

dc.contributor.authorVianna, Luis Gustavo Rochaen
dc.contributor.authorSanner, Scotten
dc.contributor.authorDe Barros, Leliane Nunesen
dc.date.accessioned2025-12-31T21:41:40Z
dc.date.available2025-12-31T21:41:40Z
dc.date.issued2013en
dc.description.abstractRecent advances in symbolic dynamic programming (SDP) combined with the extended algebraic decision diagram (XADD) data structure have provided exact solutions for mixed discrete and continuous (hybrid) MDPs with piecewise linear dynamics and continuous actions. Since XADD-based exact solutions may grow intractably large for many problems, we propose a bounded error compression technique for XADDs that involves the solution of a constrained bilinear saddle point problem. Fortuitously, we show that given the special structure of this problem, it can be expressed as a bilevel linear programming problem and solved to optimality in finite time via constraint generation, despite having an infinite set of constraints. This solution permits the use of efficient linear program solvers for XADD compression and enables a novel class of bounded approximate SDP algorithms for hybrid MDPs that empirically offers order-of-magnitude speedups over the exact solution in exchange for a small approximation error.en
dc.description.statusPeer-revieweden
dc.format.extent10en
dc.identifier.scopus84888179891en
dc.identifier.urihttps://hdl.handle.net/1885/733798184
dc.language.isoenen
dc.relation.ispartofseries29th Conference on Uncertainty in Artificial Intelligence, UAI 2013en
dc.titleBounded approximate symbolic dynamic programming for hybrid MDPsen
dc.typeConference paperen
dspace.entity.typePublicationen
local.bibliographicCitation.lastpage683en
local.bibliographicCitation.startpage674en
local.contributor.affiliationVianna, Luis Gustavo Rocha; Universidade de São Pauloen
local.contributor.affiliationSanner, Scott; School of Computing, ANU College of Systems and Society, The Australian National Universityen
local.contributor.affiliationDe Barros, Leliane Nunes; Universidade de São Pauloen
local.identifier.ariespublicationu4334215xPUB1235en
local.identifier.pure10c2bb3f-3c19-421e-a582-1b25f3b57645en
local.identifier.urlhttps://www.scopus.com/pages/publications/84888179891en
local.type.statusPublisheden

Downloads