Loss-Calibrated Monte Carlo Action Selection

Abbasnejad, Ehsan; Domke, Justin; Sanner, Scott

Date

2015

Authors

Abbasnejad, Ehsan

Domke, Justin

Sanner, Scott

Publisher

American Association for Artificial Intelligence (AAAI) Press

Abstract

Bayesian decision theory underpins robust decision-making in applications ranging from plant control to robotics, where hedging action selection against state uncertainty is critical for minimizing low-probability but potentially catastrophic outcomes (e.g., uncontrollable plant conditions or robots falling into stairwells). Unfortunately, belief state distributions in such settings are often complex and/or high-dimensional, thus prohibiting the efficient application of analytical techniques for expected utility computation when real-time control is required. This leaves Monte Carlo evaluation as one of the few viable (and hence frequently used) techniques for online action selection. However, loss-insensitive Monte Carlo methods may require large numbers of samples to identify optimal actions with high certainty, since they may sample from high-probability regions that do not disambiguate action utilities. In this paper, we remedy this problem by deriving an optimal proposal distribution for a loss-calibrated Monte Carlo importance sampler that bounds the regret of using an estimated optimal action. Empirically, we show that using our loss-calibrated Monte Carlo method yields high-accuracy optimal action selections in a fraction of the number of samples required by conventional loss-insensitive samplers.
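
The sampling issue described in the abstract can be illustrated with a minimal sketch. It is not the optimal proposal distribution derived in the paper. It assumes a hypothetical toy problem: a standard normal belief over a scalar state, two actions ("safe" and "risky"), a rare catastrophic region beyond a threshold `CLIFF`, and a hand-picked mixture proposal that places half of its mass near that region. All names, utilities, and constants are illustrative assumptions, and the estimator is ordinary importance sampling for expected utility.

```python
import math
import random

# Hypothetical toy problem (all values are illustrative assumptions).
ACTIONS = ("safe", "risky")
CLIFF = 3.0  # states beyond this threshold are catastrophic for "risky"


def normal_pdf(x, mean=0.0, std=1.0):
    """Density of a univariate normal distribution."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def utility(theta, action):
    """'safe' always yields 0; 'risky' yields +1 unless theta is past CLIFF."""
    if action == "safe":
        return 0.0
    return -2000.0 if theta > CLIFF else 1.0


def belief_pdf(theta):
    """Belief over the state: standard normal (an assumption)."""
    return normal_pdf(theta, 0.0, 1.0)


def proposal_sample(rng):
    """Hand-picked proposal: 50/50 mixture of the belief and N(CLIFF, 1)."""
    mean = 0.0 if rng.random() < 0.5 else CLIFF
    return rng.gauss(mean, 1.0)


def proposal_pdf(theta):
    return 0.5 * normal_pdf(theta, 0.0, 1.0) + 0.5 * normal_pdf(theta, CLIFF, 1.0)


def estimate_utilities(n, rng, use_proposal):
    """Monte Carlo estimate of expected utility for every action.

    With use_proposal=False, states are drawn from the belief itself
    (loss-insensitive sampling). With use_proposal=True, states are drawn
    from the proposal and reweighted by belief_pdf / proposal_pdf.
    """
    totals = {a: 0.0 for a in ACTIONS}
    for _ in range(n):
        if use_proposal:
            theta = proposal_sample(rng)
            weight = belief_pdf(theta) / proposal_pdf(theta)
        else:
            theta = rng.gauss(0.0, 1.0)
            weight = 1.0
        for a in ACTIONS:
            totals[a] += weight * utility(theta, a)
    return {a: totals[a] / n for a in ACTIONS}


def select_action(n, rng, use_proposal):
    """Return the action with the highest estimated expected utility."""
    estimates = estimate_utilities(n, rng, use_proposal)
    return max(estimates, key=estimates.get), estimates


if __name__ == "__main__":
    for use_proposal in (False, True):
        action, estimates = select_action(100, random.Random(0), use_proposal)
        print(use_proposal, action, estimates)
```

In this toy setup the true expected utility of "risky" is about -1.70, so "safe" is optimal, yet a sampler that draws only from the belief rarely visits the region past `CLIFF` and can miss the catastrophic outcome entirely at small sample sizes. The mixture proposal here is chosen by hand; the paper's contribution is to derive the proposal rather than guess it.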

URI

http://hdl.handle.net/1885/103805

Collections

ANU Research Publications

Source

HVAC-Aware Occupancy Scheduling

Type

Conference paper

Restricted until

2037-12-31

Downloads

File

Description

01_Abbasnejad_Loss-Calibrated_Monte_Carlo_2015.pdf (959.36 KB)
