Concurrent probabilistic temporal planning with policy-gradients

Date

2007

Authors

Aberdeen, Douglas
Buffet, Olivier

Journal Title

Journal ISSN

Volume Title

Publisher

AAAI Press

Abstract

We present an any-time concurrent probabilistic temporal planner that includes continuous and discrete uncertainties and metric functions. Our approach is a direct policy search that attempts to optimise a parameterised policy using gradient ascent. Low memory use, plus the use of function approximation methods, plus factorisation of the policy, allow us to scale to challenging domains. This Factored Policy Gradient (FPG) Planner also attempts to optimise both steps to goal and the probability of success. We compare the FPG planner to other planners on CPTP domains, and on simpler but better studied probabilistic non-temporal domains.

Description

Keywords

Keywords: Approximation theory; Probability; Scheduling; Do-mains; Function approximations; Gradient ascents; Low memories; Metric functions; Policy searches; Probability of successes; Temporal domains; Temporal planning; Planning

Citation

Source

Proceedings of The 17th International Conference on Automated Planning and Scheduling (ICAPS 2007)

Type

Conference paper

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

2037-12-31