Concurrent probabilistic temporal planning with policy-gradients

Aberdeen, Douglas; Buffet, Olivier

Concurrent probabilistic temporal planning with policy-gradients

Date

2007

Authors

Aberdeen, Douglas

Buffet, Olivier

Publisher

AAAI Press

Abstract

We present an any-time concurrent probabilistic temporal planner that includes continuous and discrete uncertainties and metric functions. Our approach is a direct policy search that attempts to optimise a parameterised policy using gradient ascent. Low memory use, plus the use of function approximation methods, plus factorisation of the policy, allow us to scale to challenging domains. This Factored Policy Gradient (FPG) Planner also attempts to optimise both steps to goal and the probability of success. We compare the FPG planner to other planners on CPTP domains, and on simpler but better studied probabilistic non-temporal domains.

Keywords

Keywords: Approximation theory; Probability; Scheduling; Do-mains; Function approximations; Gradient ascents; Low memories; Metric functions; Policy searches; Probability of successes; Temporal domains; Temporal planning; Planning

URI

http://hdl.handle.net/1885/38924

Collections

ANU Research Publications

Source

Proceedings of The 17th International Conference on Automated Planning and Scheduling (ICAPS 2007)

Type

Conference paper

Restricted until

2037-12-31

Downloads

File

Description

01_Aberdeen_Concurrent_probabilistic_2007.pdf (441.84 KB)

Full item page

Cultural advice

Concurrent probabilistic temporal planning with policy-gradients

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads