Concurrent probabilistic temporal planning with policy-gradients
Date
2007
Authors
Aberdeen, Douglas
Buffet, Olivier
Journal Title
Journal ISSN
Volume Title
Publisher
AAAI Press
Abstract
We present an any-time concurrent probabilistic temporal planner that includes continuous and discrete uncertainties and metric functions. Our approach is a direct policy search that attempts to optimise a parameterised policy using gradient ascent. Low memory use, plus the use of function approximation methods, plus factorisation of the policy, allow us to scale to challenging domains. This Factored Policy Gradient (FPG) Planner also attempts to optimise both steps to goal and the probability of success. We compare the FPG planner to other planners on CPTP domains, and on simpler but better studied probabilistic non-temporal domains.
Description
Keywords
Keywords: Approximation theory; Probability; Scheduling; Do-mains; Function approximations; Gradient ascents; Low memories; Metric functions; Policy searches; Probability of successes; Temporal domains; Temporal planning; Planning
Citation
Collections
Source
Proceedings of The 17th International Conference on Automated Planning and Scheduling (ICAPS 2007)
Type
Conference paper
Book Title
Entity type
Access Statement
License Rights
DOI
Restricted until
2037-12-31
Downloads
File
Description