Learning Generalised Policies for Numeric Planning.
| dc.contributor.author | Wang, Ryan Xiao | en |
| dc.contributor.author | Thiébaux, Sylvie | en |
| dc.date.accessioned | 2026-03-02T15:41:04Z | |
| dc.date.available | 2026-03-02T15:41:04Z | |
| dc.date.issued | 2024 | en |
| dc.description.abstract | We extend Action Schema Networks (ASNets) to learn gen-eralised policies for numeric planning, which features quan-titative numeric state variables, preconditions and effects. Wepropose a neural network architecture that can reason aboutthe numeric variables both directly and in context of othervariables. We also develop a dynamic exploration algorithmfor more efficient training, by better balancing the explo-ration versus learning tradeoff to account for the greater com-putational demand of numeric teacher planners. Experimen-tally, we find that the learned generalised policies are capableof outperforming traditional numeric planners on some do-mains, and the dynamic exploration algorithm to be on aver-age much faster at learning effective generalised policies thanthe original ASNets training algorithm | en |
| dc.description.status | Peer-reviewed | en |
| dc.format.extent | 10 | en |
| dc.identifier.other | dblp:conf/icaps/WangT24 | en |
| dc.identifier.scopus | 85195897515 | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733806996 | |
| dc.relation.ispartof | ICAPS | en |
| dc.rights | DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions. | en |
| dc.title | Learning Generalised Policies for Numeric Planning. | en |
| dc.type | Conference paper | en |
| dspace.entity.type | Publication | en |
| local.bibliographicCitation.lastpage | 642 | en |
| local.bibliographicCitation.startpage | 633 | en |
| local.contributor.affiliation | Wang, Ryan Xiao; School of Computing | en |
| local.contributor.affiliation | Thiébaux, Sylvie; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.identifier.doi | 10.1609/icaps.v34i1.31526 | en |
| local.identifier.pure | a84f297e-ecfc-4f8d-8df6-1a9a5d63fffd | en |
| local.type.status | Published | en |