Unbounded dynamic programming via the Q-transform
| dc.contributor.author | Ma, Qingyin | |
| dc.contributor.author | Stachurski, John | |
| dc.contributor.author | Toda, Alexis Akira | |
| dc.date.accessioned | 2024-09-23T02:24:23Z | |
| dc.date.available | 2024-09-23T02:24:23Z | |
| dc.date.issued | 2022 | |
| dc.date.updated | 2024-03-17T07:15:17Z | |
| dc.description.abstract | We propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, however, the objective of the transform is not learning. Rather, it is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems for which existing methods struggle, and yet simple relative to other techniques and accessible for applied work. We show by example that a variety of common decision problems satisfy our conditions. | |
| dc.description.sponsorship | Qingyin Ma gratefully acknowledges the financial support from NSFC, China No. 72003138 and CUEB Startup, China Grant XRZ2020029. | |
| dc.format.mimetype | application/pdf | en_AU |
| dc.identifier.issn | 0304-4068 | |
| dc.identifier.uri | https://hdl.handle.net/1885/733720807 | |
| dc.language.iso | en_AU | en_AU |
| dc.provenance | https://v2.sherpa.ac.uk/id/publication/13999/..."published version can be archived in institutional repository" from SHERPA/RoMEO site as at 25/09/2024 | |
| dc.publisher | Elsevier | |
| dc.relation | http://purl.org/au-research/grants/arc/DP120100321 | |
| dc.rights | © 2022 The authors | |
| dc.source | Journal of Mathematical Economics | |
| dc.subject | Dynamic programming | |
| dc.subject | Optimality | |
| dc.subject | Reinforcement learning | |
| dc.title | Unbounded dynamic programming via the Q-transform | |
| dc.type | Journal article | |
| dcterms.accessRights | Open Access | |
| local.contributor.affiliation | Ma, Qingyin, Capital University of Economics and Business | |
| local.contributor.affiliation | Stachurski, John, College of Business and Economics, ANU | |
| local.contributor.affiliation | Toda, Alexis Akira, University of California San Diego | |
| local.contributor.authoruid | Stachurski, John, u3915156 | |
| local.description.notes | Imported from ARIES | |
| local.identifier.absfor | 380303 - Mathematical economics | |
| local.identifier.absseo | 150302 - Management | |
| local.identifier.ariespublication | a383154xPUB26243 | |
| local.identifier.citationvolume | 100 | |
| local.identifier.doi | 10.1016/j.jmateco.2022.102652 | |
| local.identifier.scopusID | 2-s2.0-85125302776 | |
| local.publisher.url | https://www.sciencedirect.com/ | |
| local.type.status | Accepted Version | |
| publicationvolume.volumeNumber | 100 |
Downloads
Original bundle
1 - 1 of 1