Unbounded dynamic programming via the Q-transform

dc.contributor.authorMa, Qingyin
dc.contributor.authorStachurski, John
dc.contributor.authorToda, Alexis Akira
dc.date.accessioned2024-09-23T02:24:23Z
dc.date.available2024-09-23T02:24:23Z
dc.date.issued2022
dc.date.updated2024-03-17T07:15:17Z
dc.description.abstractWe propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, however, the objective of the transform is not learning. Rather, it is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems for which existing methods struggle, and yet simple relative to other techniques and accessible for applied work. We show by example that a variety of common decision problems satisfy our conditions.
dc.description.sponsorshipQingyin Ma gratefully acknowledges the financial support from NSFC, China No. 72003138 and CUEB Startup, China Grant XRZ2020029.
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.issn0304-4068
dc.identifier.urihttps://hdl.handle.net/1885/733720807
dc.language.isoen_AUen_AU
dc.provenancehttps://v2.sherpa.ac.uk/id/publication/13999/..."published version can be archived in institutional repository" from SHERPA/RoMEO site as at 25/09/2024
dc.publisherElsevier
dc.relationhttp://purl.org/au-research/grants/arc/DP120100321
dc.rights© 2022 The authors
dc.sourceJournal of Mathematical Economics
dc.subjectDynamic programming
dc.subjectOptimality
dc.subjectReinforcement learning
dc.titleUnbounded dynamic programming via the Q-transform
dc.typeJournal article
dcterms.accessRightsOpen Access
local.contributor.affiliationMa, Qingyin, Capital University of Economics and Business
local.contributor.affiliationStachurski, John, College of Business and Economics, ANU
local.contributor.affiliationToda, Alexis Akira, University of California San Diego
local.contributor.authoruidStachurski, John, u3915156
local.description.notesImported from ARIES
local.identifier.absfor380303 - Mathematical economics
local.identifier.absseo150302 - Management
local.identifier.ariespublicationa383154xPUB26243
local.identifier.citationvolume100
local.identifier.doi10.1016/j.jmateco.2022.102652
local.identifier.scopusID2-s2.0-85125302776
local.publisher.urlhttps://www.sciencedirect.com/
local.type.statusAccepted Version
publicationvolume.volumeNumber100

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2012.00219v2.pdf
Size:
420.57 KB
Format:
Adobe Portable Document Format