Decision-theoretic planning with non-Markovian rewards
| dc.contributor.author | Thiebaux, Sylvie M | |
| dc.contributor.author | Gretton, Charles | |
| dc.contributor.author | Slaney, John K | |
| dc.contributor.author | Price, David | |
| dc.contributor.author | Kabanza, Froduald | |
| dc.date.accessioned | 2009-04-16T01:59:23Z | en_US |
| dc.date.accessioned | 2010-12-20T06:04:19Z | |
| dc.date.available | 2009-04-16T01:59:23Z | en_US |
| dc.date.available | 2010-12-20T06:04:19Z | |
| dc.date.issued | 2006 | en_US |
| dc.date.updated | 2015-12-08T02:53:50Z | |
| dc.description.abstract | A decision process in which rewards depend on history rather than merely on the current state is called a decision process with non-Markovian rewards (NMRDP). In decision-theoretic planning, where many desirable behaviours are more naturally expressed as | |
| dc.format | 58 pages | |
| dc.identifier.citation | Journal of Artificial Intelligence Research 25 (2006): 17-74 | |
| dc.identifier.issn | 1076-9757 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10440/32 | en_US |
| dc.identifier.uri | http://digitalcollections.anu.edu.au/handle/10440/32 | |
| dc.publisher | Morgan Kauffman Publishers | |
| dc.source | Journal of Artificial Intelligence Research | |
| dc.source.uri | http://www.jair.org//papers/paper1676.html | en_US |
| dc.subject | Keywords: Computer software; Decision theory; Dynamic programming; Formal logic; Functions; Heuristic methods; Planning; Hand-coded tracks; Markovian decision process (MDP) model; Non-Markovian rewards (NMRDP); Software platform; Artificial intelligence | |
| dc.title | Decision-theoretic planning with non-Markovian rewards | |
| dc.type | Journal article | |
| local.bibliographicCitation.lastpage | 74 | |
| local.bibliographicCitation.startpage | 17 | |
| local.contributor.affiliation | Thiebaux, Sylvie M, National ICT Australia and ANU | en_US |
| local.contributor.affiliation | Gretton, Charles, National ICT Australia and ANU | en_US |
| local.contributor.affiliation | Slaney, John K, National ICT Australia and ANU | en_US |
| local.contributor.affiliation | Price, David, National ICT Australia and ANU | en_US |
| local.contributor.affiliation | Kabanza, Froduald, University of Sherbrooke | en_US |
| local.contributor.authoruid | u4033066 | en_US |
| local.contributor.authoruid | u3223587 | en_US |
| local.contributor.authoruid | u8800435 | en_US |
| local.contributor.authoruid | u4017176 | en_US |
| local.identifier.absfor | 080199 | en_US |
| local.identifier.ariespublication | u8803936xPUB7 | en_US |
| local.identifier.citationvolume | 25 | |
| local.identifier.scopusID | 2-s2.0-33744462367 | |
| local.type.status | Published Version | en_US |
Downloads
Original bundle
1 - 1 of 1
Loading...
- Name:
- Thiebaux_Decision2006.pdf
- Size:
- 682.74 KB
- Format:
- Adobe Portable Document Format