Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Gradient-based Reinforcement Planning in Policy-Search Methods

dc.contributor.authorKwee, Ivo
dc.contributor.authorHutter, Marcus
dc.contributor.authorSchmidhuber, Jürgen
dc.date.accessioned2015-09-04T00:06:57Z
dc.date.available2015-09-04T00:06:57Z
dc.date.issued2001
dc.description.abstractWe introduce a learning method called "gradient-based reinforcement planning" (GREP). Unlike traditional DP methods that improve their policy backwards in time, GREP is a gradient-based method that plans ahead and improves its policy {\em before} it actually acts in the environment. We derive formulas for the exact policy gradient that maximizes the expected future reward and confirm our ideas with numerical experiments.en_AU
dc.identifier.isbn90-393-2874-9en_AU
dc.identifier.issn1389-5184en_AU
dc.identifier.urihttp://hdl.handle.net/1885/15170
dc.publisherUtrecht Universityen_AU
dc.relation.ispartofProceedings of the fifth European Workshop on Reinforcement Learning (EWRL-5)en_AU
dc.rights© The Author(s)en_AU
dc.titleGradient-based Reinforcement Planning in Policy-Search Methodsen_AU
dc.typeConference paperen_AU
local.bibliographicCitation.lastpage29en_AU
local.bibliographicCitation.startpage27en_AU
local.contributor.affiliationHutter, M., Research School of Computer Science, The Australian National Universityen_AU
local.contributor.authoruidu4350841en_AU
local.identifier.citationvolume27en_AU
local.type.statusPublished Versionen_AU

Downloads

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
884 B
Format:
Item-specific license agreed upon to submission
Description:
abcd