Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Policy Gradient Methods: Variance Reduction and Stochastic Convergence

dc.contributor.authorGreensmith, Evanen_US
dc.date.accessioned2008-06-16T06:35:31Zen_US
dc.date.accessioned2011-01-04T02:38:37Z
dc.date.available2008-06-16T06:35:31Zen_US
dc.date.available2011-01-04T02:38:37Z
dc.date.issued2005
dc.description.abstractIn a reinforcement learning task an agent must learn a policy for performing actions so as to perform well in a given environment. Policy gradient methods consider a parameterized class of policies, and using a policy from the class, and a trajectory through the environment taken by the agent using this policy, estimate the performance of the policy with respect to the parameters. Policy gradient methods avoid some of the problems of value function methods, such as policy degradation, where inaccuracy in the value function leads to the choice of a poor policy. However, the estimates produced by policy gradient methods can have high variance. ¶ ...en_US
dc.identifier.otherb2247206x
dc.identifier.urihttp://hdl.handle.net/1885/47105
dc.language.isoenen_US
dc.rights.uriThe Australian National Universityen_US
dc.subjectreinforcement learningen_US
dc.subjectpolicy gradienten_US
dc.subjectstochastic convergenceen_US
dc.subjectvariance reductionen_US
dc.titlePolicy Gradient Methods: Variance Reduction and Stochastic Convergenceen_US
dc.typeThesis (PhD)en_US
dcterms.valid2005en_US
local.contributor.affiliationResearch School of Information Sciences and Engineering, Computer Sciences Laboratoryen_US
local.contributor.affiliationThe Australian National Universityen_US
local.description.refereedyesen_US
local.identifier.doi10.25911/5d7a2a01dcebe
local.mintdoimint
local.type.degreeDoctor of Philosophy (PhD)en_US

Downloads

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
02whole.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
01front.pdf
Size:
69.02 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.93 KB
Format:
Plain Text
Description:
abcd