The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

Date

2001

Authors

Weaver, L
Tao, Nigel

Publisher

Morgan Kaufmann Publishers

Source

Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference (2001)

Type

Conference paper