Causal bandits: Learning good interventions via causal inference

Loading...
Thumbnail Image

Date

Authors

Lattimore, Finnian
Lattimore, Tor
Reid, Mark D.

Journal Title

Journal ISSN

Volume Title

Publisher

Access Statement

Research Projects

Organizational Units

Journal Issue

Abstract

We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit feedback that is not exploited by existing approaches. We propose a new algorithm that exploits the causal feedback and prove a bound on its simple regret that is strictly better (in all quantities) than algorithms that do not use the additional causal information.

Description

Keywords

Citation

Source

Advances in Neural Information Processing Systems

Book Title

Entity type

Publication

Access Statement

License Rights

DOI

Restricted until