Causal Bandits: Learning Good Interventions via Causal Inference

Lattimore, Finnian Rachel; Lattimore, Tor; Reid, Mark

Causal Bandits: Learning Good Interventions via Causal Inference

Date

2016

Authors

Lattimore, Finnian Rachel

Lattimore, Tor

Reid, Mark

Publisher

Neural Information Processing Systems Foundation

Abstract

We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit feedback that is not exploited by existing approaches. We propose a new algorithm that exploits the causal feedback and prove a bound on its simple regret that is strictly better (in all quantities) than algorithms that do not use the additional causal information

URI

http://hdl.handle.net/1885/186528

Collections

ANU Research Publications

Source

Advances in Neural Information Processing Systems 29: 30th Annual Conference on Neural Information Processing Systems 2016

Type

Conference paper

Restricted until

2037-12-31

Downloads

File

Description

01_Lattimore_Causal_Bandits%3A_Learning_Good_2016.pdf (1.47 MB)

Full item page

Causal Bandits: Learning Good Interventions via Causal Inference

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads