Learning how to act: making good decisions with machine learning

Lattimore, Finnian Rachel

Learning how to act: making good decisions with machine learning

dc.contributor.author	Lattimore, Finnian Rachel
dc.date.accessioned	2018-06-27T06:17:13Z
dc.date.available	2018-06-27T06:17:13Z
dc.date.issued	2017
dc.description.abstract	This thesis is about machine learning and statistical approaches to decision making. How can we learn from data to anticipate the consequence of, and optimally select, interventions or actions? Problems such as deciding which medication to prescribe to patients, who should be released on bail, and how much to charge for insurance are ubiquitous, and have far reaching impacts on our lives. There are two fundamental approaches to learning how to act: reinforcement learning, in which an agent directly intervenes in a system and learns from the outcome, and observational causal inference, whereby we seek to infer the outcome of an intervention from observing the system. The goal of this thesis to connect and unify these key approaches. I introduce causal bandit problems: a synthesis that combines causal graphical models, which were developed for observational causal inference, with multi-armed bandit problems, which are a subset of reinforcement learning problems that are simple enough to admit formal analysis. I show that knowledge of the causal structure allows us to transfer information learned about the outcome of one action to predict the outcome of an alternate action, yielding a novel form of structure between bandit arms that cannot be exploited by existing algorithms. I propose an algorithm for causal bandit problems and prove bounds on the simple regret demonstrating it is close to mini-max optimal and better than algorithms that do not use the additional causal information.	en_AU
dc.identifier.other	b53507320
dc.identifier.uri	http://hdl.handle.net/1885/144602
dc.language.iso	en	en_AU
dc.subject	machine learning	en_AU
dc.subject	causal inference	en_AU
dc.subject	causality	en_AU
dc.subject	reinforcement learning	en_AU
dc.subject	multi-armed bandits	en_AU
dc.title	Learning how to act: making good decisions with machine learning	en_AU
dc.type	Thesis (PhD)	en_AU
dcterms.valid	2018	en_AU
local.contributor.affiliation	College of Engineering & Computer Science, The Australian National University	en_AU
local.contributor.authoremail	finnlattimore@gmail.com	en_AU
local.contributor.supervisor	Ong, Cheng Soon
local.contributor.supervisorcontact	chengsoon.ong@anu.edu.au	en_AU
local.description.notes	the author deposited 27/06/2018	en_AU
local.identifier.doi	10.25911/5d67b766194ec
local.identifier.proquest	Yes
local.mintdoi	mint
local.type.degree	Doctor of Philosophy (PhD)	en_AU

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Lattimore Thesis 2018.pdf
Size:: 2.09 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 884 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Open Access Theses