Learning how to act: making good decisions with machine learning

dc.contributor.authorLattimore, Finnian Rachel
dc.date.accessioned2018-06-27T06:17:13Z
dc.date.available2018-06-27T06:17:13Z
dc.date.issued2017
dc.description.abstractThis thesis is about machine learning and statistical approaches to decision making. How can we learn from data to anticipate the consequence of, and optimally select, interventions or actions? Problems such as deciding which medication to prescribe to patients, who should be released on bail, and how much to charge for insurance are ubiquitous, and have far reaching impacts on our lives. There are two fundamental approaches to learning how to act: reinforcement learning, in which an agent directly intervenes in a system and learns from the outcome, and observational causal inference, whereby we seek to infer the outcome of an intervention from observing the system. The goal of this thesis to connect and unify these key approaches. I introduce causal bandit problems: a synthesis that combines causal graphical models, which were developed for observational causal inference, with multi-armed bandit problems, which are a subset of reinforcement learning problems that are simple enough to admit formal analysis. I show that knowledge of the causal structure allows us to transfer information learned about the outcome of one action to predict the outcome of an alternate action, yielding a novel form of structure between bandit arms that cannot be exploited by existing algorithms. I propose an algorithm for causal bandit problems and prove bounds on the simple regret demonstrating it is close to mini-max optimal and better than algorithms that do not use the additional causal information.en_AU
dc.identifier.otherb53507320
dc.identifier.urihttp://hdl.handle.net/1885/144602
dc.language.isoenen_AU
dc.subjectmachine learningen_AU
dc.subjectcausal inferenceen_AU
dc.subjectcausalityen_AU
dc.subjectreinforcement learningen_AU
dc.subjectmulti-armed banditsen_AU
dc.titleLearning how to act: making good decisions with machine learningen_AU
dc.typeThesis (PhD)en_AU
dcterms.valid2018en_AU
local.contributor.affiliationCollege of Engineering & Computer Science, The Australian National Universityen_AU
local.contributor.authoremailfinnlattimore@gmail.comen_AU
local.contributor.supervisorOng, Cheng Soon
local.contributor.supervisorcontactchengsoon.ong@anu.edu.auen_AU
local.description.notesthe author deposited 27/06/2018en_AU
local.identifier.doi10.25911/5d67b766194ec
local.identifier.proquestYes
local.mintdoimint
local.type.degreeDoctor of Philosophy (PhD)en_AU

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Lattimore Thesis 2018.pdf
Size:
2.09 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
884 B
Format:
Item-specific license agreed upon to submission
Description: