Decisions, Learning and Games: You've Got To Have Freedom.

Della Penna, Nicolas

Decisions, Learning and Games: You've Got To Have Freedom.

dc.contributor.author	Della Penna, Nicolas
dc.date.accessioned	2022-03-19T01:27:08Z
dc.date.available	2022-03-19T01:27:08Z
dc.date.issued	2022
dc.description.abstract	Maintaining a subject's freedom to decide imposes structure and constraints on learning systems that aim to guide those decisions. Two natural sources from which subjects can learn to make good decisions are past experiences and advice from others. Both are affected by the subject's freedom to ultimately act as they wish, giving rise to learning theoretic and game theoretic repercussions respectively. To study the effect of past experiences, we extend the standard bandit setting: after the algorithm chooses an action, the subject may actually carry out a different action. This is then observed along with the reward. Algorithms whose choice of action is mediated by the subject can gain from awareness of the subject's actual actions, which we term compliance awareness. We present algorithms that take advantage of compliance awareness, while maintaining worst case regret bounds up to multiplicative constants. We study their empirical finite sample performance on synthetic data and simulations using real data from clinical trials. To study the effect of advice of others, we consider the literature on incentives for multiple experts by a decision maker that will take an action and receive a reward about which the experts may have information. Existing mechanisms for multiple experts are known not to be truthful, even in the limited sense of myopic incentive compatibility, unless the decision maker renounces their ability to always take on the best ex-post action and commits to a randomized strategy with full support. We present a new class of mechanisms based on second price auctions that maintain the subject's freedom. Experts submit their private information, and the algorithm auctions off the rights to a share of the reward of the subject, who then has freedom to pick the action they desire after observing the submitted information. We show several situations in which existing mechanisms fail and this one succeeds. We also consider strategic limitations of this mechanism beyond the myopic setting that arise due to complementary information between experts, and practical considerations in its implementation in real institutions. We conclude by considering a natural hybrid setting, where a sequence of subjects make decisions and each can receive advice from a fixed set of experts that the mechanism seeks to incentivize. The model for this setting is extremely general, having as special cases standard, compliance aware and contextual bandits, as well as decision markets. We present a novel practical market structure for this setting that incentivizes exploration, information revelation, and aggregation with selfish experts.
dc.identifier.uri	http://hdl.handle.net/1885/262297
dc.language.iso	en_AU
dc.title	Decisions, Learning and Games: You've Got To Have Freedom.
dc.type	Thesis (PhD)
local.contributor.supervisor	Gould, Stephen
local.identifier.doi	10.25911/YRW3-HM53
local.identifier.proquest	Yes
local.mintdoi	mint
local.thesisANUonly.author	5143e348-83de-4a69-9a76-d65bc933db80
local.thesisANUonly.key	0d10ce4f-4b89-bff6-04fc-62cd8a3ada68
local.thesisANUonly.title	000000013583_TC_2

Downloads

Original bundle

Now showing 1 - 1 of 1

Name:: Della Penna Thesis 2022.pdf
Size:: 15.33 MB
Format:: Adobe Portable Document Format
Description:: Thesis Material

Download

Collections

Open Access Theses

Cultural advice

Decisions, Learning and Games: You've Got To Have Freedom.

Downloads

Original bundle

Collections