Skip navigation
Skip navigation

Adaptive Recommendations with Bandit Feedback

Zhang, Mengyan

Description

Sequential decision-making under uncertainty is a fundamental challenge in machine learning. We consider multi-armed bandits problems, where the goal is to design a policy to recommend arms (options) sequentially where in each round only the rewards sampled from the selected arms are available. In this thesis, we study both theoretical and practical aspects of sequential decision-making with bandit feedback. From the theory aspect, we summarise the rewards using quantiles for risk-averse...[Show more]

CollectionsOpen Access Theses
Date published: 2023
Type: Thesis (PhD)
URI: http://hdl.handle.net/1885/284132
DOI: 10.25911/ER4Y-K363

Download

File Description SizeFormat Image
MengyanZhang_thesis_final.pdfThesis Material11.86 MBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator