Defensive universal learning with experts

Poland, Jan; Hutter, Marcus

doi:10.1007/11564089_28

New Open Research Repository launching soon. This site will be offline from 5 to 9pm, Monday 20 May, while the new site goes live!

Defensive universal learning with experts

Download (249.15 kB)

link to publisher version

Altmetric Citations

Poland, Jan; Hutter, Marcus

Description

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. From this, we obtain a master algorithm for “reactive” experts problems, which...[Show more] means that the master’s actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs – in a certain sense – almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.

Collections	ANU Research Publications
Date published:	2005
Type:	Conference paper
URI:	http://hdl.handle.net/1885/15047
Book Title:	Algorithmic Learning Theory: 16th International Conference, ALT 2005, Singapore, October 8-11, 2005. Proceedings
DOI:	10.1007/11564089_28
Access Rights:	Open Access

Download

File	Description	Size	Format	Image
Poland and Hutter Defensive Universal Learning 2005.pdf		249.15 kB	Adobe PDF

Show full item record

Defensive universal learning with experts

Altmetric Citations

Description

Download