Defensive universal learning with experts
Loading...
Date
Authors
Poland, Jan
Hutter, Marcus
Journal Title
Journal ISSN
Volume Title
Publisher
Springer
Abstract
This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. Prom this, we obtain a master algorithm for "reactive" experts problems, which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs - in a certain sense - almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.
Description
Citation
Collections
Source
Algorithmic Learning Theory: Proceedings of the 16th International Conference on Algorithmic Learning Theory (ALT-05) - LNAI 3734
Type
Book Title
Entity type
Access Statement
License Rights
Restricted until
2037-12-31
Downloads
File
Description