Master algorithms for active experts problems based on increasing loss values

Poland, Jan; Hutter, Marcus

Master algorithms for active experts problems based on increasing loss values

Authors

Poland, Jan

Hutter, Marcus

Publisher

Belgian-Dutch Conference on Machine Learning (Benelearn)

Abstract

We specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. From this, we obtain master algorithms for ``active experts problems'', which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. This results in a (computationally infeasible) universal master algorithm which performs - in a certain sense - almost as well as any computable strategy, for any online problem.

Keywords

Prediction with expert advice, responsive environments, partial observation game, universal learning, asymptotic optimality

URI

http://hdl.handle.net/1885/15052

Collections

ANU Research Publications

Type

Conference paper

Book Title

Proceedings of the 14th Dutch-Belgium Conference on Machine Learning Benelearn'05

Downloads

File

Description

Poland and Hutter Master Algorithms for Active Experts Problems 2005.pdf (219.37 KB)

Full item page

Master algorithms for active experts problems based on increasing loss values

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads