Asymptotics of discrete MDL for online prediction

dc.contributor.author: Poland, Jan
dc.contributor.author: Hutter, Marcus
dc.date.accessioned: 2015-12-10T22:40:57Z
dc.date.issued: 2005
dc.date.updated: 2016-02-24T11:44:49Z
dc.description.abstract: Minimum description length (MDL) is an important principle for induction and prediction, with strong relations to optimal Bayesian learning. This paper deals with learning non-i.i.d. processes by means of two-part MDL, where the underlying model class is countable. We consider the online learning framework, i.e., observations come in one by one, and the predictor is allowed to update its state of mind after each time step. We identify two ways of predicting by MDL for this setup, namely, a static and a dynamic one. (A third variant, hybrid MDL, will turn out inferior.) We will prove that under the only assumption that the data is generated by a distribution contained in the model class, the MDL predictions converge to the true values almost surely. This is accomplished by proving finite bounds on the quadratic, the Hellinger, and the Kullback-Leibler loss of the MDL learner, which are, however, exponentially worse than for Bayesian prediction. We demonstrate that these bounds are sharp, even for model classes containing only Bernoulli distributions. We show how these bounds imply regret bounds for arbitrary loss functions. Our results apply to a wide range of setups, namely, sequence prediction, pattern classification, regression, and universal induction in the sense of algorithmic information theory, among others.
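As an illustration of the static two-part MDL predictor the abstract describes, here is a minimal sketch in Python. It is not code from the paper: the toy Bernoulli model class, its description lengths K(nu), and the sample data are assumptions made here for illustration, whereas the paper's results hold for arbitrary countable model classes.

    import math

    # Toy countable model class of Bernoulli biases theta, each with a
    # description length K(nu) in bits (simpler biases get shorter codes).
    # These particular models and code lengths are illustrative assumptions.
    MODEL_CLASS = [(0.5, 1.0), (0.1, 3.0), (0.9, 3.0), (0.3, 5.0), (0.7, 5.0)]

    def static_mdl_predict(history):
        """Two-part MDL: pick the model minimizing K(nu) - log2 nu(history),
        then predict with that model's conditional P(next bit = 1)."""
        ones = sum(history)
        n = len(history)
        best_theta, best_total = None, float("inf")
        for theta, k in MODEL_CLASS:
            # Code length of the data under nu: -log2 nu(x_1 ... x_n).
            data_len = -(ones * math.log2(theta)
                         + (n - ones) * math.log2(1.0 - theta))
            total = k + data_len  # two-part code length
            if total < best_total:
                best_total, best_theta = total, theta
        return best_theta  # Bernoulli model: P(1 | history) = theta

    # Online prediction: observations arrive one by one, and the predictor
    # updates its MDL estimate after each time step.
    data = [1, 0, 1, 1, 1, 0, 1, 1]
    for t in range(1, len(data)):
        p = static_mdl_predict(data[:t])
        print(f"t={t}: P(x_t = 1 | x_<t) = {p:.1f}, actual x_t = {data[t]}")

The dynamic variant mentioned in the abstract would instead re-run the model selection for each candidate continuation of the sequence and normalize; the static variant above simply reuses the conditional of the single current MDL estimate.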
dc.identifier.issn: 0018-9448
dc.identifier.uri: http://hdl.handle.net/1885/57671
dc.publisher: Institute of Electrical and Electronics Engineers (IEEE Inc)
dc.rights: Copyright information: http://www.sherpa.ac.uk/romeo/issn/0018-9448/... "Author's post-print on Author's server or Institutional server" from SHERPA/RoMEO site (as at 31/08/15). © 2005 IEEE. Personal use of this material is permitted. Permission from IEEE
dc.source: IEEE Transactions on Information Theory
dc.subject: Keywords: Learning systems; Mathematical models; Pattern recognition; Probability distributions; Regression analysis; Theorem proving; Algorithmic information theory; Classification; Consistency; Discrete model class; Loss bounds; Minimum description length (MDL); Regression; Sequence prediction; Stabilization; Universal induction
dc.title: Asymptotics of discrete MDL for online prediction
dc.type: Journal article
local.bibliographicCitation.issue: 11
local.bibliographicCitation.lastpage: 3795
local.bibliographicCitation.startpage: 3780
local.contributor.affiliation: Poland, Jan, Hokkaido University
local.contributor.affiliation: Hutter, Marcus, College of Engineering and Computer Science, ANU
local.contributor.authoruid: Hutter, Marcus, u4350841
local.description.notes: Imported from ARIES
local.identifier.absfor: 080109 - Pattern Recognition and Data Mining
local.identifier.ariespublication: u8803936xPUB410
local.identifier.citationvolume: 51
local.identifier.doi: 10.1109/TIT.2005.856956
local.identifier.scopusID: 2-s2.0-27744462709
local.type.status: Published Version

Downloads

Original bundle

Name: 01_Poland_Asymptotics_of_discrete_(MDL)_2005.pdf
Size: 499.18 KB
Format: Adobe Portable Document Format