Sparse adaptive dirichlet-multinomial-like processes

Hutter, Marcus

Sparse adaptive dirichlet-multinomial-like processes

Date

2013

Authors

Hutter, Marcus

Abstract

Online estimation and modelling of i.i.d. data for short sequences over large or complex "alphabets" is a ubiquitous (sub)problem in machine learning, information theory, data compression, statistical language processing, and document analysis. The Dirichlet-Multinomial distribution (also called Polya urn scheme) and extensions there of are widely applied for online i.i.d. estimation. Good a-priori choices for the parameters in this regime are difficult to obtain though. I derive an optimal adaptive choice for the main parameter via tight, data-dependent redundancy bounds for a related model. The 1-line recommendation is to set the 'total mass' = 'precision' = 'concentration' parameter to m/[2 ln n+1/m], where n is the (past) sample size and m the number of different symbols observed (so far). The resulting estimator is simple, online, fast, and experimental performance is superb.

Keywords

Adaptive parameters, Data compression, Data-dependent redundancy bound, Dirichlet-Multinomial, Polya urn, Small/large alphabet, Sparse coding

URI

https://hdl.handle.net/1885/733797761

Collections

ANU Research Publications

Source

Journal of Machine Learning Research

Type

Conference paper

Entity type

Publication

Full item page

Sparse adaptive dirichlet-multinomial-like processes

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Access Statement

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until