Mixability is Bayes risk curvature relative to log loss




Van Erven, Tim
Reid, Mark
Williamson, Robert


MIT Press


Mixability of a loss characterizes fast rates in the online learning setting of prediction with expert advice. The determination of the mixability constant for binary losses is straightforward but opaque. In the binary case we make this transparent and simpler by characterising mixability in terms of the second derivative of the Bayes risk of proper losses. We then extend this result to multiclass proper losses where there are few existing results. We show that mixability is governed by the maximum eigenvalue of the Hessian of the Bayes risk, relative to the Hessian of the Bayes risk for log loss. We conclude by comparing our result to other work that bounds prediction performance in terms of the geometry of the Bayes risk. Although all calculations are for proper losses, we also show how to carry the results across to improper losses.
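
As an illustration of the binary-case statement in the abstract, the following minimal Python sketch estimates a mixability constant as the smallest ratio, over a grid of probabilities, between the second derivative of the log-loss Bayes risk and the second derivative of the Bayes risk of the loss of interest. The function names, the grid, and the choice of the Brier (squared) loss as the example are assumptions made here for illustration and are not taken from the article; the exact form of the multiclass result should be checked against the article itself.

```python
import numpy as np


def log_bayes_risk_dd(p):
    """Second derivative of the log-loss Bayes risk
    L_log(p) = -p ln p - (1 - p) ln(1 - p), which is -1 / (p (1 - p))."""
    return -1.0 / (p * (1.0 - p))


def brier_bayes_risk_dd(p):
    """Second derivative of the Bayes risk L(p) = p (1 - p) of the squared
    loss (y - p)^2 with y in {0, 1}; it is the constant -2."""
    return np.full_like(p, -2.0)


def mixability_constant(bayes_risk_dd, grid):
    """Grid estimate of min_p L_log''(p) / L''(p) (illustrative helper,
    assuming the binary curvature-ratio characterisation)."""
    ratios = log_bayes_risk_dd(grid) / bayes_risk_dd(grid)
    return float(ratios.min())


# Interior grid on (0, 1); the endpoints are excluded because the
# log-loss curvature diverges there.
grid = np.linspace(0.001, 0.999, 999)

eta_brier = mixability_constant(brier_bayes_risk_dd, grid)
eta_log = mixability_constant(log_bayes_risk_dd, grid)

print(f"Brier loss: eta ~ {eta_brier:.4f}")
print(f"Log loss:   eta ~ {eta_log:.4f}")
```

Under these assumptions the grid minimum for the Brier loss is attained at p = 1/2 and is close to 2, while log loss compared with itself gives 1, in line with the mixability constants commonly quoted for these two losses.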



Keywords: Bayes risk; Eigenvalues and eigenfunctions; Fast rate; Forecasting; Learning rates; Mixability; Multiclass; Online learning; Prediction performance; Prediction with expert advice; Proper loss; Second derivatives



Journal of Machine Learning Research


Journal article

Access Statement

Open Access
