Non-uniform stochastic average gradient method for training conditional random fields

Date

Authors

Schmidt, Mark
Babanezhad, Reza
Ahemd, M. Osama
Defazio, Aaron
Clifton, Ann
Sarkar, Anoop

Journal Title

Journal ISSN

Volume Title

Publisher

Access Statement

Research Projects

Organizational Units

Journal Issue

Abstract

We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical im-plementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradi-ent method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of con-vergence of the SAGA variant under non-uniform sampling. Our experimental results reveal that our method significantly outper-forms existing methods in terms of the training objective, and performs as well or bet-ter than optimally-tuned stochastic gradient methods in terms of test error.

Description

Keywords

Citation

Source

Journal of Machine Learning Research

Book Title

Entity type

Publication

Access Statement

License Rights

DOI

Restricted until