Proximal mean-field for neural network quantization

Ajanthan, Thalaiyasingam; Dokania, Puneet K.; Hartley, Richard; Torr, Philip H.S.

Proximal mean-field for neural network quantization

Date

2019

Authors

Ajanthan, Thalaiyasingam

Dokania, Puneet K.

Hartley, Richard

Torr, Philip H.S.

Publisher

IEEE, Institute of Electrical and Electronics Engineers

Abstract

Compressing large Neural Networks (NN) by quantizing the parameters, while maintaining the performance is highly desirable due to reduced memory and time complexity. In this work, we cast NN quantization as a discrete labelling problem, and by examining relaxations, we design an efficient iterative optimization procedure that involves stochastic gradient descent followed by a projection. We prove that our simple projected gradient descent approach is, in fact, equivalent to a proximal version of the well-known mean-field method. These findings would allow the decades-old and theoretically grounded research on MRF optimization to be used to design better network quantization schemes. Our experiments on standard classification datasets (MNIST, CIFAR10/100, TinyImageNet) with convolutional and residual architectures show that our algorithm obtains fully-quantized networks with accuracies very close to the floating-point reference networks.

URI

http://hdl.handle.net/1885/294127

Collections

ANU Research Publications

Source

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV 2019)

Type

Conference paper

DOI

10.1109/ICCV.2019.00497

Restricted until

2099-12-31

Downloads

File

Description

Proximal_Mean-Field_for_Neural_Network_Quantization.pdf (660.77 KB)

Full item page

Cultural advice

Proximal mean-field for neural network quantization

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads