Approach
First, we introduce an efficient test-phase computation process that operates directly on quantized network parameters. Second, we show that a better quantization can be learned by directly minimizing the estimation error of each layer's response.
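To make the test-phase idea concrete, here is a minimal sketch for a fully connected layer, assuming a product-quantization scheme: each weight vector is split into M sub-vectors, each subspace gets a codebook of K sub-codewords learned with plain k-means, and at test time the layer response is approximated by summing entries of per-subspace lookup tables. The function names (`nearest`, `kmeans`, `quantize_fc`, `fc_forward_quantized`), the toy sizes, and the use of plain Lloyd's k-means are illustrative assumptions, not the paper's code.

```python
import random


def nearest(p, centers):
    """Index of the center closest to p (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centers, assignment per point)."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        assign = [nearest(p, centers) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster goes empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return centers, [nearest(p, centers) for p in points]


def quantize_fc(W, M, K):
    """W holds one weight vector per output unit; each is split into M sub-vectors."""
    d = len(W[0]) // M  # assumes the input dimension is divisible by M
    codebooks, codes = [], []
    for m in range(M):
        subs = [w[m * d:(m + 1) * d] for w in W]
        centers, assign = kmeans(subs, K, seed=m)
        codebooks.append(centers)  # K sub-codewords for subspace m
        codes.append(assign)       # one codeword index per output unit
    return codebooks, codes


def fc_forward_quantized(x, codebooks, codes):
    """Approximate the layer response using one lookup table per subspace."""
    M = len(codebooks)
    d = len(x) // M
    out = [0.0] * len(codes[0])
    for m in range(M):
        xs = x[m * d:(m + 1) * d]
        # K inner products per subspace, shared by every output unit.
        table = [sum(a * b for a, b in zip(xs, cw)) for cw in codebooks[m]]
        for t, k in enumerate(codes[m]):
            out[t] += table[k]
    return out


# Toy example (hypothetical sizes): 16 output units, 8 inputs, M=4, K=4.
rng = random.Random(0)
W = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(16)]
x = [rng.gauss(0, 1) for _ in range(8)]
codebooks, codes = quantize_fc(W, M=4, K=4)
exact = [sum(wi * xi for wi, xi in zip(w, x)) for w in W]
approx = fc_forward_quantized(x, codebooks, codes)
```

Comparing `exact` and `approx` gives a feel for the approximation error. The saving comes from the lookup tables: each subspace needs only K inner products regardless of the number of output units, and only the codebooks and the per-unit indices have to be stored.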
- Quantizing the Fully Connected Layer

- Quantization with Error Correction
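
As a rough guide to how the two items above differ, the objectives can be written as follows. The notation is my reading of the cited paper and should be checked against it: $W^{(m)}$ is the m-th sub-matrix of the layer weights, $D^{(m)}$ the sub-codebook, $B^{(m)}$ the assignment matrix (assumed to have exactly one nonzero entry, equal to 1, per column), $S_n^{(m)}$ the m-th sub-vector of the layer input for the n-th training image, and $T_n$ the corresponding layer response.

```latex
% Plain quantization: approximate the weights themselves, one subspace at a time
\min_{D^{(m)},\, B^{(m)}} \; \bigl\| D^{(m)} B^{(m)} - W^{(m)} \bigr\|_F^2

% Error correction: approximate the layer response instead of the weights
\min_{\{D^{(m)}\},\, \{B^{(m)}\}} \; \sum_{n} \Bigl\| T_n - \sum_{m} \bigl( D^{(m)} B^{(m)} \bigr)^{T} S_n^{(m)} \Bigr\|_F^2
```

The first objective only asks the quantized weights to be close to the original ones. The second takes the actual layer inputs into account, which is what "directly minimizing the estimation error of each layer's response" refers to.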


Experiment

References:
Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng. Quantized Convolutional Neural Networks for Mobile Devices. CVPR, 2016.