Model Quantization Papers

  • 鼻祖:Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
  • 综述:
    1. Quantizing deep convolutional networks for efficient inference: A whitepaper
    2. A White Paper on Neural Network Quantization
  • 上手:
    1. ZeroQ: A Novel Zero Shot Quantization Framework
    2. HAWQ-V3: Dyadic Neural Network Quantization
    3. Up or Down? Adaptive Rounding for Post-Training Quantization

Title Class
ZeroQ DFQ
SQuant DFQ
ACIQ PTQ
GDFQ DFQ