Model Quantization Papers
- Seminal paper: Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
- Surveys:
  - Quantizing deep convolutional networks for efficient inference: A whitepaper
  - A White Paper on Neural Network Quantization
- Getting started:
  - ZeroQ: A Novel Zero Shot Quantization Framework
  - HAWQ-V3: Dyadic Neural Network Quantization
  - Up or Down? Adaptive Rounding for Post-Training Quantization

| Title | Class |
|---|---|
| ZeroQ | DFQ |
| SQuant | DFQ |
| ACIQ | PTQ |
| GDFQ | DFQ |