BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization
Huanrui Yang, Lin Duan, Yiran Chen and Hai Li, Duke University
ICLR 2021
The Need for Mixed-Precision Fixed-Point Quantization
• Fewer bits mean less memory consumption and lower energy cost
• Add: 30x less energy; Mult: 18.5x less energy (8-bit fixed-point vs. 32-bit float)
• Some layers are more important than others -> mixed-precision (MP) quantization
• Key problem: finding the optimal MP quantization scheme
• MP quantization introduces a large, discrete design space, exponential in the number of layers
• Existing approaches: search with NAS, or rank layers/filters by saliency and manually decide precision
• Goal: obtain MP quantization schemes directly with a differentiable regularizer and efficiently explore the performance vs. model size tradeoff
A Bit-Level View of Quantization
• For a fixed-point quantized matrix, when can its precision be reduced?
• MSB = 0 for all elements: the precision can be reduced directly
• LSB = 0 for all elements: the precision can be reduced together with a scaling factor of 2
• The MP quantization scheme can therefore be explored by inducing structured bit-level sparsity -> we name this method BSQ
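As a concrete illustration of this bit-level view (a sketch, not the paper's exact procedure), the NumPy snippet below checks whether all elements of an unsigned fixed-point matrix share a zero MSB or a zero LSB and reduces the precision accordingly; the helper name and interface are hypothetical.

```python
import numpy as np

def reduce_precision(w_int: np.ndarray, n_bits: int, scale: float):
    """Drop all-zero MSBs/LSBs of an unsigned fixed-point matrix.

    w_int holds integers in [0, 2**n_bits - 1]; real values are w_int * scale.
    Returns the reduced (w_int, n_bits, scale). Illustrative sketch only.
    """
    while n_bits > 1:
        msb = (w_int >> (n_bits - 1)) & 1
        lsb = w_int & 1
        if not msb.any():        # MSB = 0 everywhere: just drop the top bit
            n_bits -= 1
        elif not lsb.any():      # LSB = 0 everywhere: drop it, fold 2 into the scale
            w_int = w_int >> 1
            n_bits -= 1
            scale *= 2
        else:
            break
    return w_int, n_bits, scale

# Example: a 4-bit matrix whose elements are all even and have MSB = 0
w = np.array([[2, 4], [6, 0]])
print(reduce_precision(w, n_bits=4, scale=1.0))  # -> 2-bit matrix, scale 2.0
```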
DNN Training under Bit Representation
• Bit representation conversion: each bit of a quantized weight is treated as a floating-point trainable variable in the forward/backward (FP/BP) loop
• The rounding of these bit variables is handled with a straight-through estimator (STE) formulation
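The snippet below is a minimal PyTorch-style sketch of this idea, assuming unsigned weights and omitting the sign bit and per-layer scaling details of the paper: each bit is a float trainable variable, the forward pass rounds it to {0, 1}, and the backward pass passes gradients straight through.

```python
import torch

class RoundSTE(torch.autograd.Function):
    """Round to the nearest integer; pass gradients straight through."""
    @staticmethod
    def forward(ctx, x):
        return torch.round(x)

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output  # straight-through estimator

def weight_from_bits(bits: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    """Rebuild a fixed-point weight from per-bit float variables.

    bits: (..., n_bits) float tensor trained as an ordinary parameter,
    rounded to {0, 1} in the forward pass. Sign handling omitted for brevity.
    """
    n_bits = bits.shape[-1]
    hard_bits = RoundSTE.apply(bits.clamp(0.0, 1.0))
    powers = 2.0 ** torch.arange(n_bits, dtype=bits.dtype, device=bits.device)
    # Normalize so the representable range is roughly [0, 1] before scaling
    return scale * (hard_bits * powers).sum(dim=-1) / (2 ** n_bits - 1)

# Usage: bits is a trainable parameter of shape (out, in, n_bits)
bits = torch.nn.Parameter(torch.rand(16, 32, 8))
scale = torch.tensor(0.5)
w = weight_from_bits(bits, scale)   # differentiable w.r.t. bits via the STE
```

Because gradients flow to the float bit variables, a regularizer such as the bit-level group LASSO introduced next can push whole bit positions toward zero.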
BSQ Training Pipeline
• Start with an 8-bit quantized model
• Most models keep their original accuracy under 8-bit quantization
• A bit-level group LASSO regularizer induces structured bit-level sparsity
• Overall training objective: task loss plus the weighted bit-level group LASSO penalty (see the sketch after this list)
• Apply periodic re-quantization and precision adjustment throughout the training process
• Finalize the MP scheme and finetune the quantized weights
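To make the pipeline concrete, here is a hedged sketch of the training objective and the periodic re-quantization step. The grouping (one group per bit position per layer), the hyper-parameter name alpha, and the helper functions are our own illustrative choices rather than the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def bit_group_lasso(bit_params):
    """Bit-level group LASSO: L2 norm over each bit position of each layer,
    summed across positions and layers (illustrative grouping)."""
    penalty = 0.0
    for bits in bit_params:                      # bits: (..., n_bits) float tensor
        flat = bits.reshape(-1, bits.shape[-1])  # group = all elements sharing a bit position
        penalty = penalty + flat.norm(dim=0).sum()
    return penalty

def train_epoch(model, loader, optimizer, bit_params, alpha=1e-4):
    """One epoch: task loss + alpha * bit-level group LASSO penalty."""
    for x, y in loader:
        loss = F.cross_entropy(model(x), y) + alpha * bit_group_lasso(bit_params)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

def requantize(bit_params):
    """Periodic re-quantization: snap bit variables back to {0, 1} and count the
    surviving (not-all-zero) bit positions to set each layer's new precision."""
    precisions = []
    with torch.no_grad():
        for bits in bit_params:
            bits.copy_(bits.clamp(0.0, 1.0).round())
            active = bits.reshape(-1, bits.shape[-1]).sum(dim=0) > 0
            precisions.append(int(active.sum().item()))
    return precisions
```

Once the regularizer has driven some bit positions to all-zero, re-quantization prunes them; the remaining per-layer precisions define the MP scheme, and the model is finetuned under that fixed scheme.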
Accuracy vs. #Bits Tradeoff
• BSQ achieves further compression with a larger regularization strength
• Better than training with the same quantization scheme from scratch
Comparison with SOTA Methods
• Larger compression rate under similar accuracy
More results available in our paper
Thanks
More details can be found at:
Paper: https://openreview.net/pdf?id=TiXl51SCNw8
Code: https://github.com/yanghr/BSQ
This work was supported in part by NSF CCF-1910299 and NSF CNS-1822085.