This platform requires JavaScript for full functionality. Please enable JavaScript in your browser settings.

Quality follows upgrading

Longsheng Zhou, Yu Shen

Articles by Longsheng Zhou, Yu Shen

Academic · 1 min

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

arXiv:2604.04988v1 Announce Type: new Abstract: Modern deployment often requires trading accuracy for efficiency under tight CPU and memory constraints, yet common compression proxies such as …

32 views Apr 8

Longsheng Zhou, Yu Shen

Articles by Longsheng Zhou, Yu Shen

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

JCG, PC

HSOLLC Co., Ltd.