ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning
arXiv:2603.05878v1 Announce Type: new Abstract: Pruning is widely recognized as an effective method for reducing the parameters of large language models (LLMs), potentially leading to …
Mingluo Su, Huan Wang
14 views