COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression
arXiv:2602.15200v1 Announce Type: new

Abstract: Post-training compression of Transformer models commonly relies on truncated singular value decomposition (SVD). However, enforcing a single shared subspace can …
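For readers unfamiliar with the baseline the abstract mentions, the following is a minimal sketch of truncated-SVD compression of a single weight matrix. It illustrates only the generic low-rank baseline, not the COMPOT method itself. The function name `truncated_svd_compress`, the matrix shape, and the chosen rank are assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def truncated_svd_compress(W, rank):
    """Factor W (m x n) into A (m x rank) and B (rank x n).

    A @ B is the best rank-`rank` approximation of W in the Frobenius
    norm (Eckart-Young theorem). Hypothetical helper for illustration.
    """
    # Thin SVD: U is (m, k), s is (k,), Vt is (k, n) with k = min(m, n).
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    # Keep the top `rank` singular triplets; fold singular values into A.
    A = U[:, :rank] * s[:rank]
    B = Vt[:rank, :]
    return A, B


# Illustrative random "weight matrix"; shape and rank are arbitrary choices.
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 48))
A, B = truncated_svd_compress(W, rank=8)

params_before = W.size            # parameters stored for the dense matrix
params_after = A.size + B.size    # parameters stored for the two factors
rel_error = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(params_before, params_after, rel_error)
```

Storing the two factors instead of the dense matrix reduces the parameter count whenever `rank * (m + n) < m * n`. The price is the approximation error, which equals the energy in the discarded singular values.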