Distributed Interpretability and Control for Large Language Models
arXiv:2604.06483v1 Announce Type: new Abstract: Large language models that require multiple GPU cards to host are usually the most capable models. It is necessary to …
Dev Arpan Desai, Shaoyi Huang, Zining Zhu
11 views