Skip to main content

Academic

Academic

Academic · 1 min

Decoder-based Sense Knowledge Distillation

arXiv:2602.22351v1 Announce Type: new Abstract: Large language models (LLMs) learn contextual embeddings that capture rich semantic information, yet they often overlook structured lexical knowledge such …

Qitong Wang, Mohammed J. Zaki, Georgios Kollias, Vasileios Kalantzis
4 views
Academic · 1 min

Causality $\neq$ Invariance: Function and Concept Vectors in LLMs

arXiv:2602.22424v1 Announce Type: new Abstract: Do large language models (LLMs) represent concepts abstractly, i.e., independent of input format? We revisit Function Vectors (FVs), compact representations …

Gustaw Opie{\l}ka, Hannes Rosenbusch, Claire E. Stevenson
4 views
Academic · 1 min

Ruyi2 Technical Report

arXiv:2602.22543v1 Announce Type: new Abstract: Large Language Models (LLMs) face significant challenges regarding deployment costs and latency, necessitating adaptive computing strategies. Building upon the AI …

Huan Song, Shuyu Tian, Junyi Hao, Minxiu Xu, Hongjun An, Yiliang Song, Jiawei Shao, Xuelong Li
4 views