Skip to main content

Academic

Academic

Academic · 1 min

Search-P1: Path-Centric Reward Shaping for Stable and Efficient Agentic RAG Training

arXiv:2602.22576v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by incorporating external knowledge, yet traditional single-round retrieval struggles with complex multi-step …

Tianle Xia, Ming Xu, Lingxiang Hu, Yiding Sun, Wenwei Li, Linfang Shang, Liqun Liu, Peng Shu, Huan Yu, Jie Jiang
5 views
Academic · 1 min

dLLM: Simple Diffusion Language Modeling

arXiv:2602.22661v1 Announce Type: new Abstract: Although diffusion language models (DLMs) are evolving quickly, many recent models converge on a set of shared components. These components, …

Zhanhui Zhou, Lingjie Chen, Hanghang Tong, Dawn Song
4 views