Academic

Academic

Academic · 1 min

Heterogeneous Decentralized Diffusion Models

arXiv:2603.06741v1 Announce Type: new Abstract: Training frontier-scale diffusion models often requires substantial computational resources concentrated in tightly coupled clusters, limiting participation to well-resourced institutions. While …

Zhiying Jiang, Raihan Seraj, Marcos Villagra, Bidhan Roy
24 views
Academic · 1 min

Stabilizing Reinforcement Learning for Diffusion Language Models

arXiv:2603.06743v1 Announce Type: new Abstract: Group Relative Policy Optimization (GRPO) is highly effective for post-training autoregressive (AR) language models, yet its direct application to diffusion …

Jianyuan Zhong, Kaibo Wang, Ding Ding, Zijin Feng, Haoli Bai, Yang Xiang, Jiacheng Sun, Qiang Xu
14 views
Academic · 1 min

On the Value of Tokeniser Pretraining in Physics Foundation Models

arXiv:2603.05598v1 Announce Type: cross Abstract: We investigate the impact of tokeniser pretraining on the accuracy and efficiency of physics emulation. Modern high-resolution simulations produce vast …

Hadi Sotoudeh, Payel Mukhopadhyay, Ruben Ohana, Michael McCabe, Neil D. Lawrence, Shirley Ho, Miles Cranmer
54 views