Skip to main content

All Articles

Articles

Academic · 1 min

GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

arXiv:2602.20399v1 Announce Type: new Abstract: Neural simulators promise efficient surrogates for physics simulation, but scaling them is bottlenecked by the prohibitive cost of generating high-fidelity …

Haixu Wu, Minghao Guo, Zongyi Li, Zhiyang Dou, Mingsheng Long, Kaiming He, Wojciech Matusik
4 views
Academic · 1 min

Wasserstein Distributionally Robust Online Learning

arXiv:2602.20403v1 Announce Type: new Abstract: We study distributionally robust online learning, where a risk-averse learner updates decisions sequentially to guard against worst-case distributions drawn from …

Guixian Chen, Salar Fattahi, Soroosh Shafiee
8 views
Academic · 1 min

Imputation of Unknown Missingness in Sparse Electronic Health Records

arXiv:2602.20442v1 Announce Type: new Abstract: Machine learning holds great promise for advancing the field of medicine, with electronic health records (EHRs) serving as a primary …

Jun Han, Josue Nassar, Sanjit Singh Batra, Aldo Cordova-Palomera, Vijay Nori, Robert E. Tillman
3 views
Academic · 1 min

Oracle-Robust Online Alignment for Large Language Models

arXiv:2602.20457v1 Announce Type: new Abstract: We study online alignment of large language models under misspecified preference feedback, where the observed preference oracle deviates from an …

Zimeng Li, Mudit Gaur, Vaneet Aggarwal
4 views
Academic · 1 min

Nonparametric Teaching of Attention Learners

arXiv:2602.20461v1 Announce Type: new Abstract: Attention learners, neural networks built on the attention mechanism, e.g., transformers, excel at learning the implicit relationships that relate sequences …

Chen Zhang, Jianghui Wang, Bingyang Cheng, Zhongtao Chen, Wendong XU, Cong Wang, Marco Canini, Francesco Orabona, Yik Chung WU, Ngai Wong
3 views
Academic · 1 min

A Long-Short Flow-Map Perspective for Drifting Models

arXiv:2602.20463v1 Announce Type: new Abstract: This paper provides a reinterpretation of the Drifting Model~\cite{deng2026generative} through a semigroup-consistent long-short flow-map factorization. We show that a global …

Zhiqi Li, Bo Zhu
7 views