Articles

Academic · 1 min

Model Merging in the Essential Subspace

arXiv:2602.20208v1 Announce Type: new Abstract: Model merging aims to integrate multiple task-specific fine-tuned models derived from a shared pre-trained checkpoint into a single multi-task model …

Longhua Li, Lei Qi, Qi Tian, Xin Geng
4 views

Academic · 1 min

Uncertainty-Aware Delivery Delay Duration Prediction via Multi-Task Deep Learning

arXiv:2602.20271v1 Announce Type: new Abstract: Accurate delivery delay prediction is critical for maintaining operational efficiency and customer satisfaction across modern supply chains. Yet the increasing …

Stefan Faulkner, Reza Zandehshahvar, Vahid Eghbal Akhlaghi, Sebastien Ouellet, Carsten Jordan, Pascal Van Hentenryck
8 views

Academic · 1 min

The Truthfulness Spectrum Hypothesis

arXiv:2602.20273v1 Announce Type: new Abstract: Large language models (LLMs) have been reported to linearly encode truthfulness, yet recent work questions this finding's generality. We reconcile …

Zhuofan Josh Ying, Shauli Ravfogel, Nikolaus Kriegeskorte, Peter Hase
4 views

Academic · 1 min

Learning to Solve Complex Problems via Dataset Decomposition

arXiv:2602.20296v1 Announce Type: new Abstract: Curriculum learning is a class of training strategies that organizes the data being exposed to a model by difficulty, gradually …

Wanru Zhao, Lucas Caccia, Zhengyan Shi, Minseon Kim, Weijia Xu, Alessandro Sordoni
4 views