Academic

Academic · 1 min

LLMs Should Express Uncertainty Explicitly

arXiv:2604.05306v1 Announce Type: new Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstention, retrieval, and verification. Most …

Junyu Guo, Shangding Gu, Ming Jin, Costas Spanos, Javad Lavaei

15 views Apr 8

Academic · 1 min

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

arXiv:2604.05303v1 Announce Type: new Abstract: Sampling physical systems with rough energy landscapes is hindered by rare events and metastable trapping. While Boltzmann generators already offer …

Guang Lin, Christian Moya, Di Qi, Xuda Ye

20 views Apr 8

Academic · 1 min

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

arXiv:2604.05257v1 Announce Type: new Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-preserving augmentation. Tabular Denoising Diffusion …

Umang Dobhal, Christina Garcia, Sozo Inoue

3 views Apr 8

Academic · 1 min

DualDiffusion: A Speculative Decoding Strategy for Masked Diffusion Models

arXiv:2604.05250v1 Announce Type: new Abstract: Masked Diffusion Models (MDMs) offer a promising alternative to autoregressive language models by enabling parallel token generation and bidirectional context …

Satyam Goyal, Kushal Patel, Tanush Mittal, Arjun Laxman

3 views Apr 8

Academic · 1 min

Improving Sparse Memory Finetuning

arXiv:2604.05248v1 Announce Type: new Abstract: Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading …

Satyam Goyal, Anirudh Kanchi, Garv Shah, Prakhar Gupta

5 views Apr 8

Academic · 1 min

Curvature-Aware Optimization for High-Accuracy Physics-Informed Neural Networks

arXiv:2604.05230v1 Announce Type: new Abstract: Efficient and robust optimization is essential for neural networks, enabling scientific machine learning models to converge rapidly to very high …

Anas Jnini, Elham Kiyani, Khemraj Shukla, Jorge F. Urban, Nazanin Ahmadi Daryakenari, Johannes Muller, Marius Zeinhofer, George Em Karniadakis

27 views Apr 8

Academic · 1 min

On the Geometry of Positional Encodings in Transformers

arXiv:2604.05217v1 Announce Type: new Abstract: Neural language models process sequences of words, but the mathematical operations inside them are insensitive to the order in which …

Giansalvo Cirrincione

16 views Apr 8

Academic · 1 min

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and …

Shihong Huang, Shengjie Wang, Lei Gao, Hong Ma, Zhanluo Zhang, Feng Zhang, Weihua Zhou

6 views Apr 8

Academic · 1 min

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter …

arXiv:2604.05187v1 Announce Type: new Abstract: We propose an extended Fourier neural operator (FNO) architecture for learning state and linear quadratic additive optimal control of systems …

Zhexian Li, Ketan Savla

16 views Apr 8

Academic · 1 min

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

arXiv:2604.05185v1 Announce Type: new Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning …

Nishanth Venkatesh, Andreas A. Malikopoulos

16 views Apr 8

Academic · 1 min

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

arXiv:2604.05164v1 Announce Type: new Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overthinking and long thinking traces even for …

Neharika Jali, Anupam Nayak, Gauri Joshi

9 views Apr 8

Academic · 1 min

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning …

Lucas Dionisopoulos, Nicklas Majamaki, Prithviraj Ammanabrolu

3 views Apr 8

LLMs Should Express Uncertainty Explicitly

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

DualDiffusion: A Speculative Decoding Strategy for Masked Diffusion Models

Improving Sparse Memory Finetuning

Curvature-Aware Optimization for High-Accuracy Physics-Informed Neural Networks

On the Geometry of Positional Encodings in Transformers

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter …

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

JCG, PC

HSOLLC Co., Ltd.