Academic

Academic · 1 min

Variational LSTM with Augmented Inputs: Nonlinear Response History Metamodeling with Aleatoric and Epistemic Uncertainty

arXiv:2604.01587v1 Announce Type: new Abstract: Uncertainty propagation in high-dimensional nonlinear dynamic structural systems is pivotal in state-of-the-art performance-based design and risk assessment, where uncertainties from …

Manisha Sapkota, Min Li, Bowei Li

16 views Apr 3

Academic · 1 min

Thinking While Listening: Fast-Slow Recurrence for Long-Horizon Sequential Modeling

arXiv:2604.01577v1 Announce Type: new Abstract: We extend the recent latent recurrent modeling to sequential input streams. By interleaving fast, recurrent latent updates with self-organizational ability …

Shota Takashiro, Masanori Koyama, Takeru Miyato, Yusuke Iwasawa, Yutaka Matsuo, Kohei Hayashi

12 views Apr 3

Academic · 1 min

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

arXiv:2604.01576v1 Announce Type: new Abstract: Large language models deployed in supportive or advisory roles must balance helpfulness with preservation of user autonomy, yet standard alignment …

Shalima Binta Manir, Tim Oates

14 views Apr 3

Academic · 1 min

ZEUS: Accelerating Diffusion Models with Only Second-Order Predictor

arXiv:2604.01552v1 Announce Type: new Abstract: Denoising generative models deliver high-fidelity generation but remain bottlenecked by inference latency due to the many iterative denoiser calls required …

Yixiao Wang, Ting Jiang, Zishan Shao, Hancheng Ye, Jingwei Sun, Mingyuan Ma, Jianyi Zhang, Yiran Chen, Hai Li

23 views Apr 3

Academic · 1 min

Learning ECG Image Representations via Dual Physiological-Aware Alignments

arXiv:2604.01526v1 Announce Type: new Abstract: Electrocardiograms (ECGs) are among the most widely used diagnostic tools for cardiovascular diseases, and a large amount of ECG data …

Hung Manh Pham, Jialu Tang, Aaqib Saeed, Dong Ma, Bin Zhu, Pan Zhou

26 views Apr 3

Academic · 1 min

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

arXiv:2604.01506v1 Announce Type: new Abstract: Long-tailed classification, where a small number of frequent classes dominate many rare ones, remains challenging because models systematically favor frequent …

Zhanliang Wang, Hongzhuo Chen, Quan Minh Nguyen, Mian Umair Ahsan, Kai Wang

34 views Apr 3

Academic · 1 min

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

arXiv:2604.01499v1 Announce Type: new Abstract: Evolution Strategies (ES) have emerged as a scalable gradient-free alternative to reinforcement learning based LLM fine-tuning, but it remains unclear …

William Hoy, Binxu Wang, Xu Pan

13 views Apr 3

Academic · 1 min

CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe

arXiv:2604.01489v1 Announce Type: new Abstract: High-performance GPU kernels are critical to modern machine learning systems, yet developing efficient implementations remains a challenging, expert-driven process due …

Tara Saba, Anne Ouyang, Xujie Si, Fan Long

42 views Apr 3

Academic · 1 min

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

arXiv:2604.01481v1 Announce Type: new Abstract: The development of robust clinical decision support systems is frequently impeded by the scarcity of high-fidelity, privacy-preserving biomedical data. While …

Arshia Ilaty, Hossein Shirazi, Amir Rahmani, Hajar Homayouni

12 views Apr 3

Academic · 1 min

Soft MPCritic: Amortized Model Predictive Value Iteration

arXiv:2604.01477v1 Announce Type: new Abstract: Reinforcement learning (RL) and model predictive control (MPC) offer complementary strengths, yet combining them at scale remains computationally challenging. We …

Thomas Banker, Nathan P. Lawrence, Ali Mesbah

28 views Apr 3

Academic · 1 min

When Reward Hacking Rebounds: Understanding and Mitigating It with Representation-Level Signals

arXiv:2604.01476v1 Announce Type: new Abstract: Reinforcement learning for LLMs is vulnerable to reward hacking, where models exploit shortcuts to maximize reward without solving the intended …

Rui Wu, Ruixiang Tang

11 views Apr 3

Academic · 1 min

Improving Latent Generalization Using Test-time Compute

arXiv:2604.01430v1 Announce Type: new Abstract: Language Models (LMs) exhibit two distinct mechanisms for knowledge acquisition: in-weights learning (i.e., encoding information within the model weights) and …

Arslan Chaudhry, Sridhar Thiagarajan, Andrew Lampinen

14 views Apr 3

Variational LSTM with Augmented Inputs: Nonlinear Response History Metamodeling with Aleatoric and Epistemic Uncertainty

Thinking While Listening: Fast-Slow Recurrence for Long-Horizon Sequential Modeling

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

ZEUS: Accelerating Diffusion Models with Only Second-Order Predictor

Learning ECG Image Representations via Dual Physiological-Aware Alignments

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

Soft MPCritic: Amortized Model Predictive Value Iteration

When Reward Hacking Rebounds: Understanding and Mitigating It with Representation-Level Signals

Improving Latent Generalization Using Test-time Compute

JCG, PC

HSOLLC Co., Ltd.