Academic

Academic · 1 min

Align Your Structures: Generating Trajectories with Structure Pretraining for Molecular Dynamics

arXiv:2604.03911v1 · Announce Type: new · Abstract: Generating molecular dynamics (MD) trajectories using deep generative models has attracted increasing attention, yet remains inherently challenging due to the …

Aniketh Iyengar, Jiaqi Han, Pengwei Sun, Mingjian Jiang, Jianwen Xie, Stefano Ermon

12 views Apr 7

Academic · 1 min

Improving Model Performance by Adapting the KGE Metric to Account for System Non-Stationarity

arXiv:2604.03906v1 · Announce Type: new · Abstract: Geoscientific systems tend to be characterized by pronounced temporal non-stationarity, arising from seasonal and climatic variability in hydrometeorological drivers, and …

M Jawad, HV Gupta, YH Wang, MA Farmani, A Behrangi, GY Niu

17 views Apr 7

Academic · 1 min

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

arXiv:2604.03891v1 · Announce Type: new · Abstract: Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves …

Yaoze Guo, Shana Moothedath

7 views Apr 7

Academic · 1 min

Regime-Calibrated Demand Priors for Ride-Hailing Fleet Dispatch and Repositioning

arXiv:2604.03883v1 · Announce Type: new · Abstract: Effective ride-hailing dispatch requires anticipating demand patterns that vary substantially across time-of-day, day-of-week, season, and special events. We propose a …

Indar Kumar, Akanksha Tiwari

24 views Apr 7

Academic · 1 min

Spatiotemporal Interpolation of GEDI Biomass with Calibrated Uncertainty

arXiv:2604.03874v1 · Announce Type: new · Abstract: Monitoring deforestation-driven carbon emissions requires both spatially explicit and temporally continuous estimates of aboveground biomass density (AGBD) with calibrated uncertainty. …

Robin Young, Srinivasan Keshav

26 views Apr 7

Academic · 1 min

SODA: Semi On-Policy Black-Box Distillation for Large Language Models

arXiv:2604.03873v1 · Announce Type: new · Abstract: Black-box knowledge distillation for large language models presents a strict trade-off. Simple off-policy methods (e.g., sequence-level knowledge distillation) struggle to …

Xiwen Chen, Jingjing Wang, Wenhui Zhu, Peijie Qiu, Xuanzhao Dong, Hejian Sang, Zhipeng Wang, Alborz Geramifard, Feng Luo

49 views Apr 7

Academic · 1 min

Where to Steer: Input-Dependent Layer Selection for Steering Improves LLM Alignment

arXiv:2604.03867v1 · Announce Type: new · Abstract: Steering vectors have emerged as a lightweight and effective approach for aligning large language models (LLMs) at inference time, enabling …

Soham Gadgil, Chris Lin, Su-In Lee

21 views Apr 7

Academic · 1 min

A Bayesian Information-Theoretic Approach to Data Attribution

arXiv:2604.03858v1 · Announce Type: new · Abstract: Training Data Attribution (TDA) seeks to trace model predictions back to influential training examples, enhancing interpretability and safety. We formulate …

Dharmesh Tailor, Nicolò Felicioni, Kamil Ciosek

18 views Apr 7

Academic · 1 min

Understanding When Poisson Log-Normal Models Outperform Penalized Poisson Regression for Microbiome Count Data

arXiv:2604.03853v1 · Announce Type: new · Abstract: Multivariate count models are often justified by their ability to capture latent dependence, but researchers receive little guidance on when …

Daniel Agyapong, Julien Chiquet, Jane Marks, Toby Dylan Hocking

11 views Apr 7

Academic · 1 min

Collapse-Free Prototype Readout Layer for Transformer Encoders

arXiv:2604.03850v1 · Announce Type: new · Abstract: DDCL-Attention is a prototype-based readout layer for transformer encoders that replaces simple pooling methods, such as mean pooling or class …

Giansalvo Cirrincione, Rahul Ranjeev Kumar

30 views Apr 7

Academic · 1 min

k-Maximum Inner Product Attention for Graph Transformers and the Expressive Power of GraphGPS

arXiv:2604.03815v1 · Announce Type: new · Abstract: Graph transformers have shown promise in overcoming limitations of traditional graph neural networks, such as oversquashing and difficulties in modelling …

Jonas De Schouwer, Haitz Sáez de Ocáriz Borde, Xiaowen Dong

5 views Apr 7

Academic · 1 min

Representational Collapse in Multi-Agent LLM Committees: Measurement and Diversity-Aware Consensus

arXiv:2604.03809v1 · Announce Type: new · Abstract: Multi-agent LLM committees replicate the same model under different role prompts and aggregate outputs by majority vote, implicitly assuming that …

Dipkumar Patel

28 views Apr 7

