Academic

Academic · 1 min

Generalization Limits of Reinforcement Learning Alignment

arXiv:2604.02652v1 Announce Type: new Abstract: The safety of large language models (LLMs) relies on alignment techniques such as reinforcement learning from human feedback (RLHF). However, …

Haruhi Shida, Koo Imai, Keigo Kansa

29 views Apr 6

Academic · 1 min

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

arXiv:2604.02651v1 Announce Type: new Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world scenarios. Learning from extremely …

Cunyang Wei, Siddharth Singh, Aishwarya Sarkar, Daniel Nichols, Tisha Patel, Aditya K. Ranjan, Sayan Ghosh, Ali Jannesari, Nathan R. Tallent, Abhinav Bhatele

20 views Apr 6

Academic · 1 min

Conditional Sampling via Wasserstein Autoencoders and Triangular Transport

arXiv:2604.02644v1 Announce Type: new Abstract: We present Conditional Wasserstein Autoencoders (CWAEs), a framework for conditional simulation that exploits low-dimensional structure in both the conditioned and …

Mohammad Al-Jarrah, Michele Martino, Marcus Yim, Bamdad Hosseini, Amirhossein Taghvaei

32 views Apr 6

Academic · 1 min

AXELRAM: Quantize Once, Never Dequantize

arXiv:2604.02638v1 Announce Type: new Abstract: We propose AXELRAM, a smart SRAM macro architecture that computes attention scores directly from quantized KV cache indices without dequantization. …

Yasushi Nishida

36 views Apr 6

Academic · 1 min

Analytic Drift Resister for Non-Exemplar Continual Graph Learning

arXiv:2604.02633v1 Announce Type: new Abstract: Non-Exemplar Continual Graph Learning (NECGL) seeks to eliminate the privacy risks intrinsic to rehearsal-based paradigms by retaining solely class-level prototype …

Lei Song, Shihan Guan, Youyong Kong

12 views Apr 6

Academic · 1 min

Complex-Valued GNNs for Distributed Basis-Invariant Control of Planar Systems

arXiv:2604.02615v1 Announce Type: new Abstract: Graph neural networks (GNNs) are a well-regarded tool for learned control of networked dynamical systems due to their ability to …

Samuel Honor, Mohamed Abdelnaby, Kevin Leahy

12 views Apr 6

Academic · 1 min

Steerable but Not Decodable: Function Vectors Operate Beyond the Logit Lens

arXiv:2604.02608v1 Announce Type: new Abstract: Function vectors (FVs) -- mean-difference directions extracted from in-context learning demonstrations -- can steer large language model behavior when added …

Mohammed Suhail B Nadaf

9 views Apr 6

Academic · 1 min

WGFINNs: Weak formulation-based GENERIC formalism informed neural networks

arXiv:2604.02601v1 Announce Type: new Abstract: Data-driven discovery of governing equations from noisy observations remains a fundamental challenge in scientific machine learning. While GENERIC formalism informed …

Jun Sur Richard Park, Auroni Huque Hashim, Siu Wun Cheung, Youngsoo Choi, Yeonjong Shin

33 views Apr 6

Academic · 1 min

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

arXiv:2604.02580v1 Announce Type: new Abstract: Evaluating code generation models for 3D spatial reasoning requires executing generated code in realistic environments and assessing outputs beyond surface-level …

Yan Zheng, Florian Bordes

4 views Apr 6

Academic · 1 min

ROMAN: A Multiscale Routing Operator for Convolutional Time Series Models

arXiv:2604.02577v1 Announce Type: new Abstract: We introduce ROMAN (ROuting Multiscale representAtioN), a deterministic operator for time series that maps temporal scale and coarse temporal position …

Gonzalo Uribarri

8 views Apr 6

Academic · 1 min

Communication-Efficient Distributed Learning with Differential Privacy

arXiv:2604.02558v1 Announce Type: new Abstract: We address nonconvex learning problems over undirected networks. In particular, we focus on the challenge of designing an algorithm that …

Xiaoxing Ren, Yuwen Ma, Nicola Bastianello, Karl H. Johansson, Thomas Parisini, Andreas A. Malikopoulos

28 views Apr 6

Academic · 1 min

Fast NF4 Dequantization Kernels for Large Language Model Inference

arXiv:2604.02556v1 Announce Type: new Abstract: Large language models (LLMs) have grown beyond the memory capacity of single GPU devices, necessitating quantization techniques for practical deployment. …

Xiangbo Qi, Chaoyi Jiang, Murali Annavaram

17 views Apr 6
