Academic

Academic · 1 min

Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling

arXiv:2604.04987v1 Announce Type: new Abstract: Speculative sampling (SpS) has been successful in accelerating the decoding throughput of auto-regressive large language models by leveraging smaller draft …

Yongchang Hao, Lili Mou

18 views Apr 8

Academic · 1 min

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

arXiv:2604.04986v1 Announce Type: new Abstract: Model-free deep reinforcement learning (DRL) methods suffer from poor sample efficiency. To overcome this limitation, this work introduces an adaptive …

Zesheng Yao, Zhen-Hua Wan, Canjun Yang, Qingchao Xia, Mengqi Zhang

4 views Apr 8

Academic · 1 min

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

arXiv:2604.04983v1 Announce Type: new Abstract: We present Territory Paint Wars, a minimal competitive multi-agent reinforcement learning environment implemented in Unity, and use it to systematically …

Diyansha Singh

10 views Apr 8

Academic · 1 min

A Theory-guided Weighted $L^2$ Loss for solving the BGK model via Physics-informed neural networks

arXiv:2604.04971v1 Announce Type: new Abstract: While Physics-Informed Neural Networks offer a promising framework for solving partial differential equations, the standard $L^2$ loss formulation is fundamentally …

Gyounghun Ko, Sung-Jun Son, Seung Yeon Cho, Myeong-Su Lee

19 views Apr 8

Academic · 1 min

MedLayBench-V: A Large-Scale Benchmark for Expert-Lay Semantic Alignment in Medical Vision Language Models

arXiv:2604.05738v1 Announce Type: new Abstract: Medical Vision-Language Models (Med-VLMs) have achieved expert-level proficiency in interpreting diagnostic imaging. However, current models are predominantly trained on professional …

Han Jang, Junhyeok Lee, Heeseong Eum, Kyu Sung Choi

6 views Apr 8

Academic · 1 min

Dialogue Act Patterns in GenAI-Mediated L2 Oral Practice: A Sequential Analysis of Learner-Chatbot Interactions

arXiv:2604.05702v1 Announce Type: new Abstract: While generative AI (GenAI) voice chatbots offer scalable opportunities for second language (L2) oral practice, the interactional processes related to …

Liqun He (Cindy), Shijun (Cindy), Chen, Mutlu Cukurova, Manolis Mavrikis

4 views Apr 8

Academic · 1 min

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: new Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-context and long-generation regimes. Architectures such …

Zhen Cheng, Hao-Bo Yang, Wan-Yi Huang, Jin-Long Li

4 views Apr 8

Academic · 1 min

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

arXiv:2604.05655v1 Announce Type: new Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through representation space. We show that mathematical reasoning …

Lihao Sun, Hang Dong, Bo Qiao, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan

4 views Apr 8

Academic · 1 min

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference …

arXiv:2604.05650v1 Announce Type: new Abstract: Video Large Language Models (Video-LLMs) excel in video understanding but suffer from high inference latency during autoregressive generation. Speculative Decoding …

Yicheng Ji, Jun Zhang, Jinpeng Chen, Cong Wang, Lidan Shou, Gang Chen, Huan Li

24 views Apr 8

Academic · 1 min

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

arXiv:2604.05643v1 Announce Type: new Abstract: Extending CoT through RL has been widely used to enhance the reasoning capabilities of LLMs. However, due to the sparsity …

Hongyuan Yuan, Xinran He, Run Shao, Bolei He, Xianwei Xue, Mengke Chen, Qiutong Pan, Haiwei Wang, Haifeng Li

5 views Apr 8

Academic · 1 min

YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset

arXiv:2604.05624v1 Announce Type: new Abstract: Named Entity Recognition (NER) is a foundational NLP task, yet research in Yor\`ub\'a has been constrained by limited and domain-specific …

Peace Busola Falola, Jesujoba O. Alabi, Solomon O. Akinola, Folashade T. Ogunajo, Emmanuel Oluwadunsin Alabi, David Ifeoluwa Adelani

4 views Apr 8

Academic · 1 min

THIVLVC: Retrieval Augmented Dependency Parsing for Latin

arXiv:2604.05564v1 Announce Type: new Abstract: We describe THIVLVC, a two-stage system for the EvaLatin 2026 Dependency Parsing task. Given a Latin sentence, we retrieve structurally …

Luc Pommeret (STL), Thibault Wagret (ENS de Lyon, HiSoMA), Jules Deret

5 views Apr 8

Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

A Theory-guided Weighted $L^2$ Loss for solving the BGK model via Physics-informed neural networks

MedLayBench-V: A Large-Scale Benchmark for Expert-Lay Semantic Alignment in Medical Vision Language Models

Dialogue Act Patterns in GenAI-Mediated L2 Oral Practice: A Sequential Analysis of Learner-Chatbot Interactions

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference …

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset

THIVLVC: Retrieval Augmented Dependency Parsing for Latin

JCG, PC

HSOLLC Co., Ltd.