Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

LinearARD: Linear-Memory Attention Distillation for RoPE Restoration

arXiv:2604.00004v1 Announce Type: cross Abstract: The extension of context windows in Large Language Models is typically facilitated by scaling positional encodings followed by lightweight Continual …

Ning Yang, Hengyu Zhong, Wentao Wang, Baoliang Tian, Haijun Zhang, Jun Wang

47 views Apr 3

Academic · 1 min

Towards Reliable Truth-Aligned Uncertainty Estimation in Large Language Models

arXiv:2604.00445v1 Announce Type: new Abstract: Uncertainty estimation (UE) aims to detect hallucinated outputs of large language models (LLMs) to improve their reliability. However, UE metrics …

Ponhvoan Srey, Quang Minh Nguyen, Xiaobao Wu, Anh Tuan Luu

32 views Apr 3

Academic · 1 min

DySCo: Dynamic Semantic Compression for Effective Long-term Time Series Forecasting

arXiv:2604.01261v1 Announce Type: new Abstract: Time series forecasting (TSF) is critical across domains such as finance, meteorology, and energy. While extending the lookback window theoretically …

Xiang Ao, Yinyu Tan, Mengru Chen

36 views Apr 3

Academic · 1 min

Hierarchical Chain-of-Thought Prompting: Enhancing LLM Reasoning Performance and Efficiency

arXiv:2604.00130v1 Announce Type: new Abstract: Chain-of-Thought (CoT) prompting has significantly improved the reasoning capabilities of large language models (LLMs). However, conventional CoT often relies on …

Xingshuai Huang, Derek Li, Bahareh Nikpour, Parsa Omidi

24 views Apr 3

Academic · 1 min

WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics

arXiv:2604.00024v1 Announce Type: new Abstract: Large language models are increasingly used for medical guidance, but women's health remains under-evaluated in benchmark design. We present the …

Sneha Maurya, Pragya Saboo, Girish Kumar

49 views Apr 3

Academic · 1 min

Benchmark for Assessing Olfactory Perception of Large Language Models

arXiv:2604.00002v1 Announce Type: cross Abstract: Here we introduce the Olfactory Perception (OP) benchmark, designed to assess the capability of large language models (LLMs) to reason …

Eftychia Makri, Nikolaos Nakis, Laura Sisson, Gigi Minsky, Leandros Tassiulas, Vahid Satarifard, Nicholas A. Christakis

35 views Apr 3

Academic · 1 min

Two-Stage Optimizer-Aware Online Data Selection for Large Language Models

arXiv:2604.00001v1 Announce Type: cross Abstract: Gradient-based data selection offers a principled framework for estimating sample utility in large language model (LLM) fine-tuning, but existing methods …

Fangxin Wang, Peyman Baghershahi, Langzhou He, Henry Peng Zou, Sourav Medya, Philip S. Yu

32 views Apr 3

Academic · 1 min

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

arXiv:2604.01499v1 Announce Type: new Abstract: Evolution Strategies (ES) have emerged as a scalable gradient-free alternative to reinforcement learning based LLM fine-tuning, but it remains unclear …

William Hoy, Binxu Wang, Xu Pan

34 views Apr 3

Academic · 1 min

Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics

arXiv:2604.00443v1 Announce Type: new Abstract: If the same neuron activates for both "lender" and "riverside," standard metrics attribute the overlap to superposition--the neuron must be …

Iyad Ait Hou, Rebecca Hwa

44 views Apr 3

Academic · 1 min

Oblivion: Self-Adaptive Agentic Memory Control through Decay-Driven Activation

arXiv:2604.00131v1 Announce Type: new Abstract: Human memory adapts through selective forgetting: experiences become less accessible over time but can be reactivated by reinforcement or contextual …

Ashish Rana, Chia-Chien Hung, Qumeng Sun, Julian Martin Kunkel, Carolin Lawrence

54 views Apr 3

Academic · 1 min

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

arXiv:2604.01481v1 Announce Type: new Abstract: The development of robust clinical decision support systems is frequently impeded by the scarcity of high-fidelity, privacy-preserving biomedical data. While …

Arshia Ilaty, Hossein Shirazi, Amir Rahmani, Hajar Homayouni

35 views Apr 3

Academic · 1 min

ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation

arXiv:2604.00015v1 Announce Type: new Abstract: We present ASCAT (Arabic Scientific Corpus for Advanced Translation), a high-quality English-Arabic parallel benchmark corpus designed for scientific translation evaluation …

Serry Sibaee, Khloud Al Jallad, Zineb Yousfi, Israa Elsayed Elhosiny, Yousra El-Ghawi, Batool Balah, Omer Nacar

35 views Apr 3

← Previous

52 53 54 55 56

Academic

LinearARD: Linear-Memory Attention Distillation for RoPE Restoration

Towards Reliable Truth-Aligned Uncertainty Estimation in Large Language Models

DySCo: Dynamic Semantic Compression for Effective Long-term Time Series Forecasting

Hierarchical Chain-of-Thought Prompting: Enhancing LLM Reasoning Performance and Efficiency

WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics

Benchmark for Assessing Olfactory Perception of Large Language Models

Two-Stage Optimizer-Aware Online Data Selection for Large Language Models

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics

Oblivion: Self-Adaptive Agentic Memory Control through Decay-Driven Activation

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation

JCG, PC

HSOLLC Co., Ltd.