Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Verbalizing LLMs' assumptions to explain and control sycophancy

arXiv:2604.03058v1 Announce Type: new Abstract: LLMs can be socially sycophantic, affirming users when they ask questions like "am I in the wrong?" rather than providing …

Myra Cheng, Isabel Sieh, Humishka Zope, Sunny Yu, Lujain Ibrahim, Aryaman Arora, Jared Moore, Desmond Ong, Dan Jurafsky, Diyi Yang

21 views Apr 6

Academic · 1 min

DeltaLogic: Minimal Premise Edits Reveal Belief-Revision Failures in Logical Reasoning Models

arXiv:2604.02733v1 Announce Type: new Abstract: Reasoning benchmarks typically evaluate whether a model derives the correct answer from a fixed premise set, but they under-measure a …

Amit Dhanda

5 views Apr 6

Academic · 1 min

Beyond Message Passing: Toward Semantically Aligned Agent Communication

arXiv:2604.02369v1 Announce Type: cross Abstract: Agent communication protocols are becoming critical infrastructure for large language model (LLM) systems that must use tools, coordinate with other …

Dun Yuan, Fuyuan Lyu, Ye Yuan, Weixu Zhang, Bowei He, Jiayi Geng, Linfeng Du, Zipeng Sun, Yankai Chen, Changjiang Han, Jikun Kang, Alex Chen, Haolun Wu, Xue Liu

22 views Apr 6

Academic · 1 min

One Model to Translate Them All? A Journey to Mount Doom for Multilingual Model Merging

arXiv:2604.02881v1 Announce Type: new Abstract: Weight-space model merging combines independently fine-tuned models without accessing original training data, offering a practical alternative to joint training. While …

Baban Gain, Asif Ekbal, Trilok Nath Singh

6 views Apr 6

Academic · 1 min

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery

arXiv:2604.02346v1 Announce Type: cross Abstract: Large language models (LLMs) are in the ascendancy for research in drug discovery, offering unprecedented opportunities to reshape drug research …

Tianyu Liu, Sihan Jiang, Fan Zhang, Kunyang Sun, Teresa Head-Gordon, Hongyu Zhao

13 views Apr 6

Academic · 1 min

Internalized Reasoning for Long-Context Visual Document Understanding

arXiv:2604.02371v1 Announce Type: cross Abstract: Visual long-document understanding is critical for enterprise, legal, and scientific applications, yet the best performing open recipes have not explored …

Austin Veselka

14 views Apr 6

Academic · 1 min

Product-Stability: Provable Convergence for Gradient Descent on the Edge of Stability

arXiv:2604.02653v1 Announce Type: new Abstract: Empirically, modern deep learning training often occurs at the Edge of Stability (EoS), where the sharpness of the loss exceeds …

Eric Gan

38 views Apr 6

Academic · 1 min

Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures

arXiv:2305.18915v1 Announce Type: cross Abstract: In this work we build upon negative results from an attempt at language modeling with predicted semantic structure, in order …

Jakob Prange, Emmanuele Chersoni

5 views Apr 6

Academic · 1 min

Convolutional Surrogate for 3D Discrete Fracture-Matrix Tensor Upscaling

arXiv:2604.02335v1 Announce Type: new Abstract: Modeling groundwater flow in three-dimensional fractured crystalline media requires accounting for strong spatial heterogeneity induced by fractures. Fine-scale discrete fracture-matrix …

Martin \v{S}petl\'ik, Jan B\v{r}ezina

22 views Apr 6

Academic · 1 min

Reliability Gated Multi-Teacher Distillation for Low Resource Abstractive Summarization

arXiv:2604.03192v1 Announce Type: new Abstract: We study multiteacher knowledge distillation for low resource abstractive summarization from a reliability aware perspective. We introduce EWAD (Entropy Weighted …

Dipto Sumit, Ankan Kumar Roy, Sadia Khair Rodela, Atia Haque Asha, Mourchona Afrin, Niloy Farhan, Farig Yousuf Sadeque

17 views Apr 6

Academic · 1 min

ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents

arXiv:2604.02834v1 Announce Type: new Abstract: Longitudinal health agents must reason across multi-source trajectories that combine continuous device streams, sparse clinical exams, and episodic life events …

Chao Li, Cailiang Liu, Ang Gao, Kexin Deng, Shu Zhang, Langping Xu, Xiaotong Shi, Xionghao Ding, Jian Pei, Xun Jiang

4 views Apr 6

Academic · 1 min

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

arXiv:2604.02580v1 Announce Type: new Abstract: Evaluating code generation models for 3D spatial reasoning requires executing generated code in realistic environments and assessing outputs beyond surface-level …

Yan Zheng, Florian Bordes

4 views Apr 6

← Previous

38 39 40 41 42

Academic

Verbalizing LLMs' assumptions to explain and control sycophancy

DeltaLogic: Minimal Premise Edits Reveal Belief-Revision Failures in Logical Reasoning Models

Beyond Message Passing: Toward Semantically Aligned Agent Communication

One Model to Translate Them All? A Journey to Mount Doom for Multilingual Model Merging

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery

Internalized Reasoning for Long-Context Visual Document Understanding

Product-Stability: Provable Convergence for Gradient Descent on the Edge of Stability

Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures

Convolutional Surrogate for 3D Discrete Fracture-Matrix Tensor Upscaling

Reliability Gated Multi-Teacher Distillation for Low Resource Abstractive Summarization

ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

JCG, PC

HSOLLC Co., Ltd.