Academic

Academic · 1 min

Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics

arXiv:2604.00443v1 Announce Type: new Abstract: If the same neuron activates for both "lender" and "riverside," standard metrics attribute the overlap to superposition--the neuron must be …

Iyad Ait Hou, Rebecca Hwa

21 views Apr 3

Academic · 1 min

TR-ICRL: Test-Time Rethinking for In-Context Reinforcement Learning

arXiv:2604.00438v1 Announce Type: new Abstract: In-Context Reinforcement Learning (ICRL) enables Large Language Models (LLMs) to learn online from external rewards directly within the context window. …

Wenxuan Jiang, Yuxin Zuo, Zijian Zhang, Xuecheng Wu, Zining Fan, Wenxuan Liu, Li Chen, Xiaoyu Li, Xuezhi Cao, Xiaolong Jin, Ninghao Liu

63 views Apr 3

Academic · 1 min

Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models

arXiv:2604.00375v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) theoretically permit token decoding in arbitrary order, a flexibility that could enable richer exploration of …

Liancheng Fang, Aiwei Liu, Henry Peng Zou, Yankai Chen, Enze Ma, Leyi Pan, Chunyu Miao, Wei-Chieh Huang, Xue Liu, Philip S. Yu

10 views Apr 3

Academic · 1 min

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

arXiv:2604.00344v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown remarkable performance in completing various tasks. However, solving complex problems often requires the coordination …

Eric Hanchen Jiang, Levina Li, Rui Sun, Xiao Liang, Yubei Li, Yuchen Wu, Haozheng Luo, Hengli Li, Zhi Zhang, Zhaolu Kang, Kai-Wei Chang, Ying Nian Wu

17 views Apr 3

Academic · 1 min

Large Language Models in the Abuse Detection Pipeline

arXiv:2604.00323v1 Announce Type: new Abstract: Online abuse has grown increasingly complex, spanning toxic language, harassment, manipulation, and fraudulent behavior. Traditional machine-learning approaches dependent on static …

Suraj Kath, Sanket Badhe, Preet Shah, Ashwin Sampathkumar, Shivani Gupta

31 views Apr 3

Academic · 1 min

Asymmetric Actor-Critic for Multi-turn LLM Agents

arXiv:2604.00304v1 Announce Type: new Abstract: Large language models (LLMs) exhibit strong reasoning and conversational abilities, but ensuring reliable behavior in multi-turn interactions remains challenging. In …

Shuli Jiang, Zhaoyang Zhang, Yi Zhang, Shuo Yang, Wei Xia, Stefano Soatto

23 views Apr 3

Academic · 1 min

Frege in the Flesh: Biolinguistics and the Neural Enforcement of Syntactic Structures

arXiv:2604.00291v1 Announce Type: new Abstract: Biolinguistics is the interdisciplinary scientific study of the biological foundations, evolution, and genetic basis of human language. It treats language …

Elliot Murphy

12 views Apr 3

Academic · 1 min

Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study

arXiv:2604.00261v2 Announce Type: new Abstract: Large language models (LLMs) have achieved strong performance on medical question answering (medical QA), and chain-of-thought (CoT) prompting has further …

Zaifu Zhan, Mengyuan Cui, Rui Zhang

26 views Apr 3

Academic · 1 min

LLM Essay Scoring Under Holistic and Analytic Rubrics: Prompt Effects and Bias

arXiv:2604.00259v1 Announce Type: new Abstract: Despite growing interest in using Large Language Models (LLMs) for educational assessment, it remains unclear how closely they align with …

Filip J. Kucia, Anirban Chakraborty, Anna Wr\'oblewska

10 views Apr 3

Academic · 1 min

REM-CTX: Automated Peer Review via Reinforcement Learning with Auxiliary Context

arXiv:2604.00248v1 Announce Type: new Abstract: Most automated peer review systems rely on textual manuscript content alone, leaving visual elements such as figures and external scholarly …

Pawin Taechoyotin, Daniel E. Acuna

9 views Apr 3

Academic · 1 min

A Taxonomy of Programming Languages for Code Generation

arXiv:2604.00239v1 Announce Type: new Abstract: The world's 7,000+ languages vary widely in the availability of resources for NLP, motivating efforts to systematically categorize them by …

Nishat Raihan, Christian Newman, Marcos Zampieri

26 views Apr 3

Academic · 1 min

Do Language Models Know When They'll Refuse? Probing Introspective Awareness of Safety Boundaries

arXiv:2604.00228v1 Announce Type: new Abstract: Large language models are trained to refuse harmful requests, but can they accurately predict when they will refuse before responding? …

Tanay Gondil

10 views Apr 3

Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics

TR-ICRL: Test-Time Rethinking for In-Context Reinforcement Learning

Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Large Language Models in the Abuse Detection Pipeline

Asymmetric Actor-Critic for Multi-turn LLM Agents

Frege in the Flesh: Biolinguistics and the Neural Enforcement of Syntactic Structures

Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study

LLM Essay Scoring Under Holistic and Analytic Rubrics: Prompt Effects and Bias

REM-CTX: Automated Peer Review via Reinforcement Learning with Auxiliary Context

A Taxonomy of Programming Languages for Code Generation

Do Language Models Know When They'll Refuse? Probing Introspective Awareness of Safety Boundaries

JCG, PC

HSOLLC Co., Ltd.