Academic

Academic · 1 min

Verbalizing LLMs' assumptions to explain and control sycophancy

arXiv:2604.03058v1 Announce Type: new Abstract: LLMs can be socially sycophantic, affirming users when they ask questions like "am I in the wrong?" rather than providing …

Myra Cheng, Isabel Sieh, Humishka Zope, Sunny Yu, Lujain Ibrahim, Aryaman Arora, Jared Moore, Desmond Ong, Dan Jurafsky, Diyi Yang

21 views Apr 6

Academic · 1 min

Querying Structured Data Through Natural Language Using Language Models

arXiv:2604.03057v1 Announce Type: new Abstract: This paper presents an open-source methodology for allowing users to query structured non-textual datasets through natural language. Unlike …

Hontan Valentin-Micu, Bunea Andrei-Alexandru, Tantaroudas Nikolaos Dimitrios, Popovici Dan-Matei

5 views Apr 6

Academic · 1 min

R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning

arXiv:2604.03004v1 Announce Type: new Abstract: While deep reasoning with long chain-of-thought has dramatically improved large language models in verifiable domains like mathematics, its effectiveness for …

Wanlong Liu, Bo Zhang, Chenliang Li, Shaopeng Lai, Yuning Wu, Xuanyu Lei, Ming Yan

5 views Apr 6

Academic · 1 min

NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons

arXiv:2604.02972v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) have recently achieved remarkable success in complex reasoning tasks. However, closer scrutiny reveals persistent failure modes …

Haonan Dong, Kehan Jiang, Haoran Ye, Wenhao Zhu, Zhaolu Kang, Guojie Song

5 views Apr 6

Academic · 1 min

LogicPoison: Logical Attacks on Graph Retrieval-Augmented Generation

arXiv:2604.02954v1 Announce Type: new Abstract: Graph-based Retrieval-Augmented Generation (GraphRAG) enhances the reasoning capabilities of Large Language Models (LLMs) by grounding their responses in structured knowledge …

Yilin Xiao, Jin Chen, Qinggang Zhang, Yujing Zhang, Chuang Zhou, Longhao Yang, Lingfei Ren, Xin Yang, Xiao Huang

9 views Apr 6

Academic · 1 min

A Multi-head-based architecture for effective morphological tagging in Russian with open dictionary

arXiv:2604.02926v1 Announce Type: new Abstract: The article proposes a new architecture based on Multi-head attention to solve the problem of morphological tagging for the Russian …

K. Skibin, M. Pozhidaev, S. Suschenko

8 views Apr 6

Academic · 1 min

Council Mode: Mitigating Hallucination and Bias in LLMs via Multi-Agent Consensus

arXiv:2604.02923v1 Announce Type: new Abstract: Large Language Models (LLMs), particularly those employing Mixture-of-Experts (MoE) architectures, have achieved remarkable capabilities across diverse natural language processing tasks. …

Shuai Wu, Xue Li, Yanna Feng, Yufang Li, Zhijun Wang

5 views Apr 6

Academic · 1 min

BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition

arXiv:2604.02904v1 Announce Type: new Abstract: In this article, we present a gold-standard benchmark dataset for Biomedical Urdu Named Entity Recognition (BioUNER), developed by crawling health-related …

Wazir Ali, Adeeb Noor, Sanaullah Mahar, Alia, Muhammad Mazhar Younas

8 views Apr 6

Academic · 1 min

One Model to Translate Them All? A Journey to Mount Doom for Multilingual Model Merging

arXiv:2604.02881v1 Announce Type: new Abstract: Weight-space model merging combines independently fine-tuned models without accessing original training data, offering a practical alternative to joint training. While …

Baban Gain, Asif Ekbal, Trilok Nath Singh

8 views Apr 6

Academic · 1 min

LLM-based Atomic Propositions help weak extractors: Evaluation of a Propositioner for triplet extraction

arXiv:2604.02866v1 Announce Type: new Abstract: Knowledge Graph construction from natural language requires extracting structured triplets from complex, information-dense sentences. In this paper, we investigate if …

Luc Pommeret (STL), Thomas Gerald (LISN), Patrick Paroubek (STL), Sahar Ghannay (STL), Christophe Servan (STL, AMIAD), Sophie Rosset (LISN, STL)

9 views Apr 6

Academic · 1 min

GRADE: Probing Knowledge Gaps in LLMs through Gradient Subspace Dynamics

arXiv:2604.02830v1 Announce Type: new Abstract: Detecting whether a model's internal knowledge is sufficient to correctly answer a given question is a fundamental challenge in deploying …

Yujing Wang, Yuanbang Liang, Yukun Lai, Hainan Zhang, Hanqi Yan

6 views Apr 6

Academic · 1 min

Student-in-the-Loop Chain-of-Thought Distillation via Generation-Time Selection

arXiv:2604.02819v1 Announce Type: new Abstract: Large reasoning models achieve strong performance on complex tasks through long chain-of-thought (CoT) trajectories, but directly transferring such reasoning processes …

Chaoqun He, Yingfa Chen, Chaojun Xiao, Xu Han, Lijie Wen

21 views Apr 6
