All Articles

Academic · 1 min

Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

arXiv:2603.10384v1 Announce Type: new Abstract: Evaluating LLM reliability via scalar probabilities often fails to capture the structural dynamics of reasoning. We introduce TRACED, a framework …

Xinyan Jiang, Ninghao Liu, Di Wang, Lijie Hu

30 views Mar 12

Academic · 1 min

Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations

arXiv:2603.09997v1 Announce Type: cross Abstract: When OpenAI deprecated GPT-4o in early 2026, thousands of users protested under #keep4o, claiming newer models had "lost their empathy." …

Michael Keeman, Anastasia Keeman

37 views Mar 12

Academic · 1 min

CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents

arXiv:2603.10577v1 Announce Type: new Abstract: Computer-Use Agents (CUAs) are emerging as a new paradigm in human-computer interaction, enabling autonomous execution of tasks in desktop environment …

Marta Sumyk, Oleksandr Kosovan

37 views Mar 12

Academic · 1 min

Quantifying Hallucinations in Large Language Models on Medical Textbooks

arXiv:2603.09986v1 Announce Type: cross Abstract: Hallucinations, the tendency for large language models to provide responses with factually incorrect and unsupported claims, is a serious problem …

Brandon C. Colelough, Davis Bartels, Dina Demner-Fushman

52 views Mar 12

Academic · 1 min

FAME: Formal Abstract Minimal Explanation for Neural Networks

arXiv:2603.10661v1 Announce Type: new Abstract: We propose FAME (Formal Abstract Minimal Explanations), a new class of abductive explanations grounded in abstract interpretation. FAME is the …

Ryma Boumazouza, Raya Elsaleh, Melanie Ducoffe, Shahaf Bassan, Guy Katz

42 views Mar 12

Academic · 1 min

PoultryLeX-Net: Domain-Adaptive Dual-Stream Transformer Architecture for Large-Scale Poultry Stakeholder Modeling

arXiv:2603.09991v1 Announce Type: cross Abstract: The rapid growth of the global poultry industry, driven by rising demand for affordable animal protein, has intensified public discourse …

Stephen Afrifa, Biswash Khatiwada, Kapalik Khanal, Sanjay Shah, Lingjuan Wang-Li, Ramesh Bahadur Bist

34 views Mar 12

Academic · 1 min

TAMUSA-Chat: A Domain-Adapted Large Language Model Conversational System for Research and Responsible Deployment

arXiv:2603.09992v1 Announce Type: cross Abstract: This paper presents TAMUSA-Chat, a research-oriented framework for building domain-adapted large language model conversational systems. The work addresses critical challenges …

Izzat Alsmadi, Anas Alsobeh

41 views Mar 12

Academic · 1 min

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

arXiv:2603.10016v1 Announce Type: cross Abstract: We investigate whether large language models (LLMs) display human-like cognitive biases, focusing on potential implications for assistance in judicial sentencing, …

Sierra S. Liu

70 views Mar 12

Academic · 1 min

Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations

arXiv:2603.09988v1 Announce Type: cross Abstract: Mechanistic interpretability identifies internal circuits responsible for model behaviors, yet translating these findings into human-understandable explanations remains an open problem. …

Ajay Pravin Mahale

32 views Mar 12

Academic · 1 min

Hybrid Self-evolving Structured Memory for GUI Agents

arXiv:2603.10291v1 Announce Type: new Abstract: The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet …

Sibo Zhu, Wenyi Wu, Kun Zhou, Stephen Wang, Biwei Huang

43 views Mar 12

Academic · 1 min

Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation

arXiv:2603.09987v1 Announce Type: cross Abstract: Feature Transformation (FT) is a core data-centric AI task that improves feature space quality to advance downstream predictive performance. However, …

Xinyuan Wang, Kunpeng Liu, Arun Vignesh Malarkkan, Yanjie Fu

27 views Mar 12

Academic · 1 min

Explainable LLM Unlearning Through Reasoning

arXiv:2603.09980v1 Announce Type: cross Abstract: LLM unlearning is essential for mitigating safety, copyright, and privacy concerns in pre-trained large language models (LLMs). Compared to preference …

Junfeng Liao, Qizhou Wang, Shanshan Ye, Xin Yu, Ling Chen, Zhen Fang

44 views Mar 12
