International Law

LOW Academic United States

Continually self-improving AI

arXiv:2603.18073v1 Announce Type: new Abstract: Modern language model-based AI systems are remarkably powerful, yet their capabilities remain fundamentally capped by their human creators in three key ways. First, although a model's weights can be updated via fine-tuning, acquiring new knowledge...

1 min 1 month ago

ear

LOW Academic International

An Onto-Relational-Sophic Framework for Governing Synthetic Minds

arXiv:2603.18633v1 Announce Type: new Abstract: The rapid evolution of artificial intelligence, from task-specific systems to foundation models exhibiting broad, flexible competence across reasoning, creative synthesis, and social interaction, has outpaced the conceptual and governance frameworks designed to manage it. Current...

1 min 1 month ago

ear

LOW Academic United States

Interpretability without actionability: mechanistic methods cannot correct language model errors despite near-perfect internal representations

arXiv:2603.18353v1 Announce Type: new Abstract: Language models encode task-relevant knowledge in internal representations that far exceeds their output performance, but whether mechanistic interpretability methods can bridge this knowledge-action gap has not been systematically tested. We compared four mechanistic interpretability methods...

1 min 1 month ago

ear

LOW Academic European Union

From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents

arXiv:2603.18382v1 Announce Type: new Abstract: Anonymization is widely treated as a practical safeguard because re-identifying anonymous records was historically costly, requiring domain expertise, tailored algorithms, and manual corroboration. We study a growing privacy risk that may weaken this barrier: LLM-based...

1 min 1 month ago

ear

LOW Academic International

MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation

arXiv:2603.18676v1 Announce Type: new Abstract: MANAR (Memory-augmented Attention with Navigational Abstract Conceptual Representation), contextualization layer generalizes standard multi-head attention (MHA) by instantiating the principles of Global Workspace Theory (GWT). While MHA enables unconstrained all-to-all communication, it lacks the functional bottleneck...

1 min 1 month ago

ear

LOW Academic International

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

arXiv:2603.18329v1 Announce Type: new Abstract: Inference-time steering is widely regarded as a lightweight and parameter-free mechanism for controlling large language model (LLM) behavior, and prior work has often suggested that simple activation-level interventions can reliably induce targeted behavioral changes. However,...

1 min 1 month ago

ear

LOW Academic International

Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation

arXiv:2603.18627v1 Announce Type: new Abstract: Precise Text-to-Image (T2I) generation has achieved great success but is hindered by the limited relational reasoning of static text encoders and the error accumulation in open-loop sampling. Without real-time feedback, initial semantic ambiguities during the...

1 min 1 month ago

ear

LOW Academic United States

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

arXiv:2603.18729v1 Announce Type: new Abstract: Many works in the literature show that LLM outputs exhibit discriminatory behaviour, triggering stereotype-based inferences based on the dialect in which the inputs are written. This bias has been shown to be particularly pronounced when...

1 min 1 month ago

ear

LOW Academic International

Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably

arXiv:2603.18563v1 Announce Type: new Abstract: AI agents are increasingly deployed in interactive economic environments characterized by repeated AI-AI interactions. Despite AI agents' advanced capabilities, empirical studies reveal that such interactions often fail to stably induce a strategic equilibrium, such as...

1 min 1 month ago

ear

LOW Academic International

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm

arXiv:2603.18007v1 Announce Type: new Abstract: The study explores whether current Large Language Models (LLMs) exhibit Theory of Mind (ToM) capabilities -- specifically, the ability to infer others' beliefs, intentions, and emotions from text. Given that LLMs are trained on language...

1 min 1 month ago

ear

LOW Academic United States

Can LLM generate interesting mathematical research problems?

arXiv:2603.18813v1 Announce Type: new Abstract: This paper is the second one in a series of work on the mathematical creativity of LLM. In the first paper, the authors proposed three criteria for evaluating the mathematical creativity of LLM and constructed...

1 min 1 month ago

ear

LOW Academic European Union

NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics

arXiv:2603.18761v1 Announce Type: new Abstract: Standard attention mechanisms in transformers are limited by their pairwise formulation, which hinders the modeling of higher-order dependencies among tokens. We introduce the NeuroGame Transformer (NGT) to overcome this by reconceptualizing attention through a dual...

1 min 1 month ago

ear

LOW Academic United States

The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

arXiv:2603.18294v1 Announce Type: new Abstract: Background: Clinical trials rely on transparent inclusion criteria to ensure generalizability. In contrast, benchmarks validating health-related large language models (LLMs) rarely characterize the "patient" or "query" populations they contain. Without defined composition, aggregate performance metrics...

1 min 1 month ago

ear

LOW Academic United States

Retrieval-Augmented LLM Agents: Learning to Learn from Experience

arXiv:2603.18272v1 Announce Type: new Abstract: While large language models (LLMs) have advanced the development of general-purpose agents, achieving robust generalization to unseen tasks remains a significant challenge. Current approaches typically rely on either fine-tuning or training-free memory-augmented generation using retrieved...

1 min 1 month ago

ear

LOW Academic European Union

Understanding the Theoretical Foundations of Deep Neural Networks through Differential Equations

arXiv:2603.18331v1 Announce Type: new Abstract: Deep neural networks (DNNs) have achieved remarkable empirical success, yet the absence of a principled theoretical foundation continues to hinder their systematic development. In this survey, we present differential equations as a theoretical foundation for...

1 min 1 month ago

ear

LOW Academic International

DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

arXiv:2603.18048v1 Announce Type: new Abstract: Recent Audio Multimodal Large Language Models (Audio MLLMs) demonstrate impressive performance on speech benchmarks, yet it remains unclear whether these models genuinely process acoustic signals or rely on text-based semantic inference. To systematically study this...

1 min 1 month ago

ear

LOW Academic International

An Agentic System for Schema Aware NL2SQL Generation

arXiv:2603.18018v1 Announce Type: new Abstract: The natural language to SQL (NL2SQL) task plays a pivotal role in democratizing data access by enabling non-expert users to interact with relational databases through intuitive language. While recent frameworks have enhanced translation accuracy via...

1 min 1 month ago

ear

LOW Academic International

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

arXiv:2603.18013v1 Announce Type: new Abstract: Large language models (LLMs) demonstrate the capacity to reconstruct and trace learned content from their training data under specific elicitation conditions, yet this capability does not manifest in standard generation contexts. This empirical observational study...

1 min 1 month ago

ear

LOW Academic International

Evaluating FrameNet-Based Semantic Modeling for Gender-Based Violence Detection in Clinical Records

arXiv:2603.18124v1 Announce Type: new Abstract: Gender-based violence (GBV) is a major public health issue, with the World Health Organization estimating that one in three women experiences physical or sexual violence by an intimate partner during her lifetime. In Brazil, although...

1 min 1 month ago

ear

LOW Academic International

Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind

arXiv:2603.18786v1 Announce Type: new Abstract: This volume includes a selection of papers presented at the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2026 in Singapore on 26th January 2026. The purpose of this volume...

1 min 1 month ago

ear

LOW Academic International

Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation

arXiv:2603.18573v1 Announce Type: new Abstract: Training conversational recommender systems (CRS) requires extensive dialogue data, which is challenging to collect at scale. To address this, researchers have used simulated user-recommender conversations. Traditional simulation approaches often utilize a single large language model...

1 min 1 month ago

ear

LOW Academic International

Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating

arXiv:2603.18011v1 Announce Type: new Abstract: Many modern AI question-answering systems convert text into vectors and retrieve the closest matches to a user question. While effective for topical similarity, similarity scores alone do not explain why some retrieved text can serve...

1 min 1 month ago

ear

LOW Academic United States

Agentic Framework for Political Biography Extraction

arXiv:2603.18010v1 Announce Type: new Abstract: The production of large-scale political datasets typically demands extracting structured facts from vast piles of unstructured documents or web sources, a task that traditionally relies on expensive human experts and remains prohibitively difficult to automate...

1 min 1 month ago

ear

LOW Academic International

CORE: Robust Out-of-Distribution Detection via Confidence and Orthogonal Residual Scoring

arXiv:2603.18290v1 Announce Type: new Abstract: Out-of-distribution (OOD) detection is essential for deploying deep learning models reliably, yet no single method performs consistently across architectures and datasets -- a scorer that leads on one benchmark often falters on another. We attribute...

1 min 1 month ago

ear

LOW Academic European Union

How LLMs Distort Our Written Language

arXiv:2603.18161v1 Announce Type: new Abstract: Large language models (LLMs) are used by over a billion people globally, most often to assist with writing. In this work, we demonstrate that LLMs not only alter the voice and tone of human writing,...

1 min 1 month ago

ear

LOW Academic United Kingdom

CWoMP: Morpheme Representation Learning for Interlinear Glossing

arXiv:2603.18184v1 Announce Type: new Abstract: Interlinear glossed text (IGT) is a standard notation for language documentation which is linguistically rich but laborious to produce manually. Recent automated IGT methods treat glosses as character sequences, neglecting their compositional structure. We propose...

1 min 1 month ago

ear

LOW Academic United States

How Psychological Learning Paradigms Shaped and Constrained Artificial Intelligence

arXiv:2603.18203v1 Announce Type: new Abstract: The dominant paradigms of artificial intelligence were shaped by learning theories from psychology: behaviorism inspired reinforcement learning, cognitivism gave rise to deep learning and memory-augmented architectures, and constructivism influenced curriculum learning and compositional approaches. This...

1 min 1 month ago

ear

LOW Academic International

From Noise to Signal: When Outliers Seed New Topics

arXiv:2603.18358v1 Announce Type: new Abstract: Outliers in dynamic topic modeling are typically treated as noise, yet we show that some can serve as early signals of emerging topics. We introduce a temporal taxonomy of news-document trajectories that defines how documents...

1 min 1 month ago

ear

LOW Academic International

Synthetic Data Generation for Training Diversified Commonsense Reasoning Models

arXiv:2603.18361v1 Announce Type: new Abstract: Conversational agents are required to respond to their users not only with high quality (i.e. commonsense bearing) responses, but also considering multiple plausible alternative scenarios, reflecting the diversity in their responses. Despite the growing need...

1 min 1 month ago

ear

LOW Academic European Union

PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

arXiv:2603.18363v1 Announce Type: new Abstract: Unsupervised Reinforcement Learning from Internal Feedback (RLIF) has emerged as a promising paradigm for eliciting the latent capabilities of Large Language Models (LLMs) without external supervision. However, current methods rely on heuristic intrinsic rewards, which...

1 min 1 month ago

ear

Continually self-improving AI

An Onto-Relational-Sophic Framework for Governing Synthetic Minds

Interpretability without actionability: mechanistic methods cannot correct language model errors despite near-perfect internal representations

From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents

MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm

Can LLM generate interesting mathematical research problems?

NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics

The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

Retrieval-Augmented LLM Agents: Learning to Learn from Experience

Understanding the Theoretical Foundations of Deep Neural Networks through Differential Equations

DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

An Agentic System for Schema Aware NL2SQL Generation

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

Evaluating FrameNet-Based Semantic Modeling for Gender-Based Violence Detection in Clinical Records

Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind

Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation

Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating

Agentic Framework for Political Biography Extraction

CORE: Robust Out-of-Distribution Detection via Confidence and Orthogonal Residual Scoring

How LLMs Distort Our Written Language

CWoMP: Morpheme Representation Learning for Interlinear Glossing

How Psychological Learning Paradigms Shaped and Constrained Artificial Intelligence

From Noise to Signal: When Outliers Seed New Topics

Synthetic Data Generation for Training Diversified Commonsense Reasoning Models

PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.