Intellectual Property

LOW Academic International

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

arXiv:2603.09095v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) can process text presented as images, yet they often perform worse than when the same content is provided as textual tokens. We systematically diagnose this "modality gap" by evaluating seven...

1 min 1 month, 1 week ago

ip

LOW Academic International

Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning

arXiv:2603.08999v1 Announce Type: new Abstract: Large language models (LLMs) achieve strong reasoning performance through chain-of-thought (CoT) reasoning, yet often generate unnecessarily long reasoning paths that incur high inference cost. Recent self-consistency-based approaches further improve accuracy but require sampling and aggregating...

1 min 1 month, 1 week ago

ip

LOW Academic International

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

arXiv:2603.08869v1 Announce Type: new Abstract: Do the features learned by Sparse Autoencoders (SAEs) represent abstract meaning, or are they tied to how text is written? We investigate this question using Serbian digraphia as a controlled testbed: Serbian is written interchangeably...

1 min 1 month, 1 week ago

ip

LOW Academic International

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

arXiv:2603.09678v1 Announce Type: new Abstract: Large language models achieve near-ceiling performance on code generation benchmarks, yet these results increasingly reflect memorization rather than genuine reasoning. We introduce EsoLang-Bench, a benchmark using five esoteric programming languages (Brainfuck, Befunge-98, Whitespace, Unlambda, and...

1 min 1 month, 1 week ago

nda

LOW Academic International

PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution

arXiv:2603.09641v1 Announce Type: new Abstract: LLM agents that store knowledge as natural language suffer steep retrieval degradation as condition count grows, often struggle to compose learned rules reliably, and typically lack explicit mechanisms to detect stale or adversarial knowledge. We...

1 min 1 month, 1 week ago

ip

LOW Academic United States

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

arXiv:2603.09200v1 Announce Type: new Abstract: Situational awareness, the capacity of an AI system to recognize its own nature, understand its training and deployment context, and reason strategically about its circumstances, is widely considered among the most dangerous emergent capabilities in...

1 min 1 month, 1 week ago

ip

LOW Academic International

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

arXiv:2603.09192v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) improves factual grounding, yet most systems rely on flat chunk retrieval and provide limited control over multi-step synthesis. We propose an Explainable Innovation Engine that upgrades the knowledge unit from text chunks...

1 min 1 month, 1 week ago

ip

LOW Academic United States

EPOCH: An Agentic Protocol for Multi-Round System Optimization

arXiv:2603.09049v1 Announce Type: new Abstract: Autonomous agents are increasingly used to improve prompts, code, and machine learning systems through iterative execution and feedback. Yet existing approaches are usually designed as task-specific optimization loops rather than as a unified protocol for...

1 min 1 month, 1 week ago

nda

LOW Academic International

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

arXiv:2603.09416v1 Announce Type: new Abstract: Large Language Models (LLMs) excel in Natural Language Processing (NLP) tasks, but they often propagate biases embedded in their training data, which is potentially impactful in sensitive domains like healthcare. While existing benchmarks evaluate biases...

1 min 1 month, 1 week ago

ip

LOW Academic International

A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

arXiv:2603.08954v1 Announce Type: new Abstract: The first 72 hours of a missing-person investigation are critical for successful recovery. Guardian is an end-to-end system designed to support missing-child investigation and early search planning. This paper presents the Guardian LLM Pipeline, a...

1 min 1 month, 1 week ago

ip

LOW Academic International

DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering

arXiv:2603.09152v1 Announce Type: new Abstract: Table Question Answering (TableQA) enables natural language interaction with structured tabular data. However, existing large language model (LLM) approaches face critical limitations: context length constraints that restrict data handling capabilities, hallucination issues that compromise answer...

1 min 1 month, 1 week ago

ip

LOW Academic United States

Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

arXiv:2603.08933v1 Announce Type: new Abstract: The first 72 hours of a missing-child investigation are critical for successful recovery. However, law enforcement agencies often face fragmented, unstructured data and a lack of dynamic, geospatial predictive tools. Our system, Guardian, provides an...

1 min 1 month, 1 week ago

ip

LOW Academic European Union

Curveball Steering: The Right Direction To Steer Isn't Always Linear

arXiv:2603.09313v1 Announce Type: new Abstract: Activation steering is a widely used approach for controlling large language model (LLM) behavior by intervening on internal representations. Existing methods largely rely on the Linear Representation Hypothesis, assuming behavioral attributes can be manipulated using...

1 min 1 month, 1 week ago

ip

LOW Academic European Union

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

arXiv:2603.09435v1 Announce Type: new Abstract: The rapid rollout of AI in heterogeneous public and societal sectors has subsequently escalated the need for compliance with regulatory standards and frameworks. The EU AI Act has emerged as a landmark in the regulatory...

1 min 1 month, 1 week ago

nda

LOW Academic European Union

An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse

arXiv:2603.09463v1 Announce Type: new Abstract: Model merging unifies independently fine-tuned LLMs from the same base, enabling reuse and integration of parallel development efforts without retraining. However, in practice we observe that merging does not always succeed: certain combinations of task-specialist...

1 min 1 month, 1 week ago

nda

LOW Academic International

Vibe-Creation: The Epistemology of Human-AI Emergent Cognition

arXiv:2603.09486v1 Announce Type: new Abstract: The encounter between human reasoning and generative artificial intelligence (GenAI) cannot be adequately described by inherited metaphors of tool use, augmentation, or collaborative partnership. This article argues that such interactions produce a qualitatively distinct cognitive-epistemic...

1 min 1 month, 1 week ago

ip

LOW Academic European Union

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

arXiv:2603.09533v1 Announce Type: new Abstract: This study proposes a novel methodology for generating personalized fake news debunking messages by prompting Large Language Models (LLMs) with persona-based inputs aligned to the Big Five personality traits: Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness....

1 min 1 month, 1 week ago

ip

LOW Academic European Union

AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents

arXiv:2603.09716v1 Announce Type: new Abstract: Autonomous agent frameworks still struggle to reconcile long-term experiential learning with real-time, context-sensitive decision-making. In practice, this gap appears as static cognition, rigid workflow dependence, and inefficient context usage, which jointly limit adaptability in open-ended...

1 min 1 month, 1 week ago

nda

LOW Academic European Union

Quantifying the Necessity of Chain of Thought through Opaque Serial Depth

arXiv:2603.09786v1 Announce Type: new Abstract: Large language models (LLMs) tend to externalize their reasoning in their chain of thought, making the chain of thought a good target for monitoring. This is partially an inherent feature of the Transformer architecture: sufficiently...

1 min 1 month, 1 week ago

nda

LOW Academic International

LCA: Local Classifier Alignment for Continual Learning

arXiv:2603.09888v1 Announce Type: new Abstract: A fundamental requirement for intelligent systems is the ability to learn continuously under changing environments. However, models trained in this regime often suffer from catastrophic forgetting. Leveraging pre-trained models has recently emerged as a promising...

1 min 1 month, 1 week ago

nda

LOW Academic International

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

arXiv:2603.09890v1 Announce Type: new Abstract: Large Language Models (LLMs) have emerged as a new paradigm for multi-agent systems. However, existing research on the behaviour of LLM-based multi-agents relies on ad hoc prompts and lacks a principled policy perspective. Different from...

1 min 1 month, 1 week ago

ip

LOW Academic International

PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs

arXiv:2603.09943v1 Announce Type: new Abstract: Computational pathology demands both visual pattern recognition and dynamic integration of structured domain knowledge, including taxonomy, grading criteria, and clinical evidence. In practice, diagnostic reasoning requires linking morphological evidence with formal diagnostic and grading criteria....

1 min 1 month, 1 week ago

nda

LOW Academic International

Logos: An evolvable reasoning engine for rational molecular design

arXiv:2603.09268v1 Announce Type: new Abstract: The discovery and design of functional molecules remain central challenges across chemistry,biology, and materials science. While recent advances in machine learning have accelerated molecular property prediction and candidate generation, existing models tend to excel either...

1 min 1 month, 1 week ago

ip

LOW Academic International

Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing

arXiv:2603.09205v1 Announce Type: new Abstract: Large language models are routinely deployed on text that varies widely in emotional tone, yet their reasoning behavior is typically evaluated without accounting for emotion as a source of representational variation. Prior work has largely...

1 min 1 month, 1 week ago

ip

LOW Academic International

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

arXiv:2603.09215v1 Announce Type: new Abstract: Interleaved spoken language models (SLMs) alternately generate text and speech tokens, but decoding at full transformer depth for every step becomes costly, especially due to long speech sequences. We propose SPAR-K, a modality-aware early exit...

1 min 1 month, 1 week ago

ip

LOW Academic International

Quantifying and extending the coverage of spatial categorization data sets

arXiv:2603.09373v1 Announce Type: new Abstract: Variation in spatial categorization across languages is often studied by eliciting human labels for the relations depicted in a set of scenes known as the Topological Relations Picture Series (TRPS). We demonstrate that labels generated...

1 min 1 month, 1 week ago

nda

LOW Academic European Union

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance

arXiv:2603.08989v1 Announce Type: new Abstract: Thematic analysis (TA) is widely used in health research to extract patterns from patient interviews, yet manual TA faces challenges in scalability and reproducibility. LLM-based automation can help, but existing approaches produce codebooks with limited...

1 min 1 month, 1 week ago

ip

LOW Academic International

MASEval: Extending Multi-Agent Evaluation from Models to Systems

arXiv:2603.08835v1 Announce Type: new Abstract: The rapid adoption of LLM-based agentic systems has produced a rich ecosystem of frameworks (smolagents, LangGraph, AutoGen, CAMEL, LlamaIndex, i.a.). Yet existing benchmarks are model-centric: they fix the agentic setup and do not compare other...

1 min 1 month, 1 week ago

ip

LOW Academic United States

LDP: An Identity-Aware Protocol for Multi-Agent LLM Systems

arXiv:2603.08852v1 Announce Type: new Abstract: As multi-agent AI systems grow in complexity, the protocols connecting them constrain their capabilities. Current protocols such as A2A and MCP do not expose model-level properties as first-class primitives, ignoring properties fundamental to effective delegation:...

1 min 1 month, 1 week ago

nda

LOW Academic International

Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search

arXiv:2603.08877v1 Announce Type: new Abstract: Agentic Retrieval-Augmented Generation (RAG) systems combine iterative search, planning prompts, and retrieval backends, but deployed settings impose explicit budgets on tool calls and completion tokens. We present a controlled measurement study of how search depth,...

1 min 1 month, 1 week ago

ip

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

EPOCH: An Agentic Protocol for Multi-Round System Optimization

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering

Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

Curveball Steering: The Right Direction To Steer Isn't Always Linear

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse

Vibe-Creation: The Epistemology of Human-AI Emergent Cognition

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents

Quantifying the Necessity of Chain of Thought through Opaque Serial Depth

LCA: Local Classifier Alignment for Continual Learning

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs

Logos: An evolvable reasoning engine for rational molecular design

Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

Quantifying and extending the coverage of spatial categorization data sets

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance

MASEval: Extending Multi-Agent Evaluation from Models to Systems

LDP: An Identity-Aware Protocol for Multi-Agent LLM Systems

Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.