International Law

LOW Academic International

MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems

arXiv:2603.09909v1 Announce Type: new Abstract: While Multi-Agent Systems (MAS) show potential for complex clinical decision support, the field remains hindered by architectural fragmentation and the lack of standardized multimodal integration. Current medical MAS research suffers from non-uniform data ingestion pipelines,...

1 min 1 month, 1 week ago

LOW Academic International

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

arXiv:2603.09215v1 Announce Type: new Abstract: Interleaved spoken language models (SLMs) alternately generate text and speech tokens, but decoding at full transformer depth for every step becomes costly, especially due to long speech sequences. We propose SPAR-K, a modality-aware early exit...

1 min 1 month, 1 week ago

LOW Academic International

Reward Prediction with Factorized World States

arXiv:2603.09400v1 Announce Type: new Abstract: Agents must infer action outcomes and select actions that maximize a reward signal indicating how close the goal is to being reached. Supervised learning of reward models could introduce biases inherent to training data, limiting...

1 min 1 month, 1 week ago

LOW Academic International

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

arXiv:2603.09678v1 Announce Type: new Abstract: Large language models achieve near-ceiling performance on code generation benchmarks, yet these results increasingly reflect memorization rather than genuine reasoning. We introduce EsoLang-Bench, a benchmark using five esoteric programming languages (Brainfuck, Befunge-98, Whitespace, Unlambda, and...

1 min 1 month, 1 week ago

LOW Academic International

Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning

arXiv:2603.08999v1 Announce Type: new Abstract: Large language models (LLMs) achieve strong reasoning performance through chain-of-thought (CoT) reasoning, yet often generate unnecessarily long reasoning paths that incur high inference cost. Recent self-consistency-based approaches further improve accuracy but require sampling and aggregating...

1 min 1 month, 1 week ago

LOW Academic International

PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories: A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution

arXiv:2603.09641v1 Announce Type: new Abstract: LLM agents that store knowledge as natural language suffer steep retrieval degradation as condition count grows, often struggle to compose learned rules reliably, and typically lack explicit mechanisms to detect stale or adversarial knowledge. We...

1 min 1 month, 1 week ago

LOW Academic International

Telogenesis: Goal Is All U Need

arXiv:2603.09476v1 Announce Type: new Abstract: Goal-conditioned systems assume goals are provided externally. We ask whether attentional priorities can emerge endogenously from an agent's internal cognitive state. We propose a priority function that generates observation targets from three epistemic gaps: ignorance...

1 min 1 month, 1 week ago

LOW Academic International

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

arXiv:2603.08938v1 Announce Type: new Abstract: The rapid emergence of open-source, locally hosted intelligent agents marks a critical inflection point in human-computer interaction. Systems such as OpenClaw demonstrate that Large Language Model (LLM)-based agents can autonomously operate local computing environments, orchestrate...

1 min 1 month, 1 week ago

LOW Academic International

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

arXiv:2603.08869v1 Announce Type: new Abstract: Do the features learned by Sparse Autoencoders (SAEs) represent abstract meaning, or are they tied to how text is written? We investigate this question using Serbian digraphia as a controlled testbed: Serbian is written interchangeably...

1 min 1 month, 1 week ago

LOW Academic International

Social-R1: Towards Human-like Social Reasoning in LLMs

arXiv:2603.09249v1 Announce Type: new Abstract: While large language models demonstrate remarkable capabilities across numerous domains, social intelligence - the capacity to perceive social cues, infer mental states, and generate appropriate responses - remains a critical challenge, particularly for enabling effective...

1 min 1 month, 1 week ago

LOW Academic International

Context Engineering: From Prompts to Corporate Multi-Agent Architecture

arXiv:2603.09619v1 Announce Type: new Abstract: As artificial intelligence (AI) systems evolve from stateless chatbots to autonomous multi-step agents, prompt engineering (PE), the discipline of crafting individual queries, proves necessary but insufficient. This paper introduces context engineering (CE) as a standalone...

1 min 1 month, 1 week ago

LOW Academic International

MASEval: Extending Multi-Agent Evaluation from Models to Systems

arXiv:2603.08835v1 Announce Type: new Abstract: The rapid adoption of LLM-based agentic systems has produced a rich ecosystem of frameworks (smolagents, LangGraph, AutoGen, CAMEL, LlamaIndex, i.a.). Yet existing benchmarks are model-centric: they fix the agentic setup and do not compare other...

1 min 1 month, 1 week ago

LOW Academic International

LCA: Local Classifier Alignment for Continual Learning

arXiv:2603.09888v1 Announce Type: new Abstract: A fundamental requirement for intelligent systems is the ability to learn continuously under changing environments. However, models trained in this regime often suffer from catastrophic forgetting. Leveraging pre-trained models has recently emerged as a promising...

1 min 1 month, 1 week ago

LOW Academic International

ConFu: Contemplate the Future for Better Speculative Sampling

arXiv:2603.08899v1 Announce Type: new Abstract: Speculative decoding has emerged as a powerful approach to accelerate large language model (LLM) inference by employing lightweight draft models to propose candidate tokens that are subsequently verified by the target model. The effectiveness of...

1 min 1 month, 1 week ago

LOW Academic International

SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation

arXiv:2603.08910v1 Announce Type: new Abstract: We introduce SciTaRC, an expert-authored benchmark of questions about tabular data in scientific papers requiring both deep language reasoning and complex computation. We show that current state-of-the-art AI models fail on at least 23% of...

1 min 1 month, 1 week ago

LOW Academic International

Chaotic Dynamics in Multi-LLM Deliberation

arXiv:2603.09127v1 Announce Type: new Abstract: Collective AI systems increasingly rely on multi-LLM deliberation, but their stability under repeated execution remains poorly characterized. We model five-agent LLM committees as random dynamical systems and quantify inter-run sensitivity using an empirical Lyapunov exponent...

1 min 1 month, 1 week ago

LOW Academic International

TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation

arXiv:2603.09341v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) helps large language models (LLMs) answer knowledge-intensive and time-sensitive questions by conditioning generation on external evidence. However, most RAG systems still retrieve unstructured chunks and rely on one-shot generation, which often yields...

1 min 1 month, 1 week ago

LOW Academic International

MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

arXiv:2603.09652v1 Announce Type: new Abstract: With the rapid advancement of Large Language Models (LLMs) in code generation, human-AI interaction is evolving from static text responses to dynamic, interactive HTML-based applications, which we term MiniApps. These applications require models to not...

1 min 1 month, 1 week ago

LOW Academic International

You Didn't Have to Say It like That: Subliminal Learning from Faithful Paraphrases

arXiv:2603.09517v1 Announce Type: new Abstract: When language models are trained on synthetic data, they (student model) can covertly acquire behavioral traits from the data-generating model (teacher model). Subliminal learning refers to the transmission of traits from a teacher to a...

1 min 1 month, 1 week ago

LOW Academic International

Understanding the Interplay between LLMs' Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025

arXiv:2603.09654v1 Announce Type: new Abstract: Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing scalability of LMs, however, poses significant challenges for understanding a model's inner workings and further for updating or...

1 min 1 month, 1 week ago

LOW Academic International

Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records

arXiv:2603.09685v1 Announce Type: new Abstract: To overcome the limitations of manual administrative coding in geriatric Cardiovascular Risk Management, this study introduces an automated classification framework leveraging unstructured Electronic Health Records (EHRs). Using a dataset of 3,482 patients, we benchmarked three...

1 min 1 month, 1 week ago

LOW Academic International

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

arXiv:2603.09723v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used across the scientific workflow, including to draft peer-review reports. However, many AI-generated reviews are superficial and insufficiently actionable, leaving authors without concrete, implementable guidance and motivating the gap...

1 min 1 month, 1 week ago

LOW Academic International

Chow-Liu Ordering for Long-Context Reasoning in Chain-of-Agents

arXiv:2603.09835v1 Announce Type: new Abstract: Sequential multi-agent reasoning frameworks such as Chain-of-Agents (CoA) handle long-context queries by decomposing inputs into chunks and processing them sequentially using LLM-based worker agents that read from and update a bounded shared memory. From a...

1 min 1 month, 1 week ago

LOW Academic International

Benchmarking Political Persuasion Risks Across Frontier Large Language Models

arXiv:2603.09884v1 Announce Type: new Abstract: Concerns persist regarding the capacity of Large Language Models (LLMs) to sway political views. Although prior research has claimed that LLMs are not more persuasive than standard political campaign practices, the recent rise of frontier...

1 min 1 month, 1 week ago

LOW Academic International

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

arXiv:2603.09906v1 Announce Type: new Abstract: While reasoning in LLMs plays a natural role in math, code generation, and multi-hop factual questions, its effect on simple, single-hop factual questions remains unclear. Such questions do not require step-by-step logical decomposition, making the...

1 min 1 month, 1 week ago

LOW Academic International

CREATE: Testing LLMs for Associative Creativity

arXiv:2603.09970v1 Announce Type: new Abstract: A key component of creativity is associative reasoning: the ability to draw novel yet meaningful connections between concepts. We introduce CREATE, a benchmark designed to evaluate models' capacity for creative associative reasoning. CREATE requires models...

1 min 1 month, 1 week ago

LOW Academic International

VeriInteresting: An Empirical Study of Model Prompt Interactions in Verilog Code Generation

arXiv:2603.08715v1 Announce Type: cross Abstract: Rapid advances in language models (LMs) have created new opportunities for automated code generation while complicating trade-offs between model characteristics and prompt design choices. In this work, we provide an empirical map of recent trends...

1 min 1 month, 1 week ago

LOW Academic International

Self-hosted Lecture-to-Quiz: Local LLM MCQ Generation with Deterministic Quality Control

arXiv:2603.08729v1 Announce Type: cross Abstract: We present an end-to-end self-hosted (API-free) pipeline, where API-free means that lecture content is not sent to any external LLM service, that converts lecture PDFs into multiple-choice questions (MCQs) using a local LLM plus deterministic...

1 min 1 month, 1 week ago

LOW Academic International

PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration

arXiv:2603.08935v1 Announce Type: cross Abstract: Pathology underpins modern diagnosis and cancer care, yet its most valuable asset, the accumulated experience encoded in millions of narrative reports, remains largely inaccessible. Although institutions are rapidly digitizing pathology workflows, storing data without effective...

1 min 1 month, 1 week ago

LOW Academic International

Equitable Multi-Task Learning for AI-RANs

arXiv:2603.08717v1 Announce Type: new Abstract: AI-enabled Radio Access Networks (AI-RANs) are expected to serve heterogeneous users with time-varying learning tasks over shared edge resources. Ensuring equitable inference performance across these users requires adaptive and fair learning mechanisms. This paper introduces...

1 min 1 month, 1 week ago
