International Law

LOW Academic International

Logos: An evolvable reasoning engine for rational molecular design

arXiv:2603.09268v1 Announce Type: new Abstract: The discovery and design of functional molecules remain central challenges across chemistry,biology, and materials science. While recent advances in machine learning have accelerated molecular property prediction and candidate generation, existing models tend to excel either...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers

arXiv:2603.08879v1 Announce Type: new Abstract: Accessing sensitive patient data for machine learning is challenging due to privacy concerns. Datasets with annotations of personally identifiable information are crucial for developing and testing anonymization systems to enable safe data sharing that complies...

1 min 1 month, 1 week ago

ear

LOW Academic International

Reward Prediction with Factorized World States

arXiv:2603.09400v1 Announce Type: new Abstract: Agents must infer action outcomes and select actions that maximize a reward signal indicating how close the goal is to being reached. Supervised learning of reward models could introduce biases inherent to training data, limiting...

1 min 1 month, 1 week ago

ear

LOW Academic International

Telogenesis: Goal Is All U Need

arXiv:2603.09476v1 Announce Type: new Abstract: Goal-conditioned systems assume goals are provided externally. We ask whether attentional priorities can emerge endogenously from an agent's internal cognitive state. We propose a priority function that generates observation targets from three epistemic gaps: ignorance...

1 min 1 month, 1 week ago

ear

LOW Academic International

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

arXiv:2603.08869v1 Announce Type: new Abstract: Do the features learned by Sparse Autoencoders (SAEs) represent abstract meaning, or are they tied to how text is written? We investigate this question using Serbian digraphia as a controlled testbed: Serbian is written interchangeably...

1 min 1 month, 1 week ago

ear

LOW Academic International

TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation

arXiv:2603.09341v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) helps large language models (LLMs) answer knowledge-intensive and time-sensitive questions by conditioning generation on external evidence. However, most RAG systems still retrieve unstructured chunks and rely on one-shot generation, which often yields...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

arXiv:2603.09533v1 Announce Type: new Abstract: This study proposes a novel methodology for generating personalized fake news debunking messages by prompting Large Language Models (LLMs) with persona-based inputs aligned to the Big Five personality traits: Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness....

1 min 1 month, 1 week ago

ear

LOW Academic International

Social-R1: Towards Human-like Social Reasoning in LLMs

arXiv:2603.09249v1 Announce Type: new Abstract: While large language models demonstrate remarkable capabilities across numerous domains, social intelligence - the capacity to perceive social cues, infer mental states, and generate appropriate responses - remains a critical challenge, particularly for enabling effective...

1 min 1 month, 1 week ago

ear

LOW Academic International

Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning

arXiv:2603.08999v1 Announce Type: new Abstract: Large language models (LLMs) achieve strong reasoning performance through chain-of-thought (CoT) reasoning, yet often generate unnecessarily long reasoning paths that incur high inference cost. Recent self-consistency-based approaches further improve accuracy but require sampling and aggregating...

1 min 1 month, 1 week ago

ear

LOW Academic International

Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents

arXiv:2603.09203v1 Announce Type: new Abstract: Retrieval-augmented agents can query external evidence, yet their reliability in multi-step reasoning remains limited: noisy retrieval may derail multi-hop question answering, while outcome-only reinforcement learning provides credit signals that are too coarse to optimize intermediate...

1 min 1 month, 1 week ago

ear

LOW Academic International

Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search

arXiv:2603.08877v1 Announce Type: new Abstract: Agentic Retrieval-Augmented Generation (RAG) systems combine iterative search, planning prompts, and retrieval backends, but deployed settings impose explicit budgets on tool calls and completion tokens. We present a controlled measurement study of how search depth,...

1 min 1 month, 1 week ago

ear

LOW Academic International

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

arXiv:2603.09890v1 Announce Type: new Abstract: Large Language Models (LLMs) have emerged as a new paradigm for multi-agent systems. However, existing research on the behaviour of LLM-based multi-agents relies on ad hoc prompts and lacks a principled policy perspective. Different from...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

The Confidence Gate Theorem: When Should Ranked Decision Systems Abstain?

arXiv:2603.09947v1 Announce Type: new Abstract: Ranked decision systems -- recommenders, ad auctions, clinical triage queues -- must decide when to intervene in ranked outputs and when to abstain. We study when confidence-based abstention monotonically improves decision quality, and when it...

1 min 1 month, 1 week ago

ear

LOW Academic International

LCA: Local Classifier Alignment for Continual Learning

arXiv:2603.09888v1 Announce Type: new Abstract: A fundamental requirement for intelligent systems is the ability to learn continuously under changing environments. However, models trained in this regime often suffer from catastrophic forgetting. Leveraging pre-trained models has recently emerged as a promising...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Does the Question Really Matter? Training-Free Data Selection for Vision-Language SFT

arXiv:2603.09715v1 Announce Type: new Abstract: Visual instruction tuning is crucial for improving vision-language large models (VLLMs). However, many samples can be solved via linguistic patterns or common-sense shortcuts, without genuine cross-modal reasoning, limiting the effectiveness of multimodal learning. Prior data...

1 min 1 month, 1 week ago

ear

LOW Academic International

MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

arXiv:2603.09022v1 Announce Type: new Abstract: Multi-turn, multi-agent LLM game evaluations often exhibit substantial run-to-run variance. In long-horizon interactions, small early deviations compound across turns and are amplified by multi-agent coupling. This biases win rate estimates and makes rankings unreliable across...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression

arXiv:2603.09222v1 Announce Type: new Abstract: Efficient context compression is crucial for improving the accuracy and scalability of question answering. For the efficiency of Retrieval Augmented Generation, context should be delivered fast, compact, and precise to ensure clue sufficiency and budget-friendly...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Deep Tabular Research via Continual Experience-Driven Execution

arXiv:2603.09151v1 Announce Type: new Abstract: Large language models often struggle with complex long-horizon analytical tasks over unstructured tables, which typically feature hierarchical and bidirectional headers and non-canonical layouts. We formalize this challenge as Deep Tabular Research (DTR), requiring multi-step reasoning...

1 min 1 month, 1 week ago

ear

LOW Academic International

Context Engineering: From Prompts to Corporate Multi-Agent Architecture

arXiv:2603.09619v1 Announce Type: new Abstract: As artificial intelligence (AI) systems evolve from stateless chatbots to autonomous multi-step agents, prompt engineering (PE), the discipline of crafting individual queries, proves necessary but insufficient. This paper introduces context engineering (CE) as a standalone...

1 min 1 month, 1 week ago

ear

LOW Academic International

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

arXiv:2603.09215v1 Announce Type: new Abstract: Interleaved spoken language models (SLMs) alternately generate text and speech tokens, but decoding at full transformer depth for every step becomes costly, especially due to long speech sequences. We propose SPAR-K, a modality-aware early exit...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

The FABRIC Strategy for Verifying Neural Feedback Systems

arXiv:2603.08964v1 Announce Type: new Abstract: Forward reachability analysis is a dominant approach for verifying reach-avoid specifications in neural feedback systems, i.e., dynamical systems controlled by neural networks, and a number of directions have been proposed and studied. In contrast, far...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

arXiv:2603.08933v1 Announce Type: new Abstract: The first 72 hours of a missing-child investigation are critical for successful recovery. However, law enforcement agencies often face fragmented, unstructured data and a lack of dynamic, geospatial predictive tools. Our system, Guardian, provides an...

1 min 1 month, 1 week ago

ear

LOW Academic International

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

arXiv:2603.09678v1 Announce Type: new Abstract: Large language models achieve near-ceiling performance on code generation benchmarks, yet these results increasingly reflect memorization rather than genuine reasoning. We introduce EsoLang-Bench, a benchmark using five esoteric programming languages (Brainfuck, Befunge-98, Whitespace, Unlambda, and...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance

arXiv:2603.08989v1 Announce Type: new Abstract: Thematic analysis (TA) is widely used in health research to extract patterns from patient interviews, yet manual TA faces challenges in scalability and reproducibility. LLM-based automation can help, but existing approaches produce codebooks with limited...

1 min 1 month, 1 week ago

ear

LOW Academic International

MASEval: Extending Multi-Agent Evaluation from Models to Systems

arXiv:2603.08835v1 Announce Type: new Abstract: The rapid adoption of LLM-based agentic systems has produced a rich ecosystem of frameworks (smolagents, LangGraph, AutoGen, CAMEL, LlamaIndex, i.a.). Yet existing benchmarks are model-centric: they fix the agentic setup and do not compare other...

1 min 1 month, 1 week ago

ear

LOW Academic United States

PrivPRISM: Automatically Detecting Discrepancies Between Google Play Data Safety Declarations and Developer Privacy Policies

arXiv:2603.09214v1 Announce Type: new Abstract: End-users seldom read verbose privacy policies, leading app stores like Google Play to mandate simplified data safety declarations as a user-friendly alternative. However, these self-declared disclosures often contradict the full privacy policies, deceiving users about...

1 min 1 month, 1 week ago

ear

LOW Academic International

SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation

arXiv:2603.08910v1 Announce Type: new Abstract: We introduce SciTaRC, an expert-authored benchmark of questions about tabular data in scientific papers requiring both deep language reasoning and complex computation. We show that current state-of-the-art AI models fail on at least 23% of...

1 min 1 month, 1 week ago

itar

LOW Academic International

Chaotic Dynamics in Multi-LLM Deliberation

arXiv:2603.09127v1 Announce Type: new Abstract: Collective AI systems increasingly rely on multi-LLM deliberation, but their stability under repeated execution remains poorly characterized. We model five-agent LLM committees as random dynamical systems and quantify inter-run sensitivity using an empirical Lyapunov exponent...

1 min 1 month, 1 week ago

ear

LOW Academic International

ConFu: Contemplate the Future for Better Speculative Sampling

arXiv:2603.08899v1 Announce Type: new Abstract: Speculative decoding has emerged as a powerful approach to accelerate large language model (LLM) inference by employing lightweight draft models to propose candidate tokens that are subsequently verified by the target model. The effectiveness of...

1 min 1 month, 1 week ago

ear

LOW Academic International

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

arXiv:2603.08938v1 Announce Type: new Abstract: The rapid emergence of open-source, locally hosted intelligent agents marks a critical inflection point in human-computer interaction. Systems such as OpenClaw demonstrate that Large Language Model (LLM)-based agents can autonomously operate local computing environments, orchestrate...

1 min 1 month, 1 week ago

ear

Logos: An evolvable reasoning engine for rational molecular design

MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers

Reward Prediction with Factorized World States

Telogenesis: Goal Is All U Need

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

Social-R1: Towards Human-like Social Reasoning in LLMs

Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning

Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents

Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

The Confidence Gate Theorem: When Should Ranked Decision Systems Abstain?

LCA: Local Classifier Alignment for Continual Learning

Does the Question Really Matter? Training-Free Data Selection for Vision-Language SFT

MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression

Deep Tabular Research via Continual Experience-Driven Execution

Context Engineering: From Prompts to Corporate Multi-Agent Architecture

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

The FABRIC Strategy for Verifying Neural Feedback Systems

Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance

MASEval: Extending Multi-Agent Evaluation from Models to Systems

PrivPRISM: Automatically Detecting Discrepancies Between Google Play Data Safety Declarations and Developer Privacy Policies

SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation

Chaotic Dynamics in Multi-LLM Deliberation

ConFu: Contemplate the Future for Better Speculative Sampling

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.