International Law

LOW Academic United States

TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation

arXiv:2603.00025v1 Announce Type: new Abstract: Direct Preference Optimization is an offline post-SFT method for aligning language models from preference pairs, with strong results in instruction following and summarization. However, DPO's sequence-level implicit reward can be brittle for token-critical structured prediction...

1 min 1 month, 2 weeks ago


LOW Academic International

Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

arXiv:2603.00077v1 Announce Type: new Abstract: Rubric-based evaluation with large language models (LLMs) has become standard practice for assessing text generation at scale, yet the underlying techniques are scattered across papers with inconsistent terminology and partial solutions. We present a unified...

1 min 1 month, 2 weeks ago


LOW Academic International

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

arXiv:2603.00296v1 Announce Type: new Abstract: Large reasoning models improve with more test-time computation, but often overthink, producing unnecessarily long chains-of-thought that raise cost without improving accuracy. Prior reinforcement learning approaches typically rely on a single outcome reward with trajectory-level length...

1 min 1 month, 2 weeks ago


LOW Academic International

When Metrics Disagree: Automatic Similarity vs. LLM-as-a-Judge for Clinical Dialogue Evaluation

arXiv:2603.00314v1 Announce Type: new Abstract: This paper details the baseline model selection, fine-tuning process, evaluation methods, and the implications of deploying more accurate LLMs in healthcare settings. As large language models (LLMs) are increasingly employed to address diverse problems, including...

1 min 1 month, 2 weeks ago


LOW Academic United States

Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving

arXiv:2603.02214v1 Announce Type: new Abstract: Federated Inference (FI) studies how independently trained and privately owned models can collaborate at inference time without sharing data or model parameters. While recent work has explored secure and distributed inference from disparate perspectives, a...

1 min 1 month, 2 weeks ago


LOW Academic United States

SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

arXiv:2603.02240v1 Announce Type: new Abstract: We present SuperLocalMemory, a local-first memory system for multi-agent AI that defends against OWASP ASI06 memory poisoning through architectural isolation and Bayesian trust scoring, while personalizing retrieval through adaptive learning-to-rank -- all without cloud dependencies...

1 min 1 month, 2 weeks ago


LOW Academic International

Estimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach

arXiv:2603.02359v1 Announce Type: new Abstract: Digital advertising increasingly relies on visual content, yet marketers lack rigorous methods for understanding how specific visual attributes causally affect consumer engagement. This paper addresses a fundamental methodological challenge: estimating causal effects when the treatment,...

1 min 1 month, 2 weeks ago


LOW Academic European Union

COOL-MC: Verifying and Explaining RL Policies for Platelet Inventory Management

arXiv:2603.02396v1 Announce Type: new Abstract: Platelets expire within five days. Blood banks face uncertain daily demand and must balance ordering decisions between costly wastage from overstocking and life-threatening shortages from understocking. Reinforcement learning (RL) can learn effective ordering policies for...

1 min 1 month, 2 weeks ago


LOW Academic International

VL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings

arXiv:2603.02435v1 Announce Type: new Abstract: Real-world multimodal knowledge graphs (MKGs) are inherently heterogeneous, modeling entities that are associated with diverse modalities. Traditional knowledge graph embedding (KGE) methods excel at learning continuous representations of entities and relations, yet they are typically...

1 min 1 month, 2 weeks ago


LOW Academic International

Diagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory

arXiv:2603.02473v1 Announce Type: new Abstract: Memory-augmented LLM agents store and retrieve information from prior interactions, yet the relative importance of how memories are written versus how they are retrieved remains unclear. We introduce a diagnostic framework that analyzes how performance...

1 min 1 month, 2 weeks ago


LOW Academic United States

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

arXiv:2603.02504v1 Announce Type: new Abstract: Large Language Models (LLMs) achieve strong performance on natural language tasks but remain unreliable in mathematical reasoning, frequently generating fluent yet logically inconsistent solutions. We present NeuroProlog, a neurosymbolic framework that ensures verifiable reasoning by...

1 min 1 month, 2 weeks ago


LOW Academic International

LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model

arXiv:2603.02528v1 Announce Type: new Abstract: Accurate classification of autonomous vehicle (AV) driving behaviors is critical for safety validation, performance diagnosis, and traffic integration analysis. However, existing approaches primarily rely on numerical time-series modeling and often lack semantic abstraction, limiting interpretability...

1 min 1 month, 2 weeks ago


LOW Academic European Union

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

arXiv:2603.02540v1 Announce Type: new Abstract: Large language models (LLMs) exhibit a unified "general factor" of capability across 10 benchmarks, a finding confirmed by our factor analysis of 156 models, yet they still struggle with simple, trivial tasks for humans. This...

1 min 1 month, 2 weeks ago


LOW Academic International

LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization

arXiv:2603.02680v1 Announce Type: new Abstract: While Large Language Models (LLMs) form the cornerstone of sequential decision-making agent development, they have inherent limitations in high-frequency decision tasks. Existing research mainly focuses on discrete embodied decision scenarios with low-frequency and significant semantic...

1 min 1 month, 2 weeks ago


LOW Academic International

Retrieval-Augmented Robots via Retrieve-Reason-Act

arXiv:2603.02688v1 Announce Type: new Abstract: To achieve general-purpose utility, we argue that robots must evolve from passive executors into active Information Retrieval users. In strictly zero-shot settings where no prior demonstrations exist, robots face a critical information gap, such as...

1 min 1 month, 2 weeks ago


LOW Academic International

A Natural Language Agentic Approach to Study Affective Polarization

arXiv:2603.02711v1 Announce Type: new Abstract: Affective polarization has been central to political and social studies, with growing focus on social media, where partisan divisions are often exacerbated. Real-world studies tend to have limited scope, while simulated studies suffer from insufficient...

1 min 1 month, 2 weeks ago


LOW Academic United States

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

arXiv:2603.02766v1 Announce Type: new Abstract: Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this through agent skills: reusable workflows, and code,...

1 min 1 month, 2 weeks ago


LOW Academic International

Rethinking Code Similarity for Automated Algorithm Design with LLMs

arXiv:2603.02787v1 Announce Type: new Abstract: The rise of Large Language Model-based Automated Algorithm Design (LLM-AAD) has transformed algorithm development by autonomously generating code implementations of expert-level algorithms. Unlike traditional expert-driven algorithm development, in the LLM-AAD paradigm, the main design principle...

1 min 1 month, 2 weeks ago


LOW Academic International

LLM-based Argument Mining meets Argumentation and Description Logics: a Unified Framework for Reasoning about Debates

arXiv:2603.02858v1 Announce Type: new Abstract: Large Language Models (LLMs) achieve strong performance in analyzing and generating text, yet they struggle with explicit, transparent, and verifiable reasoning over complex texts such as those containing debates. In particular, they lack structured representations...

1 min 1 month, 2 weeks ago


LOW Academic United States

Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures

arXiv:2603.02874v1 Announce Type: new Abstract: Transformers excel at in-context retrieval but suffer from quadratic complexity with sequence length, while State Space Models (SSMs) offer efficient linear-time processing but have limited retrieval capabilities. We investigate whether hybrid architectures combining Transformers and...

1 min 1 month, 2 weeks ago


LOW Academic International

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

arXiv:2603.02908v1 Announce Type: new Abstract: In recent years, pre-trained large language models have achieved remarkable success across diverse tasks. Besides the pivotal role of self-supervised pre-training, their effectiveness in downstream applications also depends critically on the post-training process, which adapts...

1 min 1 month, 2 weeks ago


LOW Academic International

ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization

arXiv:2603.02939v1 Announce Type: new Abstract: Recent advancements in reinforcement fine-tuning have significantly improved the reasoning ability of large language models (LLMs). In particular, methods such as group relative policy optimization (GRPO) have demonstrated strong capabilities across various fields. However, applying...

1 min 1 month, 2 weeks ago


LOW Academic International

Architecting Trust in Artificial Epistemic Agents

arXiv:2603.02960v1 Announce Type: new Abstract: Large language models increasingly function as epistemic agents -- entities that can 1) autonomously pursue epistemic goals and 2) actively shape our shared knowledge environment. They curate the information we receive, often supplanting traditional search-based...

1 min 1 month, 2 weeks ago


LOW Academic International

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

arXiv:2603.03005v1 Announce Type: new Abstract: Multi-agent large language model frameworks are promising for complex multi step reasoning, yet existing systems remain weak for scientific and knowledge intensive domains due to static prompts and agent roles, rigid workflows, and homogeneous model...

1 min 1 month, 2 weeks ago


LOW Academic International

REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

arXiv:2603.03018v1 Announce Type: new Abstract: Enterprise engineering organizations produce high-volume, heterogeneous telemetry from version control systems, CI/CD pipelines, issue trackers, and observability platforms. Large Language Models (LLMs) enable new forms of agentic automation, but grounding such agents on private telemetry...

1 min 1 month, 2 weeks ago


LOW Academic International

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

arXiv:2603.03072v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to assist scientists across diverse workflows. A key challenge is generating high-quality figures from textual descriptions, often represented as TikZ programs that can be rendered as scientific images....

1 min 1 month, 2 weeks ago


LOW Academic International

RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

arXiv:2603.03078v1 Announce Type: new Abstract: Agentic Reinforcement Learning (Agentic RL) has shown remarkable potential in large language model-based (LLM) agents. These works can empower LLM agents to tackle complex tasks via multi-step, tool-integrated reasoning. However, an inherent limitation of existing...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Odin: Multi-Signal Graph Intelligence for Autonomous Discovery in Knowledge Graphs

arXiv:2603.03097v1 Announce Type: new Abstract: We present Odin, the first production-deployed graph intelligence engine for autonomous discovery of meaningful patterns in knowledge graphs without prior specification. Unlike retrieval-based systems that answer predefined queries, Odin guides exploration through the COMPASS (Composite...

1 min 1 month, 2 weeks ago


LOW Academic European Union

FEAST: Retrieval-Augmented Multi-Hierarchical Food Classification for the FoodEx2 System

arXiv:2603.03176v1 Announce Type: new Abstract: Hierarchical text classification (HTC) and extreme multi-label classification (XML) tasks face compounded challenges from complex label interdependencies, data sparsity, and extreme output dimensions. These challenges are exemplified in the European Food Safety Authority's FoodEx2 system-a...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Neuro-Symbolic Artificial Intelligence: A Task-Directed Survey in the Black-Box Models Era

arXiv:2603.03177v1 Announce Type: new Abstract: The integration of symbolic computing with neural networks has intrigued researchers since the first theorizations of Artificial intelligence (AI). The ability of Neuro-Symbolic (NeSy) methods to infer or exploit behavioral schema has been widely considered...

1 min 1 month, 2 weeks ago



