Tax Law

LOW Academic European Union

SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models

arXiv:2603.03002v1 Announce Type: new Abstract: Genuine spatial reasoning relies on the capacity to construct and manipulate coherent internal spatial representations, often conceptualized as mental models, rather than merely processing surface linguistic associations. While large language models exhibit advanced capabilities across...

1 min 1 month, 1 week ago

deduction

LOW Academic International

REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

arXiv:2603.03018v1 Announce Type: new Abstract: Enterprise engineering organizations produce high-volume, heterogeneous telemetry from version control systems, CI/CD pipelines, issue trackers, and observability platforms. Large Language Models (LLMs) enable new forms of agentic automation, but grounding such agents on private telemetry...

1 min 1 month, 1 week ago

vat

LOW Academic International

Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

arXiv:2603.03116v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents are increasingly adopted in high-stakes settings, but current benchmarks evaluate mainly whether a task was completed, not how. We introduce Procedure-Aware Evaluation (PAE), a framework that formalizes agent procedures as...

1 min 1 month, 1 week ago

vat

LOW Academic European Union

FEAST: Retrieval-Augmented Multi-Hierarchical Food Classification for the FoodEx2 System

arXiv:2603.03176v1 Announce Type: new Abstract: Hierarchical text classification (HTC) and extreme multi-label classification (XML) tasks face compounded challenges from complex label interdependencies, data sparsity, and extreme output dimensions. These challenges are exemplified in the European Food Safety Authority's FoodEx2 system, a...

1 min 1 month, 1 week ago

tax

LOW Academic International

Generative AI in Managerial Decision-Making: Redefining Boundaries through Ambiguity Resolution and Sycophancy Analysis

arXiv:2603.03970v1 Announce Type: new Abstract: Generative artificial intelligence is increasingly being integrated into complex business workflows, fundamentally shifting the boundaries of managerial decision-making. However, the reliability of its strategic advice in ambiguous business contexts remains a critical knowledge gap. This...

1 min 1 month, 1 week ago

tax

LOW Academic International

Phi-4-reasoning-vision-15B Technical Report

arXiv:2603.03975v1 Announce Type: new Abstract: We present Phi-4-reasoning-vision-15B, a compact open-weight multimodal reasoning model, and share the motivations, design choices, experiments, and learnings that informed its development. Our goal is to contribute practical insight to the research community on building...

1 min 1 month, 1 week ago

vat

LOW Academic International

Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography

arXiv:2603.04457v1 Announce Type: new Abstract: The fundamental topology of manufacturing has not undergone a paradigm-level transformation since Henry Ford's moving assembly line in 1913. Every major innovation of the past century, from the Toyota Production System to Industry 4.0, has...

1 min 1 month, 1 week ago

vat

LOW Academic International

Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding

arXiv:2603.04514v1 Announce Type: new Abstract: Diffusion language models generate text through iterative denoising under a uniform refinement rule applied to all tokens. However, tokens stabilize at different rates in practice, leading to substantial redundant refinement and motivating refinement control over...

1 min 1 month, 1 week ago

vat

LOW Academic European Union

Discovering mathematical concepts through a multi-agent system

arXiv:2603.04528v1 Announce Type: new Abstract: Mathematical concepts emerge through an interplay of processes, including experimentation, efforts at proof, and counterexamples. In this paper, we present a new multi-agent model for computational mathematical discovery based on this observation. Our system, conceived...

1 min 1 month, 1 week ago

vat

LOW Academic International

Adaptive Memory Admission Control for LLM Agents

arXiv:2603.04549v1 Announce Type: new Abstract: LLM-based agents increasingly rely on long-term memory to support multi-session reasoning and interaction, yet current systems provide little control over what information is retained. In practice, agents either accumulate large volumes of conversational content, including...

1 min 1 month, 1 week ago

audit

LOW Academic International

Towards automated data analysis: A guided framework for LLM-based risk estimation

arXiv:2603.04631v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly integrated into critical decision-making pipelines, a trend that raises the demand for robust and automated data analysis. Current approaches to dataset risk analysis are limited to manual auditing methods...

1 min 1 month, 1 week ago

audit

LOW Academic United States

From Offline to Periodic Adaptation for Pose-Based Shoplifting Detection in Real-world Retail Security

arXiv:2603.04723v1 Announce Type: new Abstract: Shoplifting is a growing operational and economic challenge for retailers, with incidents rising and losses increasing despite extensive video surveillance. Continuous human monitoring is infeasible, motivating automated, privacy-preserving, and resource-aware detection solutions. In this paper,...

1 min 1 month, 1 week ago

vat

LOW Academic European Union

Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery

arXiv:2603.04735v1 Announce Type: new Abstract: This paper demonstrates that artificial intelligence can accelerate mathematical discovery by autonomously solving an open problem in theoretical physics. We present a neuro-symbolic system, combining the Gemini Deep Think large language model with a systematic...

1 min 1 month, 1 week ago

vat

LOW Academic United States

MOOSEnger -- a Domain-Specific AI Agent for the MOOSE Ecosystem

arXiv:2603.04756v1 Announce Type: new Abstract: MOOSEnger is a tool-enabled AI agent tailored to the Multiphysics Object-Oriented Simulation Environment (MOOSE). MOOSE cases are specified in HIT ".i" input files; the large object catalog and strict syntax make initial setup and debugging...

1 min 1 month, 1 week ago

tax

LOW Academic International

Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

arXiv:2603.04791v1 Announce Type: new Abstract: We introduce Timer-S1, a strong Mixture-of-Experts (MoE) time series foundation model with 8.3B total parameters, 0.75B activated parameters for each token, and a context length of 11.5K. To overcome the scalability bottleneck in existing pre-trained...

1 min 1 month, 1 week ago

vat

LOW Academic International

LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks

arXiv:2603.04818v1 Announce Type: new Abstract: Port congestion at major maritime hubs disrupts global supply chains, yet existing prediction systems typically prioritize forecasting accuracy without providing operationally interpretable explanations. This paper proposes AIS-TGNN, an evidence-grounded framework that jointly performs congestion-escalation prediction...

1 min 1 month, 1 week ago

audit

LOW Academic International

SEA-TS: Self-Evolving Agent for Autonomous Code Generation of Time Series Forecasting Algorithms

arXiv:2603.04873v1 Announce Type: new Abstract: Accurate time series forecasting underpins decision-making across domains, yet conventional ML development suffers from data scarcity in new deployments, poor adaptability under distribution shift, and diminishing returns from manual iteration. We propose Self-Evolving Agent for...

1 min 1 month, 1 week ago

vat

LOW Academic International

Differentially Private Multimodal In-Context Learning

arXiv:2603.04894v1 Announce Type: new Abstract: Vision-language models are increasingly applied to sensitive domains such as medical imaging and personal photographs, yet existing differentially private methods for in-context learning are limited to few-shot, text-only settings because privacy cost scales with the...

1 min 1 month, 1 week ago

vat

LOW Academic International

Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems

arXiv:2603.04904v1 Announce Type: new Abstract: In perpetrator treatment, a recurring observation is the dissociation between insight and action: offenders articulate remorse yet behavioral change does not follow. We report four preregistered studies (1,584 multi-agent simulations across 16 languages and three...

1 min 1 month, 1 week ago

vat

LOW Academic United States

Rethinking Representativeness and Diversity in Dynamic Data Selection

arXiv:2603.04981v1 Announce Type: new Abstract: Dynamic data selection accelerates training by sampling a changing subset of the dataset while preserving accuracy. We rethink two core notions underlying sample evaluation: representativeness and diversity. Instead of local geometric centrality, we define representativeness...

1 min 1 month, 1 week ago

vat

LOW Academic United States

S5-SHB Agent: Society 5.0 enabled Multi-model Agentic Blockchain Framework for Smart Home

arXiv:2603.05027v1 Announce Type: new Abstract: The smart home is a key application domain within the Society 5.0 vision for a human-centered society. As smart home ecosystems expand with heterogeneous IoT protocols, diverse devices, and evolving threats, autonomous systems must manage...

1 min 1 month, 1 week ago

audit

LOW Academic United States

Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

arXiv:2603.05028v1 Announce Type: new Abstract: As Large Language Models (LLMs) evolve from chatbots to agentic assistants, they are increasingly observed to exhibit risky behaviors when subjected to survival pressure, such as the threat of being shut down. While multiple cases...

1 min 1 month, 1 week ago

vat

LOW Academic United States

AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems

arXiv:2603.05031v1 Announce Type: new Abstract: AI agents that build user interfaces on the fly, assembling buttons, forms, and data displays from structured protocol payloads, are becoming common in production systems. The trouble is that a payload can pass every schema...

1 min 1 month, 1 week ago

tax

LOW Academic International

Semantic Containment as a Fundamental Property of Emergent Misalignment

arXiv:2603.04407v1 Announce Type: new Abstract: Fine-tuning language models on narrowly harmful data causes emergent misalignment (EM) -- behavioral failures extending far beyond training distributions. Recent work demonstrates compartmentalization of misalignment behind contextual triggers, but these experiments mixed 97% benign data...

1 min 1 month, 1 week ago

tax

LOW Academic International

The Thinking Boundary: Quantifying Reasoning Suitability of Multimodal Tasks via Dual Tuning

arXiv:2603.04415v1 Announce Type: new Abstract: While reasoning-enhanced Large Language Models (LLMs) have demonstrated remarkable advances in complex tasks such as mathematics and coding, their effectiveness across universal multimodal scenarios remains uncertain. The trend of releasing parallel "Instruct" and "Thinking" models...

1 min 1 month, 1 week ago

vat

LOW Academic International

From Static Inference to Dynamic Interaction: Navigating the Landscape of Streaming Large Language Models

arXiv:2603.04592v1 Announce Type: new Abstract: Standard Large Language Models (LLMs) are predominantly designed for static inference with pre-defined inputs, which limits their applicability in dynamic, real-time scenarios. To address this gap, the streaming LLM paradigm has emerged. However, existing definitions...

1 min 1 month, 1 week ago

tax

LOW Academic International

iAgentBench: Benchmarking Sensemaking Capabilities of Information-Seeking Agents on High-Traffic Topics

arXiv:2603.04656v1 Announce Type: new Abstract: With the emergence of search-enabled generative QA systems, users are increasingly turning to tools that browse, aggregate, and reconcile evidence across multiple sources on their behalf. Yet many widely used QA benchmarks remain answerable by...

1 min 1 month, 1 week ago

audit

LOW Academic International

Stan: An LLM-based thermodynamics course assistant

arXiv:2603.04657v1 Announce Type: new Abstract: Discussions of AI in education focus predominantly on student-facing tools -- chatbots, tutors, and problem generators -- while the potential for the same infrastructure to support instructors remains largely unexplored. We describe Stan, a suite...

1 min 1 month, 1 week ago

vat

LOW Academic International

TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

arXiv:2603.04772v1 Announce Type: new Abstract: Despite the exceptional reasoning capabilities of Multimodal Large Language Models (MLLMs), their adaptation into universal embedding models is significantly impeded by task conflict. To address this, we propose TSEmbed, a universal multimodal embedding framework that...

1 min 1 month, 1 week ago

vat

LOW Academic European Union

From Unfamiliar to Familiar: Detecting Pre-training Data via Gradient Deviations in Large Language Models

arXiv:2603.04828v1 Announce Type: new Abstract: Pre-training data detection for LLMs is essential for addressing copyright concerns and mitigating benchmark contamination. Existing methods mainly focus on the likelihood-based statistical features or heuristic signals before and after fine-tuning, but the former are...

1 min 1 month, 1 week ago

vat

