International Law

LOW Academic International

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

arXiv:2604.05185v1 Announce Type: new Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning through simulated rollouts. In offline settings with hidden confounding, however, models learned directly from observational...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often...

1 min 1 week, 3 days ago

ear

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Expectation Maximization (EM) Converges for General Agnostic Mixtures

arXiv:2604.05842v1 Announce Type: new Abstract: Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using $k$ linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground...

1 min 1 week, 3 days ago

ear

LOW Academic International

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

arXiv:2604.05557v1 Announce Type: new Abstract: Scientific research follows multi-turn, multi-step workflows that require proactively searching the literature, consulting figures and tables, and integrating evidence across papers to align experimental settings and support reproducible conclusions. This joint capability is not systematically...

1 min 1 week, 3 days ago

ear

LOW Academic International

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

arXiv:2604.05540v1 Announce Type: new Abstract: Large language models (LLMs) can effectively handle outdated information through knowledge editing. However, current approaches face two key limitations: (I) Poor generalization: Most approaches rigidly inject new knowledge without ensuring that the model can use...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Operational Noncommutativity in Sequential Metacognitive Judgments

arXiv:2604.04938v1 Announce Type: new Abstract: Metacognition, understood as the monitoring and regulation of one's own cognitive processes, is inherently sequential: an agent evaluates an internal state, updates it, and may then re-evaluate under modified criteria. Order effects in cognition are...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter systems

arXiv:2604.05187v1 Announce Type: new Abstract: We propose an extended Fourier neural operator (FNO) architecture for learning state and linear quadratic additive optimal control of systems governed by partial differential equations. Using the Ehrenpreis-Palamodov fundamental principle, we show that any state...

1 min 1 week, 3 days ago

ear

LOW Academic International

Controllable Image Generation with Composed Parallel Token Prediction

arXiv:2604.05730v1 Announce Type: new Abstract: Conditional discrete generative models struggle to faithfully compose multiple input conditions. To address this, we derive a theoretically-grounded formulation for composing discrete probabilistic generative processes, with masked generation (absorbing diffusion) as a special case. Our...

1 min 1 week, 3 days ago

ear

LOW Academic International

OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward

arXiv:2604.05514v1 Announce Type: new Abstract: The paradigm of programmable diagram generation is evolving rapidly, playing a crucial role in structured visualization. However, most existing studies are confined to a narrow range of task formulations and language support, constraining their applicability...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

El Nino Prediction Based on Weather Forecast and Geographical Time-series Data

arXiv:2604.04998v1 Announce Type: new Abstract: This paper proposes a novel framework for enhancing the prediction accuracy and lead time of El Ni\~no events, crucial for mitigating their global climatic, economic, and societal impacts. Traditional prediction models often rely on oceanic...

1 min 1 week, 3 days ago

ear

LOW Academic International

Dynamic Agentic AI Expert Profiler System Architecture for Multidomain Intelligence Modeling

arXiv:2604.05345v1 Announce Type: new Abstract: In today's artificial intelligence driven world, modern systems communicate with people from diverse backgrounds and skill levels. For human-machine interaction to be meaningful, systems must be aware of context and user expertise. This study proposes...

1 min 1 week, 3 days ago

ear

LOW Academic International

HYVE: Hybrid Views for LLM Context Engineering over Machine Data

arXiv:2604.05400v1 Announce Type: new Abstract: Machine data is central to observability and diagnosis in modern computing systems, appearing in logs, metrics, telemetry traces, and configuration snapshots. When provided to large language models (LLMs), this data typically arrives as a mixture...

1 min 1 week, 3 days ago

ear

LOW Academic International

Beneath the Surface: Investigating LLMs' Capabilities for Communicating with Subtext

arXiv:2604.05273v1 Announce Type: new Abstract: Human communication is fundamentally creative, and often makes use of subtext -- implied meaning that goes beyond the literal content of the text. Here, we systematically study whether language models can use subtext in communicative...

1 min 1 week, 3 days ago

ear

LOW Academic United States

AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

arXiv:2604.05550v1 Announce Type: new Abstract: Artificial intelligence research increasingly depends on prolonged cycles of reproduction, debugging, and iterative refinement to achieve State-Of-The-Art (SOTA) performance, creating a growing need for systems that can accelerate the full pipeline of empirical model optimization....

1 min 1 week, 3 days ago

ear

LOW Academic European Union

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

arXiv:2604.05923v1 Announce Type: new Abstract: State space models (SSMs) have been shown to possess the theoretical capacity to model both star-free sequential tasks and bounded hierarchical structures Sarrof et al. (2024). However, formal expressivity results do not guarantee that gradient-based...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization

arXiv:2604.05042v1 Announce Type: new Abstract: Recent advances at the intersection of control theory, neuroscience, and machine learning have revealed novel mechanisms by which dynamical systems perform computation. These advances encompass a wide range of conceptual, mathematical, and computational ideas, with...

1 min 1 week, 3 days ago

ear

LOW Academic International

SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

arXiv:2604.05135v1 Announce Type: new Abstract: We introduce SenseAI, a human-in-the-loop (HITL) validated financial sentiment dataset designed to capture not only model outputs but the full reasoning process behind them. Unlike existing resources, SenseAI incorporates reasoning chains, confidence scores, human correction...

1 min 1 week, 3 days ago

ear

LOW Academic International

MedLayBench-V: A Large-Scale Benchmark for Expert-Lay Semantic Alignment in Medical Vision Language Models

arXiv:2604.05738v1 Announce Type: new Abstract: Medical Vision-Language Models (Med-VLMs) have achieved expert-level proficiency in interpreting diagnostic imaging. However, current models are predominantly trained on professional literature, limiting their ability to communicate findings in the lay register required for patient-centered care....

1 min 1 week, 3 days ago

ear

LOW Academic International

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv:2604.05064v1 Announce Type: new Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume static correlations, and are typically missing realistic inter-channel dependencies. We introduce DynLMC, a Dynamic Linear Model of Coregionalization, that...

1 min 1 week, 3 days ago

ear

LOW Academic United States

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

arXiv:2604.05655v1 Announce Type: new Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through representation space. We show that mathematical reasoning traverses functionally ordered, step-specific subspaces that become increasingly separable with layer depth. This structure already...

1 min 1 week, 3 days ago

ear

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 3 days ago

ear

LOW Academic International

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

arXiv:2604.05438v1 Announce Type: new Abstract: Long-context generation is increasingly limited by decode-time key-value (KV) cache traffic, particularly when KV is offloaded beyond GPU memory. Query-aware retrieval (e.g., Top-K selection) reduces this traffic by loading only a subset of KV pairs,...

1 min 1 week, 3 days ago

ear

LOW Academic International

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

arXiv:2604.05414v1 Announce Type: new Abstract: Recent work has shown that removing orthogonalization during training and applying it only at inference improves rotation estimation in deep learning, with empirical evidence favoring 9D representations with SVD projection. However, the theoretical understanding of...

1 min 1 week, 3 days ago

ear

LOW Academic International

Optimal-Transport-Guided Functional Flow Matching for Turbulent Field Generation in Hilbert Space

arXiv:2604.05700v1 Announce Type: new Abstract: High-fidelity modeling of turbulent flows requires capturing complex spatiotemporal dynamics and multi-scale intermittency, posing a fundamental challenge for traditional knowledge-based systems. While deep generative models, such as diffusion models and Flow Matching, have shown promising...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Graph Topology Information Enhanced Heterogeneous Graph Representation Learning

arXiv:2604.05732v1 Announce Type: new Abstract: Real-world heterogeneous graphs are inherently noisy and usually not in the optimal graph structures for downstream tasks, which often adversely affects the performance of GRL models in downstream tasks. Although Graph Structure Learning (GSL) methods...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

arXiv:2604.05476v1 Announce Type: new Abstract: This work investigates the adaptation of the AlphaZero reinforcement learning algorithm to Tablut, an asymmetric historical board game featuring unequal piece counts and distinct player objectives (king capture versus king escape). While the original AlphaZero...

1 min 1 week, 3 days ago

ear

LOW Academic International

Improving Sparse Memory Finetuning

arXiv:2604.05248v1 Announce Type: new Abstract: Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading existing capabilities. Standard approaches to updating models, like full finetuning or parameter-efficient methods (e.g., LoRA),...

1 min 1 week, 3 days ago

ear

LOW Academic International

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

arXiv:2604.05018v1 Announce Type: new Abstract: Synthesizing unstructured research materials into manuscripts is an essential yet under-explored challenge in AI-driven scientific discovery. Existing autonomous writers are rigidly coupled to specific experimental pipelines, and produce superficial literature reviews. We introduce PaperOrchestra, a...

1 min 1 week, 3 days ago

ear

LOW Academic International

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

arXiv:2604.05116v1 Announce Type: new Abstract: Clinical diagnosis requires sequential evidence acquisition under uncertainty. However, most Large Language Model (LLM) based diagnostic systems assume fully observed patient information and therefore do not explicitly model how clinical evidence should be sequentially acquired...

1 min 1 week, 3 days ago

ear

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

Expectation Maximization (EM) Converges for General Agnostic Mixtures

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

Operational Noncommutativity in Sequential Metacognitive Judgments

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter systems

Controllable Image Generation with Composed Parallel Token Prediction

OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward

El Nino Prediction Based on Weather Forecast and Geographical Time-series Data

Dynamic Agentic AI Expert Profiler System Architecture for Multidomain Intelligence Modeling

HYVE: Hybrid Views for LLM Context Engineering over Machine Data

Beneath the Surface: Investigating LLMs' Capabilities for Communicating with Subtext

AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization

SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

MedLayBench-V: A Large-Scale Benchmark for Expert-Lay Semantic Alignment in Medical Vision Language Models

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

Optimal-Transport-Guided Functional Flow Matching for Turbulent Field Generation in Hilbert Space

Graph Topology Information Enhanced Heterogeneous Graph Representation Learning

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

Improving Sparse Memory Finetuning

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.