International Law

LOW Academic International

RAG or Learning? Understanding the Limits of LLM Adaptation under Continuous Knowledge Drift in the Real World

arXiv:2604.05096v1 Announce Type: new Abstract: Large language models (LLMs) acquire most of their knowledge during pretraining, which ties them to a fixed snapshot of the world and makes adaptation to continuously evolving knowledge challenging. As facts, entities, and events change...

1 min 1 week, 3 days ago

ear

LOW Academic International

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

arXiv:2604.04999v1 Announce Type: new Abstract: Multimodal self-supervised pretraining offers a promising route to cancer prognosis by integrating histopathology whole-slide images, gene expression, and pathology reports, yet most existing approaches require fully paired and complete inputs. In practice, clinical cohorts are...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Dialogue Act Patterns in GenAI-Mediated L2 Oral Practice: A Sequential Analysis of Learner-Chatbot Interactions

arXiv:2604.05702v1 Announce Type: new Abstract: While generative AI (GenAI) voice chatbots offer scalable opportunities for second language (L2) oral practice, the interactional processes related to learners' gains remain underexplored. This study investigates dialogue act (DA) patterns in interactions between Grade...

1 min 1 week, 3 days ago

ear

LOW Academic International

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

arXiv:2604.05185v1 Announce Type: new Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning through simulated rollouts. In offline settings with hidden confounding, however, models learned directly from observational...

1 min 1 week, 3 days ago

ear

LOW Law Review United States

Shadow Derivatives: The Quiet Propertization of AI Learning

Introduction Artificial intelligence (AI) systems learn. In today’s AI markets, durable advantage comes less from any single output than from the learning that accumulates through training, fine-tuning, and downstream feedback loops.[1] Each interaction, correction, and deployment contributes incrementally to improved...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Expectation Maximization (EM) Converges for General Agnostic Mixtures

arXiv:2604.05842v1 Announce Type: new Abstract: Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using $k$ linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground...

1 min 1 week, 3 days ago

ear

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 1 week, 3 days ago

ear

LOW Academic International

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: new Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-context and long-generation regimes. Architectures such as multi-head latent attention (MLA) and hybrid sliding-window attention (SWA) can alleviate this bound, but...

1 min 1 week, 3 days ago

ear

LOW Academic International

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

arXiv:2604.05438v1 Announce Type: new Abstract: Long-context generation is increasingly limited by decode-time key-value (KV) cache traffic, particularly when KV is offloaded beyond GPU memory. Query-aware retrieval (e.g., Top-K selection) reduces this traffic by loading only a subset of KV pairs,...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

arXiv:2604.05476v1 Announce Type: new Abstract: This work investigates the adaptation of the AlphaZero reinforcement learning algorithm to Tablut, an asymmetric historical board game featuring unequal piece counts and distinct player objectives (king capture versus king escape). While the original AlphaZero...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

arXiv:2604.05923v1 Announce Type: new Abstract: State space models (SSMs) have been shown to possess the theoretical capacity to model both star-free sequential tasks and bounded hierarchical structures Sarrof et al. (2024). However, formal expressivity results do not guarantee that gradient-based...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization

arXiv:2604.05042v1 Announce Type: new Abstract: Recent advances at the intersection of control theory, neuroscience, and machine learning have revealed novel mechanisms by which dynamical systems perform computation. These advances encompass a wide range of conceptual, mathematical, and computational ideas, with...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often...

1 min 1 week, 3 days ago

ear

LOW Academic International

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv:2604.05064v1 Announce Type: new Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume static correlations, and are typically missing realistic inter-channel dependencies. We introduce DynLMC, a Dynamic Linear Model of Coregionalization, that...

1 min 1 week, 3 days ago

ear

LOW Academic International

Learning Stable Predictors from Weak Supervision under Distribution Shift

arXiv:2604.05002v1 Announce Type: new Abstract: Learning from weak or proxy supervision is common when ground-truth labels are unavailable, yet robustness under distribution shift remains poorly understood, especially when the supervision mechanism itself changes. We formalize this as supervision drift, defined...

1 min 1 week, 3 days ago

ear

LOW Academic International

Learning-Based Multi-Criteria Decision Making Model for Sawmill Location Problems

arXiv:2604.04996v1 Announce Type: new Abstract: Strategically locating a sawmill is vital for enhancing the efficiency, profitability, and sustainability of timber supply chains. Our study proposes a Learning-Based Multi-Criteria Decision-Making (LB-MCDM) framework that integrates machine learning (ML) with GIS-based spatial location...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

arXiv:2604.04986v1 Announce Type: new Abstract: Model-free deep reinforcement learning (DRL) methods suffer from poor sample efficiency. To overcome this limitation, this work introduces an adaptive reduced-order-model (ROM)-based reinforcement learning framework for active flow control. In contrast to conventional actor--critic architectures,...

1 min 1 week, 3 days ago

ear

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 3 days ago

ear

LOW Academic United States

Graph Topology Information Enhanced Heterogeneous Graph Representation Learning

arXiv:2604.05732v1 Announce Type: new Abstract: Real-world heterogeneous graphs are inherently noisy and usually not in the optimal graph structures for downstream tasks, which often adversely affects the performance of GRL models in downstream tasks. Although Graph Structure Learning (GSL) methods...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

arXiv:2604.05834v1 Announce Type: new Abstract: Multimodal contrastive learning is increasingly enriched by going beyond image-text pairs. Among recent contrastive methods, Symile is a strong approach for this challenge because its multiplicative interaction objective captures higher-order cross-modal dependence. Yet, we find...

1 min 1 week, 3 days ago

ear

LOW Academic United States

AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

arXiv:2604.05550v1 Announce Type: new Abstract: Artificial intelligence research increasingly depends on prolonged cycles of reproduction, debugging, and iterative refinement to achieve State-Of-The-Art (SOTA) performance, creating a growing need for systems that can accelerate the full pipeline of empirical model optimization....

1 min 1 week, 3 days ago

ear

LOW Academic International

Optimal-Transport-Guided Functional Flow Matching for Turbulent Field Generation in Hilbert Space

arXiv:2604.05700v1 Announce Type: new Abstract: High-fidelity modeling of turbulent flows requires capturing complex spatiotemporal dynamics and multi-scale intermittency, posing a fundamental challenge for traditional knowledge-based systems. While deep generative models, such as diffusion models and Flow Matching, have shown promising...

1 min 1 week, 3 days ago

ear

LOW Academic International

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

arXiv:2604.05414v1 Announce Type: new Abstract: Recent work has shown that removing orthogonalization during training and applying it only at inference improves rotation estimation in deep learning, with empirical evidence favoring 9D representations with SVD projection. However, the theoretical understanding of...

1 min 1 week, 3 days ago

ear

LOW Academic International

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

arXiv:2604.05116v1 Announce Type: new Abstract: Clinical diagnosis requires sequential evidence acquisition under uncertainty. However, most Large Language Model (LLM) based diagnostic systems assume fully observed patient information and therefore do not explicitly model how clinical evidence should be sequentially acquired...

1 min 1 week, 3 days ago

ear

LOW Academic International

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

arXiv:2604.05557v1 Announce Type: new Abstract: Scientific research follows multi-turn, multi-step workflows that require proactively searching the literature, consulting figures and tables, and integrating evidence across papers to align experimental settings and support reproducible conclusions. This joint capability is not systematically...

1 min 1 week, 3 days ago

ear

LOW Academic United States

On the Geometry of Positional Encodings in Transformers

arXiv:2604.05217v1 Announce Type: new Abstract: Neural language models process sequences of words, but the mathematical operations inside them are insensitive to the order in which words appear. Positional encodings are the component added to remedy this. Despite their importance, positional...

1 min 1 week, 3 days ago

ear

LOW Academic International

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

arXiv:2604.05348v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) remain a safety-critical issue, particularly when available evidence is insufficient or conflicting. We study this problem in diabetic retinopathy (DR) decision settings and introduce RETINA-SAFE, an evidence-grounded benchmark...

1 min 1 week, 3 days ago

ear

LOW Academic International

Controllable Image Generation with Composed Parallel Token Prediction

arXiv:2604.05730v1 Announce Type: new Abstract: Conditional discrete generative models struggle to faithfully compose multiple input conditions. To address this, we derive a theoretically-grounded formulation for composing discrete probabilistic generative processes, with masked generation (absorbing diffusion) as a special case. Our...

1 min 1 week, 3 days ago

ear

LOW Academic European Union

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter systems

arXiv:2604.05187v1 Announce Type: new Abstract: We propose an extended Fourier neural operator (FNO) architecture for learning state and linear quadratic additive optimal control of systems governed by partial differential equations. Using the Ehrenpreis-Palamodov fundamental principle, we show that any state...

1 min 1 week, 3 days ago

ear

LOW Academic International

A mathematical theory of evolution for self-designing AIs

arXiv:2604.05142v1 Announce Type: new Abstract: As artificial intelligence systems (AIs) become increasingly produced by recursive self-improvement, a form of evolution may emerge, in which the traits of AI systems are shaped by the success of earlier AIs in designing and...

1 min 1 week, 3 days ago

ear

RAG or Learning? Understanding the Limits of LLM Adaptation under Continuous Knowledge Drift in the Real World

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

Dialogue Act Patterns in GenAI-Mediated L2 Oral Practice: A Sequential Analysis of Learner-Chatbot Interactions

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

Shadow Derivatives: The Quiet Propertization of AI Learning

Expectation Maximization (EM) Converges for General Agnostic Mixtures

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

Learning Stable Predictors from Weak Supervision under Distribution Shift

Learning-Based Multi-Criteria Decision Making Model for Sawmill Location Problems

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

Graph Topology Information Enhanced Heterogeneous Graph Representation Learning

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

Optimal-Transport-Guided Functional Flow Matching for Turbulent Field Generation in Hilbert Space

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

On the Geometry of Positional Encodings in Transformers

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

Controllable Image Generation with Composed Parallel Token Prediction

FNO$^{\angle \theta}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter systems

A mathematical theory of evolution for self-designing AIs

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.