AI & Technology Law

LOW Academic International

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

arXiv:2604.05091v1 Announce Type: new Abstract: We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large language models at full precision on a single GPU. Unlike traditional GPU-centric systems, MegaTrain stores parameters and optimizer states in host memory (CPU...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Multilingual Language Models Encode Script Over Linguistic Structure

arXiv:2604.05090v1 Announce Type: new Abstract: Multilingual language models (LMs) organize representations for typologically and orthographically diverse languages into a shared parameter space, yet the nature of this internal organization remains elusive. In this work, we investigate which linguistic properties -...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Memory Dial: A Training Framework for Controllable Memorization in Language Models

arXiv:2604.05074v1 Announce Type: new Abstract: Memorization in language models is widely studied but remains difficult to isolate and control. Understanding when and what models memorize is essential for explaining their predictions, yet existing approaches are post-hoc: they can detect memorization...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

arXiv:2604.04999v1 Announce Type: new Abstract: Multimodal self-supervised pretraining offers a promising route to cancer prognosis by integrating histopathology whole-slide images, gene expression, and pathology reports, yet most existing approaches require fully paired and complete inputs. In practice, clinical cohorts are...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Learning Stable Predictors from Weak Supervision under Distribution Shift

arXiv:2604.05002v1 Announce Type: new Abstract: Learning from weak or proxy supervision is common when ground-truth labels are unavailable, yet robustness under distribution shift remains poorly understood, especially when the supervision mechanism itself changes. We formalize this as supervision drift, defined...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv:2604.05064v1 Announce Type: new Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume static correlations, and are typically missing realistic inter-channel dependencies. We introduce DynLMC, a Dynamic Linear Model of Coregionalization, that...

1 min 2 weeks, 5 days ago

ai

LOW Academic European Union

Weight-Informed Self-Explaining Clustering for Mixed-Type Tabular Data

arXiv:2604.05857v1 Announce Type: new Abstract: Clustering mixed-type tabular data is fundamental for exploratory analysis, yet remains challenging due to misaligned numerical-categorical representations, uneven and context-dependent feature relevance, and disconnected and post-hoc explanation from the clustering process. We propose WISE, a...

1 min 2 weeks, 5 days ago

ai

LOW Academic European Union

Towards Scaling Law Analysis For Spatiotemporal Weather Data

arXiv:2604.05068v1 Announce Type: new Abstract: Compute-optimal scaling laws are relatively well studied for NLP and CV, where objectives are typically single-step and targets are comparatively homogeneous. Weather forecasting is harder to characterize in the same framework: autoregressive rollouts compound errors...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 2 weeks, 5 days ago

ai

LOW Academic United States

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Cross-Machine Anomaly Detection Leveraging Pre-trained Time-series Model

arXiv:2604.05335v1 Announce Type: new Abstract: Achieving resilient and high-quality manufacturing requires reliable data-driven anomaly detection methods that are capable of addressing differences in behaviors among different individual machines which are nominally the same and are executing the same processes. To...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Channel-wise Retrieval for Multivariate Time Series Forecasting

arXiv:2604.05543v1 Announce Type: new Abstract: Multivariate time series forecasting often struggles to capture long-range dependencies due to fixed lookback windows. Retrieval-augmented forecasting addresses this by retrieving historical segments from memory, but existing approaches rely on a channel-agnostic strategy that applies...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Optimal-Transport-Guided Functional Flow Matching for Turbulent Field Generation in Hilbert Space

arXiv:2604.05700v1 Announce Type: new Abstract: High-fidelity modeling of turbulent flows requires capturing complex spatiotemporal dynamics and multi-scale intermittency, posing a fundamental challenge for traditional knowledge-based systems. While deep generative models, such as diffusion models and Flow Matching, have shown promising...

1 min 2 weeks, 5 days ago

ai

LOW Academic United States

Graph Topology Information Enhanced Heterogeneous Graph Representation Learning

arXiv:2604.05732v1 Announce Type: new Abstract: Real-world heterogeneous graphs are inherently noisy and usually not in the optimal graph structures for downstream tasks, which often adversely affects the performance of GRL models in downstream tasks. Although Graph Structure Learning (GSL) methods...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

arXiv:2604.05358v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) mitigates hallucination but does not eliminate it: a deployed system must still decide, at inference time, whether its answer is actually supported by the retrieved evidence. We introduce LatentAudit, a white-box auditor...

1 min 2 weeks, 5 days ago

ai

LOW Academic European Union

Neural Assistive Impulses: Synthesizing Exaggerated Motions for Physics-based Characters

arXiv:2604.05394v1 Announce Type: new Abstract: Physics-based character animation has become a fundamental approach for synthesizing realistic, physically plausible motions. While current data-driven deep reinforcement learning (DRL) methods can synthesize complex skills, they struggle to reproduce exaggerated, stylized motions, such as...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version

arXiv:2604.05551v1 Announce Type: new Abstract: Self-conditioning has been central to the success of continuous diffusion language models, as it allows models to correct previous errors. Yet its ability degrades precisely in the regime where diffusion is most attractive for deployment:...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling

arXiv:2604.05072v1 Announce Type: new Abstract: Recent large language models have shifted SVG generation from differentiable rendering optimization to autoregressive program synthesis. However, existing approaches still rely on generic byte-level tokenization inherited from natural language processing, which poorly reflects the geometric...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing

arXiv:2604.05077v1 Announce Type: new Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality assurance depends on high-fidelity sensor streams containing proprietary process information, limiting collaborative data sharing. Existing defect-detection models typically treat melt-pool observations as...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Proximity Measure of Information Object Features for Solving the Problem of Their Identification in Information Systems

arXiv:2604.04939v1 Announce Type: new Abstract: The paper considers a new quantitative-qualitative proximity measure for the features of information objects, where data enters a common information resource from several sources independently. The goal is to determine the possibility of their relation...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

SCMAPR: Self-Correcting Multi-Agent Prompt Refinement for Complex-Scenario Text-to-Video Generation

arXiv:2604.05489v1 Announce Type: new Abstract: Text-to-Video (T2V) generation has benefited from recent advances in diffusion models, yet current systems still struggle under complex scenarios, which are generally exacerbated by the ambiguity and underspecification of text prompts. In this work, we...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

ETR: Entropy Trend Reward for Efficient Chain-of-Thought Reasoning

arXiv:2604.05355v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning improves large language model performance on complex tasks, but often produces excessively long and inefficient reasoning traces. Existing methods shorten CoTs using length penalties or global entropy reduction, implicitly assuming that low...

1 min 2 weeks, 5 days ago

ai

LOW Law Review United States

The Many Ways of Constitutional Discourse

On January 31, 2026, in a stunning three-page order by Judge Fred Biery, the United States District Court for the Western District of Texas granted habeas relief to five-year-old Liam Conejo Ramos and his father Adrian Conejo Arias—who had been...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

THIVLVC: Retrieval Augmented Dependency Parsing for Latin

arXiv:2604.05564v1 Announce Type: new Abstract: We describe THIVLVC, a two-stage system for the EvaLatin 2026 Dependency Parsing task. Given a Latin sentence, we retrieve structurally similar entries from the CIRCSE treebank using sentence length and POS n-gram similarity, then prompt...

1 min 2 weeks, 5 days ago

ai

LOW Academic United States

Operational Noncommutativity in Sequential Metacognitive Judgments

arXiv:2604.04938v1 Announce Type: new Abstract: Metacognition, understood as the monitoring and regulation of one's own cognitive processes, is inherently sequential: an agent evaluates an internal state, updates it, and may then re-evaluate under modified criteria. Order effects in cognition are...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system

arXiv:2604.05536v1 Announce Type: new Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as a trajectory in a high-dimensional embedding space generated by transformer-based language models, and quantify scale-dependent fluctuations along the token...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

arXiv:2604.05445v1 Announce Type: new Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discriminative ones are efficient but act as opaque "black boxes." To bridge this gap, we propose VL-MDR (Vision-Language Multi-Dimensional Reward), a framework...

1 min 2 weeks, 5 days ago

ai

LOW Academic International

The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse

arXiv:2604.04943v1 Announce Type: new Abstract: The reversal curse describes a failure of autoregressive language models to retrieve a fact in reverse order (e.g., training on "A > B" but failing on "B < A"). Recent work shows that objectives with...

1 min 2 weeks, 5 days ago

ai

LOW Conference United States

ICML 2026 Press Information

1 min 2 weeks, 5 days ago

ai
