Immigration Law

LOW Academic European Union

ODE-free Neural Flow Matching for One-Step Generative Modeling

arXiv:2604.06413v1 Announce Type: new Abstract: Diffusion and flow matching models generate samples by learning time-dependent vector fields whose integration transports noise to data, requiring tens to hundreds of network evaluations at inference. We instead learn the transport map directly. We...

1 min 1 week, 2 days ago

ead

LOW Academic European Union

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

arXiv:2604.06207v1 Announce Type: new Abstract: This paper investigates demonstration selection strategies for predicting a user's next point-of-interest (POI) using large language models (LLMs), aiming to accurately forecast a user's subsequent location based on historical check-in data. While in-context learning (ICL)...

1 min 1 week, 2 days ago

tps

LOW Academic United States

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often...

1 min 1 week, 3 days ago

ead

LOW Academic International

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

arXiv:2604.05557v1 Announce Type: new Abstract: Scientific research follows multi-turn, multi-step workflows that require proactively searching the literature, consulting figures and tables, and integrating evidence across papers to align experimental settings and support reproducible conclusions. This joint capability is not systematically...

1 min 1 week, 3 days ago

ead

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Expectation Maximization (EM) Converges for General Agnostic Mixtures

arXiv:2604.05842v1 Announce Type: new Abstract: Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using $k$ linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground...

1 min 1 week, 3 days ago

ead

LOW Academic International

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

arXiv:2604.05303v1 Announce Type: new Abstract: Sampling physical systems with rough energy landscapes is hindered by rare events and metastable trapping. While Boltzmann generators already offer a solution, their reliance on the reverse Kullback--Leibler divergence frequently induces catastrophic mode collapse, missing...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Towards Scaling Law Analysis For Spatiotemporal Weather Data

arXiv:2604.05068v1 Announce Type: new Abstract: Compute-optimal scaling laws are relatively well studied for NLP and CV, where objectives are typically single-step and targets are comparatively homogeneous. Weather forecasting is harder to characterize in the same framework: autoregressive rollouts compound errors...

1 min 1 week, 3 days ago

ead

LOW Academic International

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

arXiv:2604.05552v1 Announce Type: new Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental challenges in managing the non-linear flow of human conversation. The prevalent approach of treating dialogue history as a flat, linear sequence...

1 min 1 week, 3 days ago

ead

LOW Academic International

MMORF: A Multi-agent Framework for Designing Multi-objective Retrosynthesis Planning Systems

arXiv:2604.05075v1 Announce Type: new Abstract: Multi-objective retrosynthesis planning is a critical chemistry task requiring dynamic balancing of quality, safety, and cost objectives. Language model-based multi-agent systems (MAS) offer a promising approach for this task: leveraging interactions of specialized agents to...

1 min 1 week, 3 days ago

tps

LOW Academic United States

Attribution Bias in Large Language Models

arXiv:2604.05224v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly used to support search and information retrieval, it is critical that they accurately attribute content to its original authors. In this work, we introduce AttriBench, the first fame-...

1 min 1 week, 3 days ago

ead

LOW Academic International

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

arXiv:2604.05018v1 Announce Type: new Abstract: Synthesizing unstructured research materials into manuscripts is an essential yet under-explored challenge in AI-driven scientific discovery. Existing autonomous writers are rigidly coupled to specific experimental pipelines, and produce superficial literature reviews. We introduce PaperOrchestra, a...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

A Theory-guided Weighted $L^2$ Loss for solving the BGK model via Physics-informed neural networks

arXiv:2604.04971v1 Announce Type: new Abstract: While Physics-Informed Neural Networks offer a promising framework for solving partial differential equations, the standard $L^2$ loss formulation is fundamentally insufficient when applied to the Bhatnagar-Gross-Krook (BGK) model. Specifically, simply minimizing the standard loss does...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Neural Assistive Impulses: Synthesizing Exaggerated Motions for Physics-based Characters

arXiv:2604.05394v1 Announce Type: new Abstract: Physics-based character animation has become a fundamental approach for synthesizing realistic, physically plausible motions. While current data-driven deep reinforcement learning (DRL) methods can synthesize complex skills, they struggle to reproduce exaggerated, stylized motions, such as...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

arXiv:2604.05476v1 Announce Type: new Abstract: This work investigates the adaptation of the AlphaZero reinforcement learning algorithm to Tablut, an asymmetric historical board game featuring unequal piece counts and distinct player objectives (king capture versus king escape). While the original AlphaZero...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

arXiv:2604.04988v1 Announce Type: new Abstract: Modern deployment often requires trading accuracy for efficiency under tight CPU and memory constraints, yet common compression proxies such as parameter count or FLOPs do not reliably predict wall-clock inference time. In particular, unstructured sparsity...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents

arXiv:2604.05549v1 Announce Type: new Abstract: With the widespread application of LLM-based agents across various domains, their complexity has introduced new security threats. Existing red-team methods mostly rely on modifying user prompts, which lack adaptability to new data and may impact...

1 min 1 week, 3 days ago

ead

LOW Academic International

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

arXiv:2604.05643v1 Announce Type: new Abstract: Extending CoT through RL has been widely used to enhance the reasoning capabilities of LLMs. However, due to the sparsity of reward signals, it can also induce undesirable thinking patterns such as overthinking, i.e., generating...

1 min 1 week, 3 days ago

ead

LOW Academic International

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

arXiv:2604.05540v1 Announce Type: new Abstract: Large language models (LLMs) can effectively handle outdated information through knowledge editing. However, current approaches face two key limitations: (I) Poor generalization: Most approaches rigidly inject new knowledge without ensuring that the model can use...

1 min 1 week, 3 days ago

tps

LOW Academic International

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

arXiv:2604.05546v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) enable sophisticated reasoning over images and videos, yet their inference is hindered by a systemic efficiency barrier known as visual token dominance. This overhead is driven by a multi-regime interplay between...

1 min 1 week, 3 days ago

ead

LOW Academic International

Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space

arXiv:2604.05030v1 Announce Type: new Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex-valued, associations accumulate in a matrix state $S_{t}$ $\in$ $\mathbb{C}^{d \times d}$ via outer products, and retrieval operates through the conjugate...

1 min 1 week, 3 days ago

ead

LOW Academic International

TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems

arXiv:2604.05364v1 Announce Type: new Abstract: We introduce TFRBench, the first benchmark designed to evaluate the reasoning capabilities of forecasting systems. Traditionally, time-series forecasting has been evaluated solely on numerical accuracy, treating foundation models as ``black boxes.'' Unlike existing benchmarks, TFRBench...

1 min 1 week, 3 days ago

tps

LOW Academic International

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

arXiv:2604.05358v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) mitigates hallucination but does not eliminate it: a deployed system must still decide, at inference time, whether its answer is actually supported by the retrieved evidence. We introduce LatentAudit, a white-box auditor...

1 min 1 week, 3 days ago

ead

LOW Academic International

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

arXiv:2604.05168v1 Announce Type: new Abstract: Leadership-class HPC systems generate massive volumes of heterogeneous, largely unstructured system logs. Because these logs originate from diverse software, hardware, and runtime layers, they exhibit inconsistent formats, making structure extraction and pattern discovery extremely challenging....

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors

arXiv:2604.05165v1 Announce Type: new Abstract: Reconfigurable Intelligent Surfaces (RIS) has a potential to engineer smart radio environments for next-generation millimeter-wave (mmWave) networks. However, the prohibitive computational overhead of Channel State Information (CSI) estimation and the dimensionality explosion inherent in centralized...

1 min 1 week, 3 days ago

ead

LOW Academic International

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

arXiv:2604.05438v1 Announce Type: new Abstract: Long-context generation is increasingly limited by decode-time key-value (KV) cache traffic, particularly when KV is offloaded beyond GPU memory. Query-aware retrieval (e.g., Top-K selection) reduces this traffic by loading only a subset of KV pairs,...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Faster Superword Tokenization

arXiv:2604.05192v1 Announce Type: new Abstract: Byte Pair Encoding (BPE) is a widely used tokenization algorithm, whose tokens cannot extend across pre-tokenization boundaries, functionally limiting it to representing at most full words. The BoundlessBPE and SuperBPE algorithms extend and improve BPE...

1 min 1 week, 3 days ago

ead

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

arXiv:2604.05923v1 Announce Type: new Abstract: State space models (SSMs) have been shown to possess the theoretical capacity to model both star-free sequential tasks and bounded hierarchical structures Sarrof et al. (2024). However, formal expressivity results do not guarantee that gradient-based...

1 min 1 week, 3 days ago

ead

LOW Academic International

From Uniform to Learned Knots: A Study of Spline-Based Numerical Encodings for Tabular Deep Learning

arXiv:2604.05635v1 Announce Type: new Abstract: Numerical preprocessing remains an important component of tabular deep learning, where the representation of continuous features can strongly affect downstream performance. Although its importance is well established for classical statistical and machine learning models, the...

1 min 1 week, 3 days ago

ead

ODE-free Neural Flow Matching for One-Step Generative Modeling

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

Expectation Maximization (EM) Converges for General Agnostic Mixtures

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

Towards Scaling Law Analysis For Spatiotemporal Weather Data

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

MMORF: A Multi-agent Framework for Designing Multi-objective Retrosynthesis Planning Systems

Attribution Bias in Large Language Models

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

A Theory-guided Weighted $L^2$ Loss for solving the BGK model via Physics-informed neural networks

Neural Assistive Impulses: Synthesizing Exaggerated Motions for Physics-based Characters

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space

TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

Faster Superword Tokenization

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

From Uniform to Learned Knots: A Study of Spline-Based Numerical Encodings for Tabular Deep Learning

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.