International Law

LOW Academic International

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

arXiv:2604.06465v1 Announce Type: new Abstract: Reasoning models have demonstrated remarkable capabilities in solving complex problems by leveraging long chains of thought. However, this more deliberate reasoning comes with substantial computational overhead at inference time. The Long-to-Short (L2S) reasoning problem seeks...

1 min 1 week, 1 day ago

ear

LOW Academic International

ValueGround: Evaluating Culture-Conditioned Visual Value Grounding in MLLMs

arXiv:2604.06484v1 Announce Type: new Abstract: Cultural values are expressed not only through language but also through visual scenes and everyday social practices. Yet existing evaluations of cultural values in language models are almost entirely text-only, making it unclear whether models...

1 min 1 week, 1 day ago

ear

LOW Academic International

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

arXiv:2604.06296v1 Announce Type: new Abstract: AI agents are increasingly deployed in real-world applications, including systems such as Manus, OpenClaw, and coding agents. Existing research has primarily focused on \emph{server-side} efficiency, proposing methods such as caching, speculative execution, traffic scheduling, and...

1 min 1 week, 1 day ago

ear

LOW Academic International

Learning to Interrupt in Language-based Multi-agent Communication

arXiv:2604.06452v1 Announce Type: new Abstract: Multi-agent systems using large language models (LLMs) have demonstrated impressive capabilities across various domains. However, current agent communication suffers from verbose output that overload context and increase computational costs. Although existing approaches focus on compressing...

1 min 1 week, 1 day ago

ear

LOW Academic International

A Severity-Based Curriculum Learning Strategy for Arabic Medical Text Generation

arXiv:2604.06365v1 Announce Type: new Abstract: Arabic medical text generation is increasingly needed to help users interpret symptoms and access general health guidance in their native language. Nevertheless, many existing methods assume uniform importance across training samples, overlooking differences in clinical...

1 min 1 week, 1 day ago

ear

LOW Academic International

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

arXiv:2604.06501v1 Announce Type: new Abstract: Analogical reasoning is a hallmark of human intelligence, enabling us to solve new problems by transferring knowledge from one situation to another. Yet, developing artificial intelligence systems capable of robust human-like analogical reasoning has proven...

1 min 1 week, 1 day ago

ear

LOW Academic International

AE-ViT: Stable Long-Horizon Parametric Partial Differential Equations Modeling

arXiv:2604.06475v1 Announce Type: new Abstract: Deep Learning Reduced Order Models (ROMs) are becoming increasingly popular as surrogate models for parametric partial differential equations (PDEs) due to their ability to handle high-dimensional data, approximate highly nonlinear mappings, and utilize GPUs. Existing...

1 min 1 week, 1 day ago

ear

LOW Academic International

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

arXiv:2604.06473v1 Announce Type: new Abstract: Multivariate forecasting with Transformers faces a core scalability challenge: modeling cross-channel dependencies via attention compounds attention's quadratic sequence complexity with quadratic channel scaling, making full cross-channel attention impractical for high-dimensional time series. We propose Multivariate...

1 min 1 week, 1 day ago

ear

LOW Academic International

From Load Tests to Live Streams: Graph Embedding-Based Anomaly Detection in Microservice Architectures

arXiv:2604.06448v1 Announce Type: new Abstract: Prime Video regularly conducts load tests to simulate the viewer traffic spikes seen during live events such as Thursday Night Football as well as video-on-demand (VOD) events such as Rings of Power. While these stress...

1 min 1 week, 1 day ago

ear

LOW Academic International

The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning

arXiv:2604.06427v1 Announce Type: new Abstract: The viability of chain-of-thought (CoT) monitoring hinges on models being unable to reason effectively in their latent representations. Yet little is known about the limits of such latent reasoning in LLMs. We test these limits...

1 min 1 week, 1 day ago

ear

LOW Academic International

SMT-AD: a scalable quantum-inspired anomaly detection approach

arXiv:2604.06265v1 Announce Type: new Abstract: Quantum-inspired tensor networks algorithms have shown to be effective and efficient models for machine learning tasks, including anomaly detection. Here, we propose a highly parallelizable quantum-inspired approach which we call SMT-AD from Superposition of Multiresolution...

1 min 1 week, 1 day ago

ear

LOW Academic International

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

arXiv:2604.06260v1 Announce Type: new Abstract: Test-time scaling investigates whether a fixed diffusion language model (DLM) can generate better outputs when given more inference compute, without additional training. However, naive best-of-$K$ sampling is fundamentally limited because it repeatedly draws from the...

1 min 1 week, 1 day ago

ear

LOW Academic International

Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction

arXiv:2604.05477v1 Announce Type: new Abstract: Autonomous GUI agents based on vision-language models (VLMs) often assume deterministic environment responses, generating actions without verifying whether previous operations succeeded. In real-world settings with network latency, rendering delays, and system interruptions, this assumption leads...

1 min 1 week, 2 days ago

ear

LOW Academic International

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

arXiv:2604.05552v1 Announce Type: new Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental challenges in managing the non-linear flow of human conversation. The prevalent approach of treating dialogue history as a flat, linear sequence...

1 min 1 week, 2 days ago

ear

LOW Academic International

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

arXiv:2604.05445v1 Announce Type: new Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discriminative ones are efficient but act as opaque "black boxes." To bridge this gap, we propose VL-MDR (Vision-Language Multi-Dimensional Reward), a framework...

1 min 1 week, 2 days ago

ear

LOW Academic International

Gradient-Controlled Decoding: A Safety Guardrail for LLMs with Dual-Anchor Steering

arXiv:2604.05179v1 Announce Type: new Abstract: Large language models (LLMs) remain susceptible to jailbreak and direct prompt-injection attacks, yet the strongest defensive filters frequently over-refuse benign queries and degrade user experience. Previous work on jailbreak and prompt injection detection such as...

1 min 1 week, 2 days ago

icc

LOW Academic International

YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset

arXiv:2604.05624v1 Announce Type: new Abstract: Named Entity Recognition (NER) is a foundational NLP task, yet research in Yor\`ub\'a has been constrained by limited and domain-specific resources. Existing resources, such as MasakhaNER (a manually annotated news-domain corpus) and WikiAnn (automatically created...

1 min 1 week, 2 days ago

ear

LOW Academic International

What Makes a Good Response? An Empirical Analysis of Quality in Qualitative Interviews

arXiv:2604.05163v1 Announce Type: new Abstract: Qualitative interviews provide essential insights into human experiences when they elicit high-quality responses. While qualitative and NLP researchers have proposed various measures of interview quality, these measures lack validation that high-scoring responses actually contribute to...

1 min 1 week, 2 days ago

ear

LOW Academic International

SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

arXiv:2604.05135v1 Announce Type: new Abstract: We introduce SenseAI, a human-in-the-loop (HITL) validated financial sentiment dataset designed to capture not only model outputs but the full reasoning process behind them. Unlike existing resources, SenseAI incorporates reasoning chains, confidence scores, human correction...

1 min 1 week, 2 days ago

ear

LOW Academic International

Learning-Based Multi-Criteria Decision Making Model for Sawmill Location Problems

arXiv:2604.04996v1 Announce Type: new Abstract: Strategically locating a sawmill is vital for enhancing the efficiency, profitability, and sustainability of timber supply chains. Our study proposes a Learning-Based Multi-Criteria Decision-Making (LB-MCDM) framework that integrates machine learning (ML) with GIS-based spatial location...

1 min 1 week, 2 days ago

ear

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 1 week, 2 days ago

ear

LOW Academic International

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

arXiv:2604.05185v1 Announce Type: new Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning through simulated rollouts. In offline settings with hidden confounding, however, models learned directly from observational...

1 min 1 week, 2 days ago

ear

LOW Academic International

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv:2604.05064v1 Announce Type: new Abstract: Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume static correlations, and are typically missing realistic inter-channel dependencies. We introduce DynLMC, a Dynamic Linear Model of Coregionalization, that...

1 min 1 week, 2 days ago

ear

LOW Academic International

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

arXiv:2604.04983v1 Announce Type: new Abstract: We present Territory Paint Wars, a minimal competitive multi-agent reinforcement learning environment implemented in Unity, and use it to systematically investigate failure modes of Proximal Policy Optimisation (PPO) under self-play. A first agent trained for...

1 min 1 week, 2 days ago

ear

LOW Academic International

Learning Stable Predictors from Weak Supervision under Distribution Shift

arXiv:2604.05002v1 Announce Type: new Abstract: Learning from weak or proxy supervision is common when ground-truth labels are unavailable, yet robustness under distribution shift remains poorly understood, especially when the supervision mechanism itself changes. We formalize this as supervision drift, defined...

1 min 1 week, 2 days ago

ear

LOW Academic International

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

arXiv:2604.04999v1 Announce Type: new Abstract: Multimodal self-supervised pretraining offers a promising route to cancer prognosis by integrating histopathology whole-slide images, gene expression, and pathology reports, yet most existing approaches require fully paired and complete inputs. In practice, clinical cohorts are...

1 min 1 week, 2 days ago

ear

LOW Academic International

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

arXiv:2604.05348v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) remain a safety-critical issue, particularly when available evidence is insufficient or conflicting. We study this problem in diabetic retinopathy (DR) decision settings and introduce RETINA-SAFE, an evidence-grounded benchmark...

1 min 1 week, 2 days ago

ear

LOW Academic International

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

arXiv:2604.05540v1 Announce Type: new Abstract: Large language models (LLMs) can effectively handle outdated information through knowledge editing. However, current approaches face two key limitations: (I) Poor generalization: Most approaches rigidly inject new knowledge without ensuring that the model can use...

1 min 1 week, 2 days ago

ear

LOW Academic International

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

arXiv:2604.05557v1 Announce Type: new Abstract: Scientific research follows multi-turn, multi-step workflows that require proactively searching the literature, consulting figures and tables, and integrating evidence across papers to align experimental settings and support reproducible conclusions. This joint capability is not systematically...

1 min 1 week, 2 days ago

ear

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 2 days ago

ear

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

ValueGround: Evaluating Culture-Conditioned Visual Value Grounding in MLLMs

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

Learning to Interrupt in Language-based Multi-agent Communication

A Severity-Based Curriculum Learning Strategy for Arabic Medical Text Generation

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

AE-ViT: Stable Long-Horizon Parametric Partial Differential Equations Modeling

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

From Load Tests to Live Streams: Graph Embedding-Based Anomaly Detection in Microservice Architectures

The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning

SMT-AD: a scalable quantum-inspired anomaly detection approach

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

Gradient-Controlled Decoding: A Safety Guardrail for LLMs with Dual-Anchor Steering

YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset

What Makes a Good Response? An Empirical Analysis of Quality in Qualitative Interviews

SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

Learning-Based Multi-Criteria Decision Making Model for Sawmill Location Problems

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

Learning Stable Predictors from Weak Supervision under Distribution Shift

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.