International Law

LOW Academic International

WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning

arXiv:2602.12852v1 Announce Type: new Abstract: Deep Research systems based on web agents have shown strong potential in solving complex information-seeking tasks, yet their search efficiency remains underexplored. We observe that many state-of-the-art open-source web agents rely on long tool-call trajectories...

1 min 1 month, 2 weeks ago

LOW Academic International

Language-Guided Invariance Probing of Vision-Language Models

arXiv:2511.13494v1 Announce Type: cross Abstract: Recent vision-language models (VLMs) such as CLIP, OpenCLIP, EVA02-CLIP and SigLIP achieve strong zero-shot performance, but it is unclear how reliably they respond to controlled linguistic perturbations. We introduce Language-Guided Invariance Probing (LGIP), a benchmark...

1 min 1 month, 2 weeks ago

LOW Academic International

A Lightweight LLM Framework for Disaster Humanitarian Information Classification

arXiv:2602.12284v1 Announce Type: cross Abstract: Timely classification of humanitarian information from social media is critical for effective disaster response. However, deploying large language models (LLMs) for this task faces challenges in resource-constrained emergency settings. This paper develops a lightweight, cost-effective...

1 min 1 month, 2 weeks ago

LOW Academic International

From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness

arXiv:2602.12285v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed as autonomous agents capable of actions with real-world impacts beyond text generation. While persona-induced biases in text generation are well documented, their effects on agent task performance remain...

1 min 1 month, 2 weeks ago

LOW Academic International

Energy-Aware Reinforcement Learning for Robotic Manipulation of Articulated Components in Infrastructure Operation and Maintenance

arXiv:2602.12288v1 Announce Type: cross Abstract: With the growth of intelligent civil infrastructure and smart cities, operation and maintenance (O&M) increasingly requires safe, efficient, and energy-conscious robotic manipulation of articulated components, including access doors, service drawers, and pipeline valves. However, existing...

1 min 1 month, 2 weeks ago

LOW Academic International

Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

arXiv:2602.12296v1 Announce Type: cross Abstract: This study proposes a novel adaptive traffic signal control method leveraging a Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) to optimize signal timing by integrating variable cell length and multi-channel state representation. A road...

1 min 1 month, 2 weeks ago

LOW Academic International

OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

arXiv:2602.12305v1 Announce Type: cross Abstract: Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize functionally correct CUDA code, achieving...

1 min 1 month, 2 weeks ago

LOW Academic International

Quantum walk inspired JPEG compression of images

arXiv:2602.12306v1 Announce Type: cross Abstract: This work proposes a quantum inspired adaptive quantization framework that enhances the classical JPEG compression by introducing a learned, optimized Qtable derived using a Quantum Walk Inspired Optimization (QWIO) search strategy. The optimizer searches a...

1 min 1 month, 2 weeks ago

LOW Academic International

Visible and Hyperspectral Imaging for Quality Assessment of Milk: Property Characterisation and Identification

arXiv:2602.12313v1 Announce Type: cross Abstract: Rapid and non-destructive assessment of milk quality is crucial to ensuring both nutritional value and food safety. In this study, we investigated the potential of visible and hyperspectral imaging as cost-effective and quick-response alternatives to...

1 min 1 month, 2 weeks ago

LOW Academic International

AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

arXiv:2602.12315v1 Announce Type: cross Abstract: The proliferation of e-commerce has made web shopping platforms key gateways for customers navigating the vast digital marketplace. Yet this rapid expansion has led to a noisy and fragmented information environment, increasing cognitive burden as...

1 min 1 month, 2 weeks ago

LOW Academic International

Free Lunch in Medical Image Foundation Model Pre-training via Randomized Synthesis and Disentanglement

arXiv:2602.12317v1 Announce Type: cross Abstract: Medical image foundation models (MIFMs) have demonstrated remarkable potential for a wide range of clinical tasks, yet their development is constrained by the scarcity, heterogeneity, and high cost of large-scale annotated datasets. Here, we propose...

1 min 1 month, 2 weeks ago

LOW Academic International

ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

arXiv:2602.12322v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models convert high-level language instructions into concrete, executable actions, a task that is especially challenging in open-world environments. We present Visual Foresight Planning (ForeAct), a general and efficient planner that guides a VLA...

1 min 1 month, 2 weeks ago

LOW Academic International

Intrinsic Credit Assignment for Long Horizon Interaction

arXiv:2602.12342v1 Announce Type: cross Abstract: How can we train agents to navigate uncertainty over long horizons? In this work, we propose $\Delta$Belief-RL, which leverages a language model's own intrinsic beliefs to reward intermediate progress. Our method utilizes the change in...

1 min 1 month, 2 weeks ago

LOW Academic International

Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning

arXiv:2602.12375v1 Announce Type: cross Abstract: Optimistic value estimates provide one mechanism for directed exploration in reinforcement learning (RL). The agent acts greedily with respect to an estimate of the value plus what can be seen as a value bonus. The...

1 min 1 month, 2 weeks ago

LOW Academic International

What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

arXiv:2602.12395v1 Announce Type: cross Abstract: Reinforcement learning (RL) with verifiable rewards has become a standard post-training stage for boosting visual reasoning in vision-language models, yet it remains unclear what capabilities RL actually improves compared with supervised fine-tuning as cold-start initialization...

1 min 1 month, 2 weeks ago

LOW Academic International

AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning

arXiv:2602.12402v1 Announce Type: cross Abstract: Analog and mixed-signal (AMS) integrated circuits (ICs) lie at the core of modern computing and communications systems. However, despite the continued rise in design complexity, advances in AMS automation remain limited. This reflects the central...

1 min 1 month, 2 weeks ago

LOW Academic International

Soft Contamination Means Benchmarks Test Shallow Generalization

arXiv:2602.12413v1 Announce Type: cross Abstract: If LLM training data is polluted with benchmark test data, then benchmark performance gives biased estimates of out-of-distribution (OOD) generalization. Typical decontamination filters use n-gram matching, which fails to detect semantic duplicates: sentences with equivalent...

1 min 1 month, 2 weeks ago

LOW Academic International

RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

arXiv:2602.12424v1 Announce Type: cross Abstract: Benchmarks establish a standardized evaluation framework to systematically assess the performance of large language models (LLMs), facilitating objective comparisons and driving advancements in the field. However, existing benchmarks fail to differentiate question difficulty, limiting their...

1 min 1 month, 2 weeks ago

LOW Academic International

Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

arXiv:2602.12444v1 Announce Type: cross Abstract: Reinforcement learning (RL) is a powerful framework for optimal decision-making and control but often lacks provable guarantees for safety-critical applications. In this paper, we introduce a novel recovery-based shielding framework that enables safe RL with...

1 min 1 month, 2 weeks ago

LOW Academic International

CLASE: A Hybrid Method for Chinese Legalese Stylistic Evaluation

arXiv:2602.12639v1 Announce Type: new Abstract: Legal text generated by large language models (LLMs) can usually achieve reasonable factual accuracy, but it frequently fails to adhere to the specialised stylistic norms and linguistic conventions of legal writing. In order to improve...

1 min 1 month, 2 weeks ago

LOW Academic International

Learning Ordinal Probabilistic Reward from Preferences

arXiv:2602.12660v1 Announce Type: new Abstract: Reward models are crucial for aligning large language models (LLMs) with human values and intentions. Existing approaches follow either Generative (GRMs) or Discriminative (DRMs) paradigms, yet both suffer from limitations: GRMs typically demand costly point-wise...

1 min 1 month, 2 weeks ago

LOW Academic International

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

arXiv:2602.12674v1 Announce Type: new Abstract: Knowledge Distillation (KD) for Large Language Models (LLMs) has become increasingly important as models grow in size and complexity. While existing distillation approaches focus on imitating teacher behavior, they often overlook the original learning environment...

1 min 1 month, 2 weeks ago

LOW Academic International

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

arXiv:2602.12746v1 Announce Type: new Abstract: Despite their impressive performance, self-supervised speech models often struggle to generalize to new languages and tend to forget previously acquired knowledge during continual training. To address this, we propose Lamer-SSL, a parameter-efficient framework that integrates...

1 min 1 month, 2 weeks ago

LOW Academic International

Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews

arXiv:2602.12778v1 Announce Type: new Abstract: This study advances aspect-based sentiment analysis (ABSA) for Persian-language user reviews in the tourism domain, addressing challenges of low-resource languages. We propose a hybrid BERT-based model with Top-K routing and auxiliary losses to mitigate routing...

1 min 1 month, 2 weeks ago

LOW Academic International

BaziQA-Benchmark: Evaluating Symbolic and Temporally Compositional Reasoning in Large Language Models

arXiv:2602.12889v1 Announce Type: new Abstract: We present BaziQA-Benchmark, a standardized benchmark for evaluating symbolic and temporally compositional reasoning in large language models. The benchmark is derived from 200 professionally curated, multiple-choice problems from the Global Fortune-teller Competition (2021-2025), where each...

1 min 1 month, 2 weeks ago

LOW Academic International

ProbeLLM: Automating Principled Diagnosis of LLM Failures

arXiv:2602.12966v1 Announce Type: new Abstract: Understanding how and why large language models (LLMs) fail is becoming a central challenge as models rapidly evolve and static evaluations fall behind. While automated probing has been enabled by dynamic test generation, existing approaches...

1 min 1 month, 2 weeks ago

LOW Academic International

Evaluating the Homogeneity of Keyphrase Prediction Models

arXiv:2602.12989v1 Announce Type: new Abstract: Keyphrases, which are useful in several NLP and IR applications, are either extracted from text or predicted by generative models. Contrary to keyphrase extraction approaches, keyphrase generation models can predict keyphrases that do not appear...

1 min 1 month, 2 weeks ago

LOW Academic International

Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models

arXiv:2602.12996v1 Announce Type: new Abstract: Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with internal knowledge, overlooking the knowledge-confidence gaps...

1 min 1 month, 2 weeks ago

LOW Academic International

Exploring a New Competency Modeling Process with Large Language Models

arXiv:2602.13084v1 Announce Type: new Abstract: Competency modeling is widely used in human resource management to select, develop, and evaluate talent. However, traditional expert-driven approaches rely heavily on manual analysis of large volumes of interview transcripts, making them costly and prone...

1 min 1 month, 2 weeks ago

LOW Academic International

Semantic Chunking and the Entropy of Natural Language

arXiv:2602.13194v1 Announce Type: new Abstract: The entropy rate of printed English is famously estimated to be about one bit per character, a benchmark that modern large language models (LLMs) have only recently approached. This entropy rate implies that English contains...

1 min 1 month, 2 weeks ago
