A Lightweight LLM Framework for Disaster Humanitarian Information Classification
arXiv:2602.12284v1 Announce Type: cross Abstract: Timely classification of humanitarian information from social media is critical for effective disaster response. However, deploying large language models (LLMs) for this task faces challenges in resource-constrained emergency settings. This paper develops a lightweight, cost-effective...
From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness
arXiv:2602.12285v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed as autonomous agents capable of actions with real-world impacts beyond text generation. While persona-induced biases in text generation are well documented, their effects on agent task performance remain...
Energy-Aware Reinforcement Learning for Robotic Manipulation of Articulated Components in Infrastructure Operation and Maintenance
arXiv:2602.12288v1 Announce Type: cross Abstract: With the growth of intelligent civil infrastructure and smart cities, operation and maintenance (O&M) increasingly requires safe, efficient, and energy-conscious robotic manipulation of articulated components, including access doors, service drawers, and pipeline valves. However, existing...
Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method
arXiv:2602.12296v1 Announce Type: cross Abstract: This study proposes a novel adaptive traffic signal control method leveraging a Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) to optimize signal timing by integrating variable cell length and multi-channel state representation. A road...
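The DQN side of the abstract boils down to a standard value-based control loop; a minimal sketch of epsilon-greedy phase selection over Q-values, the core decision step of any DQN-style signal controller (the phase names and numbers here are illustrative, not from the paper):

```python
import random

# Toy sketch: epsilon-greedy selection of a traffic-signal phase from
# estimated Q-values. Phase names are hypothetical placeholders.
PHASES = ["NS_green", "EW_green"]

def select_phase(q_values, epsilon=0.1, rng=random.Random(0)):
    """Pick a phase: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(PHASES)
    # argmax over the estimated Q-value of each candidate phase
    return max(PHASES, key=lambda p: q_values[p])

q = {"NS_green": 1.2, "EW_green": 0.4}
print(select_phase(q, epsilon=0.0))  # greedy choice -> NS_green
```

The paper's contribution (variable cell length, multi-channel state) concerns how `q_values` is computed from the road state, not this selection rule.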
OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model
arXiv:2602.12304v1 Announce Type: cross Abstract: Existing mainstream video customization methods focus on generating identity-consistent videos based on given reference images and textual prompts. Benefiting from the rapid advancement of joint audio-video generation, this paper proposes a more compelling new task:...
OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization
arXiv:2602.12305v1 Announce Type: cross Abstract: Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize functionally correct CUDA code, achieving...
Quantum walk inspired JPEG compression of images
arXiv:2602.12306v1 Announce Type: cross Abstract: This work proposes a quantum-inspired adaptive quantization framework that enhances classical JPEG compression by introducing a learned, optimized Q-table derived using a Quantum Walk Inspired Optimization (QWIO) search strategy. The optimizer searches a...
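For context, the Q-table being optimized plugs into the standard JPEG quantization step, which can be sketched generically (the QWIO search itself is not reproduced here; the flat table below is purely illustrative):

```python
import numpy as np

def quantize(dct_block, q_table):
    """Quantize an 8x8 DCT block: elementwise divide and round."""
    return np.round(dct_block / q_table).astype(int)

def dequantize(coeffs, q_table):
    """Invert quantization up to rounding error."""
    return coeffs * q_table

q_table = np.full((8, 8), 16)          # flat stand-in Q-table
block = np.arange(64).reshape(8, 8)    # stand-in DCT coefficients
recon = dequantize(quantize(block, q_table), q_table)
print(np.abs(recon - block).max())     # error bounded by half a quant step
```

A learned Q-table trades off exactly this rounding loss against bitrate, entry by entry, which is the knob the QWIO search turns.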
Visible and Hyperspectral Imaging for Quality Assessment of Milk: Property Characterisation and Identification
arXiv:2602.12313v1 Announce Type: cross Abstract: Rapid and non-destructive assessment of milk quality is crucial to ensuring both nutritional value and food safety. In this study, we investigated the potential of visible and hyperspectral imaging as cost-effective and quick-response alternatives to...
AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping
arXiv:2602.12315v1 Announce Type: cross Abstract: The proliferation of e-commerce has made web shopping platforms key gateways for customers navigating the vast digital marketplace. Yet this rapid expansion has led to a noisy and fragmented information environment, increasing cognitive burden as...
Free Lunch in Medical Image Foundation Model Pre-training via Randomized Synthesis and Disentanglement
arXiv:2602.12317v1 Announce Type: cross Abstract: Medical image foundation models (MIFMs) have demonstrated remarkable potential for a wide range of clinical tasks, yet their development is constrained by the scarcity, heterogeneity, and high cost of large-scale annotated datasets. Here, we propose...
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
arXiv:2602.12322v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models convert high-level language instructions into concrete, executable actions, a task that is especially challenging in open-world environments. We present Visual Foresight Planning (ForeAct), a general and efficient planner that guides a VLA...
Intrinsic Credit Assignment for Long Horizon Interaction
arXiv:2602.12342v1 Announce Type: cross Abstract: How can we train agents to navigate uncertainty over long horizons? In this work, we propose {\Delta}Belief-RL, which leverages a language model's own intrinsic beliefs to reward intermediate progress. Our method utilizes the change in...
Policy4OOD: A Knowledge-Guided World Model for Policy Intervention Simulation against the Opioid Overdose Crisis
arXiv:2602.12373v1 Announce Type: cross Abstract: The opioid epidemic remains one of the most severe public health crises in the United States, yet evaluating policy interventions before implementation is difficult: multiple policies interact within a dynamic system where targeting one risk...
Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
arXiv:2602.12375v1 Announce Type: cross Abstract: Optimistic value estimates provide one mechanism for directed exploration in reinforcement learning (RL). The agent acts greedily with respect to an estimate of the value plus what can be seen as a value bonus. The...
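The "value plus bonus" mechanism the abstract describes can be sketched in a few lines; here a simple visit-count bonus stands in for the paper's ensemble-error bonus, purely for illustration:

```python
import math

def optimistic_action(q_values, visit_counts, beta=1.0):
    """Greedy action w.r.t. Q(s,a) + beta / sqrt(1 + N(s,a))."""
    def score(a):
        return q_values[a] + beta / math.sqrt(1 + visit_counts[a])
    return max(range(len(q_values)), key=score)

q = [0.9, 0.6]
n = [100, 0]   # action 1 barely tried -> large bonus
print(optimistic_action(q, n))  # -> 1, despite the lower value estimate
```

Any uncertainty signal can be slotted into the bonus term; the paper's choice is the prediction error of an ensemble rather than a count.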
Why Deep Jacobian Spectra Separate: Depth-Induced Scaling and Singular-Vector Alignment
arXiv:2602.12384v2 Announce Type: cross Abstract: Understanding why gradient-based training in deep networks exhibits strong implicit bias remains challenging, in part because tractable singular-value dynamics are typically available only for balanced deep linear models. We propose an alternative route based on...
Rational Neural Networks have Expressivity Advantages
arXiv:2602.12390v1 Announce Type: cross Abstract: We study neural networks with trainable low-degree rational activation functions and show that they are more expressive and parameter-efficient than modern piecewise-linear and smooth activations such as ELU, LeakyReLU, LogSigmoid, PReLU, ReLU, SELU, CELU, Sigmoid,...
What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis
arXiv:2602.12395v1 Announce Type: cross Abstract: Reinforcement learning (RL) with verifiable rewards has become a standard post-training stage for boosting visual reasoning in vision-language models, yet it remains unclear what capabilities RL actually improves compared with supervised fine-tuning as cold-start initialization...
AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning
arXiv:2602.12402v1 Announce Type: cross Abstract: Analog and mixed-signal (AMS) integrated circuits (ICs) lie at the core of modern computing and communications systems. However, despite the continued rise in design complexity, advances in AMS automation remain limited. This reflects the central...
Soft Contamination Means Benchmarks Test Shallow Generalization
arXiv:2602.12413v1 Announce Type: cross Abstract: If LLM training data is polluted with benchmark test data, then benchmark performance gives biased estimates of out-of-distribution (OOD) generalization. Typical decontamination filters use n-gram matching, which fails to detect semantic duplicates: sentences with equivalent...
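The n-gram filter being critiqued is easy to sketch: flag a training example if it shares any 8-gram with a test item (the strings and the choice of n=8 here are illustrative assumptions). Verbatim copies are caught; paraphrases slip through, which is the paper's point:

```python
def ngrams(text, n=8):
    """Set of word-level n-grams of a lowercased text."""
    toks = text.lower().split()
    return {tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)}

def is_contaminated(train_doc, test_doc, n=8):
    """True if the documents share any n-gram."""
    return bool(ngrams(train_doc, n) & ngrams(test_doc, n))

test_q = "what is the capital of france and when was it founded exactly"
verbatim = "quiz: what is the capital of france and when was it founded exactly"
paraphrase = "name France's capital city and give its founding date"
print(is_contaminated(verbatim, test_q))    # True: shared 8-gram
print(is_contaminated(paraphrase, test_q))  # False: semantic duplicate missed
```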
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
arXiv:2602.12424v1 Announce Type: cross Abstract: Benchmarks establish a standardized evaluation framework to systematically assess the performance of large language models (LLMs), facilitating objective comparisons and driving advancements in the field. However, existing benchmarks fail to differentiate question difficulty, limiting their...
Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward
arXiv:2602.12430v2 Announce Type: cross Abstract: The transition from monolithic language models to modular, skill-equipped agents marks a defining shift in how large language models (LLMs) are deployed in practice. Rather than encoding all procedural knowledge within model weights, agent skills...
Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models
arXiv:2602.12444v1 Announce Type: cross Abstract: Reinforcement learning (RL) is a powerful framework for optimal decision-making and control but often lacks provable guarantees for safety-critical applications. In this paper, we introduce a novel recovery-based shielding framework that enables safe RL with...
Designing RNAs with Language Models
arXiv:2602.12470v1 Announce Type: cross Abstract: RNA design, the task of finding a sequence that folds into a target secondary structure, has broad biological and biomedical impact but remains computationally challenging due to the exponentially large sequence space and exponentially many...
CLASE: A Hybrid Method for Chinese Legalese Stylistic Evaluation
arXiv:2602.12639v1 Announce Type: new Abstract: Legal text generated by large language models (LLMs) can usually achieve reasonable factual accuracy, but it frequently fails to adhere to the specialised stylistic norms and linguistic conventions of legal writing. In order to improve...
Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR
arXiv:2602.12642v1 Announce Type: new Abstract: Reward-maximizing RL methods enhance the reasoning performance of LLMs, but often reduce the diversity among outputs. Recent works address this issue by adopting GFlowNets, training LLMs to match a target distribution while jointly learning its...
Learning Ordinal Probabilistic Reward from Preferences
arXiv:2602.12660v1 Announce Type: new Abstract: Reward models are crucial for aligning large language models (LLMs) with human values and intentions. Existing approaches follow either Generative (GRMs) or Discriminative (DRMs) paradigms, yet both suffer from limitations: GRMs typically demand costly point-wise...
$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models
arXiv:2602.12674v1 Announce Type: new Abstract: Knowledge Distillation (KD) for Large Language Models (LLMs) has become increasingly important as models grow in size and complexity. While existing distillation approaches focus on imitating teacher behavior, they often overlook the original learning environment...
Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting
arXiv:2602.12746v1 Announce Type: new Abstract: Despite their impressive performance, self-supervised speech models often struggle to generalize to new languages and tend to forget previously acquired knowledge during continual training. To address this, we propose Lamer-SSL, a parameter-efficient framework that integrates...
Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews
arXiv:2602.12778v1 Announce Type: new Abstract: This study advances aspect-based sentiment analysis (ABSA) for Persian-language user reviews in the tourism domain, addressing challenges of low-resource languages. We propose a hybrid BERT-based model with Top-K routing and auxiliary losses to mitigate routing...
RAT-Bench: A Comprehensive Benchmark for Text Anonymization
arXiv:2602.12806v1 Announce Type: new Abstract: Data containing personal information is increasingly used to train, fine-tune, or query Large Language Models (LLMs). Text is typically scrubbed of identifying information prior to use, often with tools such as Microsoft's Presidio or Anthropic's...