AI & Technology Law

LOW Academic International

What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

arXiv:2602.12395v1 Announce Type: cross Abstract: Reinforcement learning (RL) with verifiable rewards has become a standard post-training stage for boosting visual reasoning in vision-language models, yet it remains unclear what capabilities RL actually improves compared with supervised fine-tuning as cold-start initialization...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

arXiv:2602.12444v1 Announce Type: cross Abstract: Reinforcement learning (RL) is a powerful framework for optimal decision-making and control but often lacks provable guarantees for safety-critical applications. In this paper, we introduce a novel recovery-based shielding framework that enables safe RL with...

1 min 1 month, 2 weeks ago

ai

LOW Academic United States

Designing RNAs with Language Models

arXiv:2602.12470v1 Announce Type: cross Abstract: RNA design, the task of finding a sequence that folds into a target secondary structure, has broad biological and biomedical impact but remains computationally challenging due to the exponentially large sequence space and exponentially many...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification

arXiv:2602.12575v1 Announce Type: new Abstract: Psychological scale refinement traditionally relies on response-based methods such as factor analysis, item response theory, and network psychometrics to optimize item composition. Although rigorous, these approaches require large samples and may be constrained by data...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

arXiv:2602.12674v1 Announce Type: new Abstract: Knowledge Distillation (KD) for Large Language Models (LLMs) has become increasingly important as models grow in size and complexity. While existing distillation approaches focus on imitating teacher behavior, they often overlook the original learning environment...

1 min 1 month, 2 weeks ago

llm

LOW Academic International

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

arXiv:2602.12746v1 Announce Type: new Abstract: Despite their impressive performance, self-supervised speech models often struggle to generalize to new languages and tend to forget previously acquired knowledge during continual training. To address this, we propose Lamer-SSL, a parameter-efficient framework that integrates...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews

arXiv:2602.12778v1 Announce Type: new Abstract: This study advances aspect-based sentiment analysis (ABSA) for Persian-language user reviews in the tourism domain, addressing challenges of low-resource languages. We propose a hybrid BERT-based model with Top-K routing and auxiliary losses to mitigate routing...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

arXiv:2602.12871v1 Announce Type: new Abstract: We introduce MentalBench, a benchmark for evaluating psychiatric diagnostic decision-making in large language models (LLMs). Existing mental health benchmarks largely rely on social media data, limiting their ability to assess DSM-grounded diagnostic judgments. At the...

1 min 1 month, 2 weeks ago

llm

LOW Academic International

BaziQA-Benchmark: Evaluating Symbolic and Temporally Compositional Reasoning in Large Language Models

arXiv:2602.12889v1 Announce Type: new Abstract: We present BaziQA-Benchmark, a standardized benchmark for evaluating symbolic and temporally compositional reasoning in large language models. The benchmark is derived from 200 professionally curated, multiple-choice problems from the Global Fortune-teller Competition (2021--2025), where each...

1 min 1 month, 2 weeks ago

ai

LOW Academic United Kingdom

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

arXiv:2602.12911v1 Announce Type: new Abstract: Code-switching (CS), in which Vietnamese speech incorporates English words such as drug names or procedures, is a common phenomenon in Vietnamese medical communication. This creates challenges for Automatic Speech Recognition (ASR) systems, especially in low-resource...

1 min 1 month, 2 weeks ago

ai

LOW Academic European Union

Curriculum Learning and Pseudo-Labeling Improve the Generalization of Multi-Label Arabic Dialect Identification Models

arXiv:2602.12937v1 Announce Type: new Abstract: Arabic Dialect Identification (ADI) has long been modeled as a single-label classification task, but recent work has argued that it should be framed as a multi-label classification task. However, ADI remains constrained by the availability...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Evaluating the Homogeneity of Keyphrase Prediction Models

arXiv:2602.12989v1 Announce Type: new Abstract: Keyphrases, which are useful in several NLP and IR applications, are either extracted from text or predicted by generative models. Contrary to keyphrase extraction approaches, keyphrase generation models can predict keyphrases that do not appear...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution

arXiv:2602.13059v1 Announce Type: new Abstract: Question answering (QA) over structured tables requires not only accurate answers but also transparency about which cells support them. Existing table QA systems rarely provide fine-grained attribution, so even correct answers often lack verifiable grounding,...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Exploring a New Competency Modeling Process with Large Language Models

arXiv:2602.13084v1 Announce Type: new Abstract: Competency modeling is widely used in human resource management to select, develop, and evaluate talent. However, traditional expert-driven approaches rely heavily on manual analysis of large volumes of interview transcripts, making them costly and prone...

1 min 1 month, 2 weeks ago

llm

LOW Academic International

From sunblock to softblock: Analyzing the correlates of neology in published writing and on social media

arXiv:2602.13123v1 Announce Type: new Abstract: Living languages are shaped by a host of conflicting internal and external evolutionary pressures. While some of these pressures are universal across languages and cultures, others differ depending on the social and conversational context: language...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

OpenLID-v3: Improving the Precision of Closely Related Language Identification -- An Experience Report

arXiv:2602.13139v1 Announce Type: new Abstract: Language identification (LID) is an essential step in building high-quality multilingual datasets from web data. Existing LID tools (such as OpenLID or GlotLID) often struggle to identify closely related languages and to distinguish valid natural...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

HyperMLP: An Integrated Perspective for Sequence Modeling

arXiv:2602.12601v1 Announce Type: cross Abstract: Self-attention is often viewed as probabilistic query-key lookup, motivating designs that preserve normalized attention scores and fixed positional semantics. We advocate a simpler and more unified perspective: an autoregressive attention head can be viewed as...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph

arXiv:2602.12735v1 Announce Type: cross Abstract: Effectively retrieving, reasoning, and understanding multimodal information remains a critical challenge for agentic systems. Traditional Retrieval-augmented Generation (RAG) methods rely on linear interaction histories, which struggle to handle long-context tasks, especially those involving information-sparse yet...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

The Appeal and Reality of Recycling LoRAs with Adaptive Merging

arXiv:2602.12323v1 Announce Type: new Abstract: The widespread availability of fine-tuned LoRA modules for open pre-trained models has led to an interest in methods that can adaptively merge LoRAs to improve performance. These methods typically include some way of selecting LoRAs...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Wireless TokenCom: RL-Based Tokenizer Agreement for Multi-User Wireless Token Communications

arXiv:2602.12338v1 Announce Type: new Abstract: Token Communications (TokenCom) has recently emerged as an effective new paradigm, where tokens are the unified units of multimodal communications and computations, enabling efficient digital semantic- and goal-oriented communications in future wireless networks. To establish...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Continuous Diffusion Models Can Obey Formal Syntax

arXiv:2602.12468v1 Announce Type: new Abstract: Diffusion language models offer a promising alternative to autoregressive models due to their global, non-causal generation process, but their continuous latent dynamics make discrete constraints -- e.g., the output should be a JSON file that...

1 min 1 month, 2 weeks ago

ai

LOW Academic United Kingdom

Regularized Meta-Learning for Improved Generalization

arXiv:2602.12469v1 Announce Type: new Abstract: Deep ensemble methods often improve predictive performance, yet they suffer from three practical limitations: redundancy among base models that inflates computational cost and degrades conditioning, unstable weighting under multicollinearity, and overfitting in meta-learning pipelines. We...

1 min 1 month, 2 weeks ago

ai

LOW Academic United States

Tight Bounds for Logistic Regression with Large Stepsize Gradient Descent in Low Dimension

arXiv:2602.12471v1 Announce Type: new Abstract: We consider the optimization problem of minimizing the logistic loss with gradient descent to train a linear model for binary classification with separable data. With a budget of $T$ iterations, it was recently shown that...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space Models

arXiv:2602.12499v1 Announce Type: new Abstract: The recent empirical success of Mamba and other selective state space models (SSMs) has renewed interest in non-attention architectures for sequence modeling, yet their theoretical foundations remain underexplored. We present a first-step analysis of generalization...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Analytical Results for Two Exponential Family Distributions in Hierarchical Dirichlet Processes

arXiv:2602.12527v1 Announce Type: new Abstract: The Hierarchical Dirichlet Process (HDP) provides a flexible Bayesian nonparametric framework for modeling grouped data with a shared yet unbounded collection of mixture components. While existing applications of the HDP predominantly focus on the Dirichlet-multinomial...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Fractional Order Federated Learning for Battery Electric Vehicle Energy Consumption Modeling

arXiv:2602.12567v1 Announce Type: new Abstract: Federated learning on connected battery electric vehicles (BEVs) faces severe instability due to intermittent connectivity, time-varying client participation, and pronounced client-to-client variation induced by diverse operating conditions. Conventional FedAvg and many advanced methods can suffer from...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Block-Sample MAC-Bayes Generalization Bounds

arXiv:2602.12605v1 Announce Type: new Abstract: We present a family of novel block-sample MAC-Bayes bounds (mean approximately correct). While PAC-Bayes bounds (probably approximately correct) typically give bounds for the generalization error that hold with high probability, MAC-Bayes bounds have a similar...

1 min 1 month, 2 weeks ago

ai

LOW Academic European Union

Coden: Efficient Temporal Graph Neural Networks for Continuous Prediction

arXiv:2602.12613v1 Announce Type: new Abstract: Temporal Graph Neural Networks (TGNNs) are pivotal in processing dynamic graphs. However, existing TGNNs primarily target one-time predictions for a given temporal span, whereas many practical applications require continuous predictions, that is, predictions are issued frequently...

1 min 1 month, 2 weeks ago

neural network

LOW Academic European Union

Formalizing the Sampling Design Space of Diffusion-Based Generative Models via Adaptive Solvers and Wasserstein-Bounded Timesteps

arXiv:2602.12624v1 Announce Type: new Abstract: Diffusion-based generative models have achieved remarkable performance across various domains, yet their practical deployment is often limited by high sampling costs. While prior work focuses on training objectives or individual solvers, the holistic design of...

1 min 1 month, 2 weeks ago

ai

LOW Academic International

Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL

arXiv:2602.12636v1 Announce Type: new Abstract: Designing suitable rewards poses a significant challenge in reinforcement learning (RL), especially for embodied manipulation. Trajectory success rewards are suitable for human judges or model fitting, but the sparsity severely limits RL sample efficiency. While...

1 min 1 month, 2 weeks ago

ai
