International Law

LOW Academic International

Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment

arXiv:2603.07023v1 Announce Type: new Abstract: Despite the promise of Retrieval-Augmented Generation in grounding Multimodal Large Language Models with external knowledge, the transition to extensive contexts often leads to significant attention dilution and reasoning hallucinations. The surge in information density causes...

1 min 1 month, 1 week ago

ear

LOW Academic International

Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

arXiv:2603.07111v1 Announce Type: new Abstract: The Werewolf Game is a communication game where players' reasoning and discussion skills are essential. In this study, we present a Werewolf AI agent developed for the AIWolfDial 2024 shared task, co-hosted with the 17th...

1 min 1 month, 1 week ago

ear

LOW Academic International

Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language

arXiv:2603.07138v1 Announce Type: new Abstract: Emotion Recognition in Conversation (ERC) is critical for enabling natural human-machine interactions. However, existing methods predominantly employ categorical or dimensional emotion annotations, which often fail to adequately represent complex, subtle, or culturally specific emotional nuances....

1 min 1 month, 1 week ago

ear

LOW Academic International

Scaling Self-Supervised Speech Models Uncovers Deep Linguistic Relationships: Evidence from the Pacific Cluster

arXiv:2603.07238v1 Announce Type: new Abstract: Similarities between language representations derived from Self-Supervised Speech Models (S3Ms) have been observed to primarily reflect geographic proximity or surface typological similarities driven by recent expansion or contact, potentially missing deeper genealogical signals. We investigate...

1 min 1 month, 1 week ago

ear

LOW Academic International

Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin

arXiv:2603.07286v1 Announce Type: new Abstract: Global safety models exhibit strong performance across widely used benchmarks, yet their training data rarely captures the cultural and linguistic nuances of Taiwanese Mandarin. This limitation results in systematic blind spots when interpreting region-specific risks...

1 min 1 month, 1 week ago

icc

LOW Academic International

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios

arXiv:2603.07372v1 Announce Type: new Abstract: Quality Estimation (QE) is essential for assessing machine translation quality in reference-less settings, particularly for domain-specific and low-resource language scenarios. In this paper, we investigate sentence-level QE for English to Indic machine translation across four...

1 min 1 month, 1 week ago

ear

LOW Academic International

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling

arXiv:2603.07461v1 Announce Type: new Abstract: Standard transformers entangle all computation in a single residual stream, obscuring which components perform which functions. We introduce the Dual-Stream Transformer, which decomposes the residual stream into two functionally distinct components: a token stream updated...

1 min 1 month, 1 week ago

ear

LOW Academic International

Cross-Modal Taxonomic Generalization in (Vision-) Language Models

arXiv:2603.07474v1 Announce Type: new Abstract: What is the interplay between semantic representations learned by language models (LM) from surface form alone to those learned from more grounded evidence? We study this question for a scenario where part of the input...

1 min 1 month, 1 week ago

ear

LOW Academic International

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

arXiv:2603.07475v1 Announce Type: new Abstract: Autoregressive (AR) language models form representations incrementally through left-to-right prediction, whereas diffusion language models (dLLMs) are trained via full-sequence denoising. Although recent dLLMs match AR performance, it remains unclear whether diffusion objectives fundamentally reshape internal...

1 min 1 month, 1 week ago

ear

LOW Academic International

TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning

arXiv:2603.07528v1 Announce Type: new Abstract: Table reasoning requires models to jointly perform semantic understanding and precise numerical operations. Most existing methods rely on a single-turn reasoning paradigm over tables which suffers from context overflow and weak numerical sensitivity. To address...

1 min 1 month, 1 week ago

ear

LOW Academic International

Learning-free L2-Accented Speech Generation using Phonological Rules

arXiv:2603.07550v1 Announce Type: new Abstract: Accent plays a crucial role in speaker identity and inclusivity in speech technologies. Existing accented text-to-speech (TTS) systems either require large-scale accented datasets or lack fine-grained phoneme-level controllability. We propose a accented TTS framework that...

1 min 1 month, 1 week ago

ear

LOW Academic International

QuadAI at SemEval-2026 Task 3: Ensemble Learning of Hybrid RoBERTa and LLMs for Dimensional Aspect-Based Sentiment Analysis

arXiv:2603.07766v1 Announce Type: new Abstract: We present our system for SemEval-2026 Task 3 on dimensional aspect-based sentiment regression. Our approach combines a hybrid RoBERTa encoder, which jointly predicts sentiment using regression and discretized classification heads, with large language models (LLMs)...

1 min 1 month, 1 week ago

ear

LOW Academic International

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

arXiv:2603.07779v1 Announce Type: new Abstract: Training next-generation code generation models requires high-quality datasets, yet existing datasets face difficulty imbalance, format inconsistency, and data quality problems. We address these challenges through systematic data processing and difficulty scaling. We introduce a four-stage...

1 min 1 month, 1 week ago

ear

LOW Academic International

An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data

arXiv:2603.07841v1 Announce Type: new Abstract: Recent advances in large language models has strengthened Text2SQL systems that translate natural language questions into database queries. A persistent deployment challenge is to assess a newly trained Text2SQL system on an unseen and unlabeled...

1 min 1 month, 1 week ago

ear

LOW Academic International

How Attention Sinks Emerge in Large Language Models: An Interpretability Perspective

arXiv:2603.06591v1 Announce Type: new Abstract: Large Language Models (LLMs) often allocate disproportionate attention to specific tokens, a phenomenon commonly referred to as the attention sink. While such sinks are generally considered detrimental, prior studies have identified a notable exception: the...

1 min 1 month, 1 week ago

ear

LOW Academic International

Khatri-Rao Clustering for Data Summarization

arXiv:2603.06602v1 Announce Type: new Abstract: As datasets continue to grow in size and complexity, finding succinct yet accurate data summaries poses a key challenge. Centroid-based clustering, a widely adopted approach to address this challenge, finds informative summaries of datasets in...

1 min 1 month, 1 week ago

ear

LOW Academic International

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

arXiv:2603.06605v1 Announce Type: new Abstract: Electronic health records (EHR) are irregular, asynchronous multivariate time series. As time-series foundation models increasingly tokenize events rather than discretizing time, the input layout becomes a key design choice. Grids expose time$\times$variable structure but require...

1 min 1 month, 1 week ago

ear

LOW Academic International

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

arXiv:2603.06609v1 Announce Type: new Abstract: Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or p-values for assessing whether individual...

1 min 1 month, 1 week ago

ear

LOW Academic International

OptiRoulette Optimizer: A New Stochastic Meta-Optimizer for up to 5.3x Faster Convergence

arXiv:2603.06613v1 Announce Type: new Abstract: This paper presents OptiRoulette, a stochastic meta-optimizer that selects update rules during training instead of fixing a single optimizer. The method combines warmup optimizer locking, random sampling from an active optimizer pool, compatibility-aware learning-rate scaling...

1 min 1 month, 1 week ago

ear

LOW Academic International

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

arXiv:2603.06615v1 Announce Type: new Abstract: For multivariate co-generation in scientific applications, we advocate pairwise block rather than joint modeling of all variables. This design mitigates the computational burden and data imbalance. To this end, we propose an Annealed Co-Generation (ACG)...

1 min 1 month, 1 week ago

ear

LOW Academic International

Not all tokens are needed(NAT): token efficient reinforcement learning

arXiv:2603.06619v1 Announce Type: new Abstract: Reinforcement learning (RL) has become a key driver of progress in large language models, but scaling RL to long chain-of-thought (CoT) trajectories is increasingly constrained by backpropagation over every generated token. Even with optimized rollout...

1 min 1 month, 1 week ago

ear

LOW Academic International

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

arXiv:2603.06621v1 Announce Type: new Abstract: Process Reward Models (PRMs) are rapidly becoming the backbone of LLM reasoning pipelines, yet we demonstrate that state-of-the-art PRMs are systematically exploitable under adversarial optimization pressure. To address this, we introduce a three-tiered diagnostic framework...

1 min 1 month, 1 week ago

ear

LOW Academic International

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

arXiv:2603.06622v1 Announce Type: new Abstract: Accurate short-term power load forecasting is important to effectively manage, optimize, and ensure the robustness of modern power systems. This paper performs an empirical evaluation of a traditional statistical model and deep learning approaches for...

1 min 1 month, 1 week ago

ear

LOW Academic International

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

arXiv:2603.06626v1 Announce Type: new Abstract: Traditional Mixture-of-Experts (MoE) training typically proceeds without any structural priors, effectively requiring the model to simultaneously train expert weights while searching for an optimal routing policy within a vast combinatorial space. This entanglement often leads...

1 min 1 month, 1 week ago

ear

LOW Academic International

A new Uncertainty Principle in Machine Learning

arXiv:2603.06634v1 Announce Type: new Abstract: Many scientific problems in the context of machine learning can be reduced to the search of polynomial answers in appropriate variables. The Hevisidization of arbitrary polynomial is actually provided by one-and-the same two-layer expression. What...

1 min 1 month, 1 week ago

ear

LOW Academic International

Graph Property Inference in Small Language Models: Effects of Representation and Inference Strategy

arXiv:2603.06635v1 Announce Type: new Abstract: Recent progress in language modeling has expanded the range of tasks that can be approached through natural language interfaces, including problems that require structured reasoning. However, it remains unclear how effectively limited-capacity language models can...

1 min 1 month, 1 week ago

ear

LOW Academic International

SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts

arXiv:2603.06636v1 Announce Type: new Abstract: Due to the strong context-awareness capabilities demonstrated by large language models (LLMs), recent research has begun exploring their integration into smart home assistants to help users manage and adjust their living environments. While LLMs have...

1 min 1 month, 1 week ago

ear

LOW Academic International

SR-TTT: Surprisal-Aware Residual Test-Time Training

arXiv:2603.06642v1 Announce Type: new Abstract: Test-Time Training (TTT) language models achieve theoretically infinite context windows with an O(1) memory footprint by replacing the standard exact-attention KV-cache with hidden state ``fast weights'' W_fast updated via self-supervised learning during inference. However, pure...

1 min 1 month, 1 week ago

ear

LOW Academic International

From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

arXiv:2603.06720v1 Announce Type: new Abstract: Access to electronic health records (EHRs) for digital health research is often limited by privacy regulations and institutional barriers. Synthetic EHRs have been proposed as a way to enable safe and sovereign data sharing; however,...

1 min 1 month, 1 week ago

ear

LOW Academic International

Bi Directional Feedback Fusion for Activity Aware Forecasting of Indoor CO2 and PM2.5

arXiv:2603.06724v1 Announce Type: new Abstract: Indoor air quality (IAQ) forecasting plays a critical role in safeguarding occupant health, ensuring thermal comfort, and supporting intelligent building control. However, predicting future concentrations of key pollutants such as carbon dioxide (CO2) and fine...

1 min 1 month, 1 week ago

ear

Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment

Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language

Scaling Self-Supervised Speech Models Uncovers Deep Linguistic Relationships: Evidence from the Pacific Cluster

Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling

Cross-Modal Taxonomic Generalization in (Vision-) Language Models

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning

Learning-free L2-Accented Speech Generation using Phonological Rules

QuadAI at SemEval-2026 Task 3: Ensemble Learning of Hybrid RoBERTa and LLMs for Dimensional Aspect-Based Sentiment Analysis

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data

How Attention Sinks Emerge in Large Language Models: An Interpretability Perspective

Khatri-Rao Clustering for Data Summarization

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

OptiRoulette Optimizer: A New Stochastic Meta-Optimizer for up to 5.3x Faster Convergence

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

Not all tokens are needed(NAT): token efficient reinforcement learning

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

A new Uncertainty Principle in Machine Learning

Graph Property Inference in Small Language Models: Effects of Representation and Inference Strategy

SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts

SR-TTT: Surprisal-Aware Residual Test-Time Training

From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

Bi Directional Feedback Fusion for Activity Aware Forecasting of Indoor CO2 and PM2.5

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.