Diverse Word Choices, Same Reference: Annotating Lexically-Rich Cross-Document Coreference
arXiv:2602.17424v1 Announce Type: new Abstract: Cross-document coreference resolution (CDCR) identifies and links mentions of the same entities and events across related documents, enabling content analysis that aggregates information at the level of discourse participants. However, existing datasets primarily focus on...
AIDG: Evaluating Asymmetry Between Information Extraction and Containment in Multi-Turn Dialogue
arXiv:2602.17443v1 Announce Type: new Abstract: Evaluating the strategic reasoning capabilities of Large Language Models (LLMs) requires moving beyond static benchmarks to dynamic, multi-turn interactions. We introduce AIDG (Adversarial Information Deduction Game), a game-theoretic framework that probes the asymmetry between information...
Auditing Reciprocal Sentiment Alignment: Inversion Risk, Dialect Representation and Intent Misalignment in Transformers
arXiv:2602.17469v1 Announce Type: new Abstract: The core theme of bidirectional alignment is ensuring that AI systems accurately understand human intent and that humans can trust AI behavior. However, this loop fractures significantly across language barriers. Our research addresses Cross-Lingual Sentiment...
Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems
arXiv:2602.17542v1 Announce Type: new Abstract: Fine-grained skill representations, commonly referred to as knowledge components (KCs), are fundamental to many approaches in student modeling and learning analytics. However, KC-level correctness labels are rarely available in real-world datasets, especially for open-ended programming...
Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning
arXiv:2602.17546v1 Announce Type: new Abstract: Instruction-following language models are trained to be helpful and safe, yet their safety behavior can deteriorate under benign fine-tuning and worsen under adversarial updates. Existing defenses often offer limited protection or force a trade-off between...
The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?
arXiv:2602.17598v1 Announce Type: new Abstract: Current speech LLMs largely perform implicit ASR: on tasks solvable from a transcript, they are behaviorally and mechanistically equivalent to simple Whisper$\to$LLM cascades. We show this through matched-backbone testing across four speech LLMs and six...
Unmasking the Factual-Conceptual Gap in Persian Language Models
arXiv:2602.17623v1 Announce Type: new Abstract: While emerging Persian NLP benchmarks have expanded into pragmatics and politeness, they rarely distinguish between memorized cultural facts and the ability to reason about implicit social norms. We introduce DivanBench, a diagnostic benchmark focused on...
Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking
arXiv:2602.17653v1 Announce Type: new Abstract: Recent work has shown that language models (LMs) trained on synthetic corpora can exhibit typological preferences that resemble cross-linguistic regularities in human languages, particularly for syntactic phenomena such as word order. In this paper, we...
What Language is This? Ask Your Tokenizer
arXiv:2602.17655v1 Announce Type: new Abstract: Language Identification (LID) is an important component of many multilingual natural language processing pipelines, where it facilitates corpus curation, training data analysis, and cross-lingual evaluation of large language models. Despite near-perfect performance on high-resource languages,...
Better Think Thrice: Learning to Reason Causally with Double Counterfactual Consistency
arXiv:2602.16787v1 Announce Type: cross Abstract: Despite their strong performance on reasoning benchmarks, large language models (LLMs) have proven brittle when presented with counterfactual questions, suggesting weaknesses in their causal reasoning ability. While recent work has demonstrated that labeled counterfactual tasks...
Hybrid-Gym: Training Coding Agents to Generalize Across Tasks
arXiv:2602.16819v1 Announce Type: cross Abstract: When assessing the quality of coding agents, predominant benchmarks, such as SWE-Bench, focus on solving single issues on GitHub. In contrast, in real use these agents tackle a wider variety of complex tasks that involve other...
MMCAformer: Macro-Micro Cross-Attention Transformer for Traffic Speed Prediction with Microscopic Connected Vehicle Driving Behavior
arXiv:2602.16730v1 Announce Type: new Abstract: Accurate speed prediction is crucial for proactive traffic management to enhance traffic efficiency and safety. Existing studies have primarily relied on aggregated, macroscopic traffic flow data to predict future traffic trends, whereas road traffic dynamics...
A Few-Shot LLM Framework for Extreme Day Classification in Electricity Markets
arXiv:2602.16735v1 Announce Type: new Abstract: This paper proposes a few-shot classification framework based on Large Language Models (LLMs) to predict whether the next day will have spikes in real-time electricity prices. The approach aggregates system state information, including electricity demand,...
Real-time Secondary Crash Likelihood Prediction Excluding Post Primary Crash Features
arXiv:2602.16739v1 Announce Type: new Abstract: Secondary crash likelihood prediction is a critical component of an active traffic management system to mitigate congestion and adverse impacts caused by secondary crashes. However, existing approaches mainly rely on post-crash features (e.g., crash type...
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
arXiv:2602.16740v1 Announce Type: new Abstract: In mechanistic interpretability, recent work scrutinizes transformer "circuits": sparse, single- or multi-layer sub-computations that may reflect human-understandable functions. Yet these circuits are rarely acid-tested for their stability across different instances...
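One generic way to acid-test head stability is to compare attention patterns head-by-head across independently trained model instances. A minimal sketch on toy arrays (illustrative only, not this paper's protocol; the 4-head, 8x8 patterns are made up):

```python
import numpy as np

def head_similarity(A, B):
    """Cosine similarity between two flattened attention patterns."""
    a, b = A.ravel(), B.ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def stability_matrix(run1, run2):
    """Pairwise similarities between the heads of two model instances."""
    return np.array([[head_similarity(A, B) for B in run2] for A in run1])

rng = np.random.default_rng(0)
# toy: 4 heads with 8x8 attention patterns; run2 perturbs run1 slightly,
# mimicking a circuit that survives retraining under a different seed
run1 = rng.random((4, 8, 8))
run2 = run1 + 0.05 * rng.standard_normal((4, 8, 8))
S = stability_matrix(run1, run2)
print(np.round(np.diag(S), 3))   # stable heads match themselves: values near 1
```

A head whose diagonal entry stays near 1 across seeds is a candidate for a "universal" circuit component; a low value signals an instance-specific computation.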
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning
arXiv:2602.16742v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has been shown effective in enhancing the visual reflection and reasoning capabilities of Large Multimodal Models (LMMs). However, existing datasets are predominantly derived from either small-scale manual construction or...
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
arXiv:2602.16746v1 Announce Type: new Abstract: Grokking -- the delayed transition from memorization to generalization in small algorithmic tasks -- remains poorly understood. We present a geometric analysis of optimization dynamics in transformers trained on modular arithmetic. PCA of attention weight...
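The "PCA of weight trajectories" step can be sketched generically: flatten weight snapshots taken over training, run PCA, and check how much variance the top components capture. A minimal sketch with a synthetic low-dimensional trajectory (the planar toy run is an assumption for illustration, not the paper's data):

```python
import numpy as np

def trajectory_pca(snapshots, k=2):
    """PCA of flattened weight snapshots taken over training steps.
    Returns the top-k explained-variance ratios and projected trajectory."""
    X = snapshots - snapshots.mean(axis=0)            # center across time
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    ratios = s**2 / np.sum(s**2)
    return ratios[:k], X @ Vt[:k].T

rng = np.random.default_rng(0)
T, d = 200, 50
# toy "training run": motion confined to a 2-D plane in weight space, plus noise
u = rng.standard_normal(d); u /= np.linalg.norm(u)
v = rng.standard_normal(d); v -= (v @ u) * u; v /= np.linalg.norm(v)
t = np.linspace(0.0, 1.0, T)[:, None]
snapshots = 3.0 * t * u + 2.0 * t**2 * v + 0.01 * rng.standard_normal((T, d))
ratios, traj = trajectory_pca(snapshots, k=2)
print(ratios.sum())   # near 1: the trajectory is effectively two-dimensional
```

A top-k explained-variance sum close to 1 is what "low-dimensional optimization dynamics" means operationally here.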
Attending to Routers Aids Indoor Wireless Localization
arXiv:2602.16762v1 Announce Type: new Abstract: Modern machine learning-based wireless localization using Wi-Fi signals continues to face significant challenges in achieving strong performance across diverse environments. A major limitation is that most existing algorithms do not appropriately weight the information from...
Machine Learning Argument of Latitude Error Model for LEO Satellite Orbit and Covariance Correction
arXiv:2602.16764v1 Announce Type: new Abstract: Low Earth orbit (LEO) satellites are leveraged to support new position, navigation, and timing (PNT) service alternatives to GNSS. These alternatives require accurate propagation of satellite position and velocity with a realistic quantification of uncertainty....
Escaping the Cognitive Well: Efficient Competition Math with Off-the-Shelf Models
arXiv:2602.16793v1 Announce Type: new Abstract: In the past year, custom and unreleased math reasoning models reached gold medal performance on the International Mathematical Olympiad (IMO). Similar performance was then reported using large-scale inference on publicly available models but at prohibitive...
Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning
arXiv:2602.16796v1 Announce Type: new Abstract: Fine-tuning pre-trained diffusion and flow models to optimize downstream utilities is central to real-world deployment. Existing entropy-regularized methods primarily maximize expected reward, providing no mechanism to shape tail behavior. However, tail control is often essential:...
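"Tail behavior" here refers to objectives such as conditional value-at-risk (CVaR) rather than the expected reward. A minimal numerical sketch of the distinction (illustrative only; the Gaussian reward distribution is an assumption, not the paper's setting):

```python
import numpy as np

def lower_cvar(rewards, alpha=0.1):
    """Mean reward within the worst alpha-fraction of samples (lower-tail CVaR)."""
    cutoff = np.quantile(rewards, alpha)
    return float(rewards[rewards <= cutoff].mean())

rng = np.random.default_rng(0)
rewards = rng.normal(loc=1.0, scale=1.0, size=100_000)
# a generator can look fine on average while its lower tail is much worse
print(round(float(rewards.mean()), 2), round(lower_cvar(rewards, 0.1), 2))
```

Maximizing the mean leaves the lower-tail value untouched, which is exactly the gap a tail-aware fine-tuning objective targets.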
TopoFlow: Physics-guided Neural Networks for high-resolution air quality prediction
arXiv:2602.16821v1 Announce Type: new Abstract: We propose TopoFlow (Topography-aware pollutant Flow learning), a physics-guided neural network for efficient, high-resolution air quality prediction. To explicitly embed physical processes into the learning framework, we identify two critical factors governing pollutant dynamics: topography...
HiVAE: Hierarchical Latent Variables for Scalable Theory of Mind
arXiv:2602.16826v1 Announce Type: new Abstract: Theory of mind (ToM) enables AI systems to infer agents' hidden goals and mental states, but existing approaches focus mainly on small, human-understandable gridworld spaces. We introduce HiVAE, a hierarchical variational architecture that scales...
Learning under noisy supervision is governed by a feedback-truth gap
arXiv:2602.16829v1 Announce Type: new Abstract: When feedback is absorbed faster than task structure can be evaluated, the learner will favor feedback over truth. A two-timescale model shows this feedback-truth gap is inevitable whenever the two rates differ and vanishes only...
VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training -- A Chess Case Study
arXiv:2602.16833v1 Announce Type: new Abstract: Exploration remains a key bottleneck for reinforcement learning (RL) post-training of large language models (LLMs), where sparse feedback and large action spaces can lead to premature collapse into repetitive behaviors. We propose Verbalized Action Masking...
A Residual-Aware Theory of Position Bias in Transformers
arXiv:2602.16837v1 Announce Type: new Abstract: Transformer models systematically favor certain token positions, yet the architectural origins of this position bias remain poorly understood. Under causal masking at infinite depth, prior theoretical analyses of attention rollout predict an inevitable collapse of...
Training Large Reasoning Models Efficiently via Progressive Thought Encoding
arXiv:2602.16839v1 Announce Type: new Abstract: Large reasoning models (LRMs) excel on complex problems but face a critical barrier to efficiency: reinforcement learning (RL) training requires long rollouts for outcome-based rewards, where autoregressive decoding dominates time and memory usage. While sliding-window...
What is the Value of Censored Data? An Exact Analysis for the Data-driven Newsvendor
arXiv:2602.16842v1 Announce Type: new Abstract: We study the offline data-driven newsvendor problem with censored demand data. In contrast to prior works where demand is fully observed, we consider the setting where demand is censored at the inventory level and only...
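Censoring at the inventory level means only sales = min(demand, inventory) are recorded, which biases naive quantile-based ordering rules. A toy sketch of the effect (the exponential demand distribution and critical ratio are assumptions for illustration, not the paper's analysis):

```python
import numpy as np

rng = np.random.default_rng(0)
n, inventory = 100_000, 5.0
demand = rng.exponential(scale=5.0, size=n)     # latent demand, never fully seen
sales = np.minimum(demand, inventory)           # what the retailer records

q = 0.7                                         # critical ratio b / (b + h)
oracle_order = np.quantile(demand, q)           # optimal with uncensored demand
naive_order = np.quantile(sales, q)             # plug-in on censored sales
print(round(oracle_order, 2), round(naive_order, 2))
```

Because the relevant demand quantile exceeds the historical inventory level, the plug-in estimate is capped at that level and systematically under-orders, which is why censored-data analyses need more than the empirical quantile.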
On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking
arXiv:2602.16849v1 Announce Type: new Abstract: We present a comprehensive analysis of how two-layer neural networks learn features to solve the modular addition task. Our work provides a full mechanistic interpretation of the learned model and a theoretical explanation of its...
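The Fourier-feature mechanism known from interpretability work on modular addition can be checked directly: logits of the form sum_k cos(2*pi*k*(a+b-c)/p) peak exactly at c = (a+b) mod p, since every cosine term equals 1 only when a+b-c is a multiple of p. A minimal sketch (the modulus and frequency set are arbitrary choices for illustration):

```python
import numpy as np

p = 13
freqs = np.array([1, 3, 5])          # a small set of Fourier frequencies

def logits(a, b):
    c = np.arange(p)
    # each cosine equals 1 iff (a + b - c) ≡ 0 (mod p), so the sum peaks there
    return np.cos(2 * np.pi * np.outer(freqs, a + b - c) / p).sum(axis=0)

# the argmax recovers modular addition for every input pair
assert all(int(np.argmax(logits(a, b))) == (a + b) % p
           for a in range(p) for b in range(p))
print(int(np.argmax(logits(7, 9))))  # 3 == (7 + 9) % 13
```

This identity is why periodic (Fourier) embeddings are a natural solution for the network to converge to on this task.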
Position: Why a Dynamical Systems Perspective is Needed to Advance Time Series Modeling
arXiv:2602.16864v1 Announce Type: new Abstract: Time series (TS) modeling has come a long way from early statistical, mainly linear, approaches to the current trend in TS foundation models. With a lot of hype and industrial demand in this field, it...