International Law

LOW Academic International

ToolMATH: A Math Tool Benchmark for Realistic Long-Horizon Multi-Tool Reasoning

arXiv:2602.21265v1 Announce Type: new Abstract: We introduce \ToolMATH, a math-grounded benchmark that evaluates tool-augmented language models in realistic multi-tool environments where the output depends on calling schema-specified tools and sustaining multi-step execution. It turns math problems into a controlled, correctness-checkable...

1 min 2 months ago

ear

LOW Academic International

VecGlypher: Unified Vector Glyph Generation with Language Models

arXiv:2602.21461v1 Announce Type: new Abstract: Vector glyphs are the atomic units of digital typography, yet most learning-based pipelines still depend on carefully curated exemplar sheets and raster-to-vector postprocessing, which limits accessibility and editability. We introduce VecGlypher, a single multimodal language...

1 min 2 months ago

ear

LOW Academic International

Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment

arXiv:2602.21543v1 Announce Type: new Abstract: Multilingual pretraining typically lacks explicit alignment signals, leading to suboptimal cross-lingual alignment in the representation space. In this work, we show that training standard pretrained models for cross-lingual alignment with a multi-way parallel corpus in...

1 min 2 months ago

ear

LOW Academic International

When More Is Less: A Systematic Analysis of Spatial and Commonsense Information for Visual Spatial Reasoning

arXiv:2602.21619v1 Announce Type: new Abstract: Visual spatial reasoning (VSR) remains challenging for modern vision-language models (VLMs), despite advances in multimodal architectures. A common strategy is to inject additional information at inference time, such as explicit spatial cues, external commonsense knowledge,...

1 min 2 months ago

ear

LOW Academic International

RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning

arXiv:2602.21628v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a prevailing paradigm for enhancing reasoning in Multimodal Large Language Models (MLLMs). However, relying solely on outcome supervision risks reward hacking, where models learn spurious reasoning...

1 min 2 months ago

ear

LOW Academic European Union

Multi-dimensional Assessment and Explainable Feedback for Counselor Responses to Client Resistance in Text-based Counseling with LLMs

arXiv:2602.21638v1 Announce Type: new Abstract: Effectively addressing client resistance is a sophisticated clinical skill in psychological counseling, yet practitioners often lack timely and scalable supervisory feedback to refine their approaches. Although current NLP research has examined overall counseling quality and...

1 min 2 months ago

ear

LOW Academic International

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion

arXiv:2602.21646v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have achieved notable success in enhancing translation performance by integrating multimodal information. However, existing research primarily focuses on image-guided methods, whose applicability is constrained by the scarcity of multilingual image-text...

1 min 2 months ago

ear

LOW Academic International

DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation

arXiv:2602.21669v1 Announce Type: new Abstract: Knowledge Distillation (KD) has emerged as a crucial technique for compressing Large Language Models (LLMs). Although existing cross-tokenizer KD methods have made notable progress, their effectiveness remains constrained by suboptimal alignment across sequence and vocabulary...

1 min 2 months ago

ear

LOW Academic International

Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning

arXiv:2602.21720v1 Announce Type: new Abstract: Human recursive numeral systems (i.e., counting systems such as English base-10 numerals), like many other grammatical systems, are highly regular. Following prior work that relates cross-linguistic tendencies to biases in learning, we ask whether regular...

1 min 2 months ago

ear

LOW Academic International

Explore-on-Graph: Incentivizing Autonomous Exploration of Large Language Models on Knowledge Graphs with Path-refined Reward Modeling

arXiv:2602.21728v1 Announce Type: new Abstract: The reasoning process of Large Language Models (LLMs) is often plagued by hallucinations and missing facts in question-answering tasks. A promising solution is to ground LLMs' answers in verifiable knowledge sources, such as Knowledge Graphs...

1 min 2 months ago

ear

LOW Academic International

D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models

arXiv:2602.21786v1 Announce Type: new Abstract: Chain-of-Thought (CoT) distillation from Large Language Models (LLMs) often induces "overthinking" in Small Language Models (SLMs), leading to performance degradation and excessive token consumption. In this study, we propose Disciplined Chain-of-Thought (D-CoT), a novel framework...

1 min 2 months ago

ear

LOW Academic International

FewMMBench: A Benchmark for Multimodal Few-Shot Learning

arXiv:2602.21854v1 Announce Type: new Abstract: As multimodal large language models (MLLMs) advance in handling interleaved image-text data, assessing their few-shot learning capabilities remains an open challenge. In this paper, we introduce FewMMBench, a comprehensive benchmark designed to evaluate MLLMs under...

1 min 2 months ago

ear

LOW Academic International

Personalized Graph-Empowered Large Language Model for Proactive Information Access

arXiv:2602.21862v1 Announce Type: new Abstract: Since individuals may struggle to recall all life details and often confuse events, establishing a system to assist users in recalling forgotten experiences is essential. While numerous studies have proposed memory recall systems, these primarily...

1 min 2 months ago

ear

LOW Academic International

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection

arXiv:2602.21887v1 Announce Type: new Abstract: Current large reasoning models (LRMs) have shown strong ability on challenging tasks after reinforcement learning (RL) based post-training. However, previous work mainly focuses on English reasoning in expectation of the strongest performance, despite the demonstrated...

1 min 2 months ago

ear

LOW Academic International

Large Language Models are Algorithmically Blind

arXiv:2602.21947v1 Announce Type: new Abstract: Large language models (LLMs) demonstrate remarkable breadth of knowledge, yet their ability to reason about computational processes remains poorly understood. Closing this gap matters for practitioners who rely on LLMs to guide algorithm selection and...

1 min 2 months ago

ear

LOW Academic United States

RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning

arXiv:2602.21951v1 Announce Type: new Abstract: Knowledge graph reasoning (KGR) infers missing facts, with recent advances increasingly harnessing the semantic priors and reasoning abilities of Large Language Models (LLMs). However, prevailing generative paradigms are prone to memorizing surface-level co-occurrences rather than...

1 min 2 months ago

ear

LOW Academic International

CxMP: A Linguistic Minimal-Pair Benchmark for Evaluating Constructional Understanding in Language Models

arXiv:2602.21978v1 Announce Type: new Abstract: Recent work has examined language models from a linguistic perspective to better understand how they acquire language. Most existing benchmarks focus on judging grammatical acceptability, whereas the ability to interpret meanings conveyed by grammatical forms...

1 min 2 months ago

ear

LOW Academic International

A Diversity Diet for a Healthier Model: A Case Study of French ModernBERT

arXiv:2602.22014v1 Announce Type: new Abstract: Diversity has been gaining interest in the NLP community in recent years. At the same time, state-of-the-art transformer models such as ModernBERT use very large pre-training datasets, which are driven by size rather than by...

1 min 2 months ago

ear

LOW Academic United States

DLT-Corpus: A Large-Scale Text Collection for the Distributed Ledger Technology Domain

arXiv:2602.22045v1 Announce Type: new Abstract: We introduce DLT-Corpus, the largest domain-specific text collection for Distributed Ledger Technology (DLT) research to date: 2.98 billion tokens from 22.12 million documents spanning scientific literature (37,440 publications), United States Patent and Trademark Office (USPTO)...

1 min 2 months ago

ear

LOW Academic European Union

SymTorch: A Framework for Symbolic Distillation of Deep Neural Networks

arXiv:2602.21307v1 Announce Type: new Abstract: Symbolic distillation replaces neural networks, or components thereof, with interpretable, closed-form mathematical expressions. This approach has shown promise in discovering physical laws and mathematical relationships directly from trained deep learning models, yet adoption remains limited...

1 min 2 months ago

ear

LOW Academic International

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

arXiv:2602.21320v1 Announce Type: new Abstract: Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically...

1 min 2 months ago

ear

LOW Academic International

Efficient Opportunistic Approachability

arXiv:2602.21328v1 Announce Type: new Abstract: We study the problem of opportunistic approachability: a generalization of Blackwell approachability where the learner would like to obtain stronger guarantees (i.e., approach a smaller set) when their adversary limits themselves to a subset of...

1 min 2 months ago

ear

LOW Academic International

HiPPO Zoo: Explicit Memory Mechanisms for Interpretable State Space Models

arXiv:2602.21340v1 Announce Type: new Abstract: Representing the past in a compressed, efficient, and informative manner is a central problem for systems trained on sequential data. The HiPPO framework, originally proposed by Gu & Dao et al., provides a principled approach...

1 min 2 months ago

ear

LOW Academic International

Archetypal Graph Generative Models: Explainable and Identifiable Communities via Anchor-Dominant Convex Hulls

arXiv:2602.21342v1 Announce Type: new Abstract: Representation learning has been essential for graph machine learning tasks such as link prediction, community detection, and network visualization. Despite recent advances in achieving high performance on these downstream tasks, little progress has been made...

1 min 2 months ago

ear

LOW Academic European Union

Interleaved Head Attention

arXiv:2602.21371v1 Announce Type: new Abstract: Multi-Head Attention (MHA) is the core computational primitive underlying modern Large Language Models (LLMs). However, MHA suffers from a fundamental linear scaling limitation: $H$ attention heads produce exactly $H$ independent attention matrices, with no communication...

1 min 2 months ago

ear

LOW Academic International

Defensive Generation

arXiv:2602.21390v1 Announce Type: new Abstract: We study the problem of efficiently producing, in an online fashion, generative models of scalar, multiclass, and vector-valued outcomes that cannot be falsified on the basis of the observed data and a pre-specified collection of...

1 min 2 months ago

ear

LOW Academic International

Generative Bayesian Computation as a Scalable Alternative to Gaussian Process Surrogates

arXiv:2602.21408v1 Announce Type: new Abstract: Gaussian process (GP) surrogates are the default tool for emulating expensive computer experiments, but cubic cost, stationarity assumptions, and Gaussian predictive distributions limit their reach. We propose Generative Bayesian Computation (GBC) via Implicit Quantile Networks...

1 min 2 months ago

ear

LOW Academic European Union

Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting

arXiv:2602.21415v1 Announce Type: new Abstract: Selecting the right deep learning model for power grid forecasting is challenging, as performance heavily depends on the data available to the operator. This paper presents a comprehensive benchmark of five modern neural architectures: two...

1 min 2 months ago

ear

LOW Academic United States

On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation

arXiv:2602.21424v1 Announce Type: new Abstract: Reinforcement learning (RL) agents under partial observability often condition actions on internally accumulated information such as memory or inferred latent context. We formalise such information-conditioned interaction patterns as behavioural dependency: variation in action selection with...

1 min 2 months ago

ear

LOW Academic International

Proximal-IMH: Proximal Posterior Proposals for Independent Metropolis-Hastings with Approximate Operators

arXiv:2602.21426v1 Announce Type: new Abstract: We consider the problem of sampling from a posterior distribution arising in Bayesian inverse problems in science, engineering, and imaging. Our method belongs to the family of independence Metropolis-Hastings (IMH) sampling algorithms, which are common...

1 min 2 months ago

ear

ToolMATH: A Math Tool Benchmark for Realistic Long-Horizon Multi-Tool Reasoning

VecGlypher: Unified Vector Glyph Generation with Language Models

Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment

When More Is Less: A Systematic Analysis of Spatial and Commonsense Information for Visual Spatial Reasoning

RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning

Multi-dimensional Assessment and Explainable Feedback for Counselor Responses to Client Resistance in Text-based Counseling with LLMs

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion

DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation

Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning

Explore-on-Graph: Incentivizing Autonomous Exploration of Large Language Models on Knowledge Graphs with Path-refined Reward Modeling

D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models

FewMMBench: A Benchmark for Multimodal Few-Shot Learning

Personalized Graph-Empowered Large Language Model for Proactive Information Access

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection

Large Language Models are Algorithmically Blind

RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning

CxMP: A Linguistic Minimal-Pair Benchmark for Evaluating Constructional Understanding in Language Models

A Diversity Diet for a Healthier Model: A Case Study of French ModernBERT

DLT-Corpus: A Large-Scale Text Collection for the Distributed Ledger Technology Domain

SymTorch: A Framework for Symbolic Distillation of Deep Neural Networks

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Efficient Opportunistic Approachability

HiPPO Zoo: Explicit Memory Mechanisms for Interpretable State Space Models

Archetypal Graph Generative Models: Explainable and Identifiable Communities via Anchor-Dominant Convex Hulls

Interleaved Head Attention

Defensive Generation

Generative Bayesian Computation as a Scalable Alternative to Gaussian Process Surrogates

Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting

On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation

Proximal-IMH: Proximal Posterior Proposals for Independent Metropolis-Hastings with Approximate Operators

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.