International Law

LOW Academic International

Explicit Grammar Semantic Feature Fusion for Robust Text Classification

arXiv:2602.20749v1 Announce Type: new Abstract: Natural Language Processing enables computers to understand human language by analysing and classifying text efficiently with deep-level grammatical and semantic features. Existing models capture features by learning from large corpora with transformer models, which are...

1 min 2 months ago

LOW Academic United States

Overton Pluralistic Reinforcement Learning for Large Language Models

arXiv:2602.20759v1 Announce Type: new Abstract: Existing alignment paradigms remain limited in capturing the pluralistic nature of human values. Overton Pluralism addresses this gap by generating responses with diverse perspectives from a single query. This paper introduces OP-GRPO (Overton Pluralistic Group...

1 min 2 months ago

LOW Academic International

Don't Ignore the Tail: Decoupling top-K Probabilities for Efficient Language Model Distillation

arXiv:2602.20816v1 Announce Type: new Abstract: The core learning signal used in language model distillation is the standard Kullback-Leibler (KL) divergence between the student and teacher distributions. Traditional KL divergence tends to be dominated by the next tokens with the highest...

1 min 2 months ago

LOW Academic International

FinAnchor: Aligned Multi-Model Representations for Financial Prediction

arXiv:2602.20859v1 Announce Type: new Abstract: Financial prediction from long documents involves significant challenges, as actionable signals are often sparse and obscured by noise, and the optimal LLM for generating embeddings varies across tasks and time periods. In this paper, we...

1 min 2 months ago

LOW Academic International

The Art of Efficient Reasoning: Data, Reward, and Optimization

arXiv:2602.20945v1 Announce Type: new Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to incentivize short yet accurate thinking trajectories, typically through reward...

1 min 2 months ago

LOW Academic International

Blackbird Language Matrices: A Framework to Investigate the Linguistic Competence of Language Models

arXiv:2602.20966v1 Announce Type: new Abstract: This article describes a novel language task, the Blackbird Language Matrices (BLM) task, inspired by intelligence tests, and illustrates the BLM datasets, their construction and benchmarking, and targeted experiments on chunking and systematicity. BLMs are...

1 min 2 months ago

LOW Academic International

Linear Reasoning vs. Proof by Cases: Obstacles for Large Language Models in FOL Problem Solving

arXiv:2602.20973v1 Announce Type: new Abstract: To comprehensively evaluate the mathematical reasoning capabilities of Large Language Models (LLMs), researchers have introduced abundant mathematical reasoning datasets. However, most existing datasets primarily focus on linear reasoning, neglecting other parts such as proof by...

1 min 2 months ago

LOW Academic United States

Beyond the Star Rating: A Scalable Framework for Aspect-Based Sentiment Analysis Using LLMs and Text Classification

arXiv:2602.21082v1 Announce Type: new Abstract: Customer-provided reviews have become an important source of information for business owners and other customers alike. However, effectively analyzing millions of unstructured reviews remains challenging. While large language models (LLMs) show promise for natural language...

1 min 2 months ago

LOW Academic United States

PVminer: A Domain-Specific Tool to Detect the Patient Voice in Patient Generated Data

arXiv:2602.21165v1 Announce Type: new Abstract: Patient-generated text such as secure messages, surveys, and interviews contains rich expressions of the patient voice (PV), reflecting communicative behaviors and social determinants of health (SDoH). Traditional qualitative coding frameworks are labor intensive and do...

1 min 2 months ago

LOW Academic International

On Data Engineering for Scaling LLM Terminal Capabilities

arXiv:2602.21193v1 Announce Type: new Abstract: Despite rapid recent progress in the terminal capabilities of large language models, the training data strategies behind state-of-the-art terminal agents remain largely undisclosed. We address this gap through a systematic study of data engineering practices...

1 min 2 months ago

LOW Academic European Union

Graph Modelling Analysis of Speech-Gesture Interaction for Aphasia Severity Estimation

arXiv:2602.20163v1 Announce Type: cross Abstract: Aphasia is an acquired language disorder caused by injury to the regions of the brain that are responsible for language. Aphasia may impair the use and comprehension of written and spoken language. The Western Aphasia...

1 min 2 months ago

LOW Academic International

MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation

arXiv:2602.20423v1 Announce Type: cross Abstract: Medical image segmentation remains challenging due to limited annotations for training, ambiguous anatomical features, and domain shifts. While vision-language models such as CLIP offer strong cross-modal representations, their potential for dense, text-guided medical image segmentation...

1 min 2 months ago

LOW Academic International

Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

arXiv:2602.20449v1 Announce Type: cross Abstract: Modern Protein Language Models (PLMs) apply transformer-based model architectures from natural language processing to biological sequences, predicting a variety of protein functions and properties. However, protein language has key differences from natural language, such as...

1 min 2 months ago

LOW Academic European Union

Actor-Curator: Co-adaptive Curriculum Learning via Policy-Improvement Bandits for RL Post-Training

arXiv:2602.20532v1 Announce Type: cross Abstract: Post-training large foundation models with reinforcement learning typically relies on massive and heterogeneous datasets, making effective curriculum learning both critical and challenging. In this work, we propose ACTOR-CURATOR, a scalable and fully automated curriculum learning...

1 min 2 months ago

LOW Academic International

GATES: Self-Distillation under Privileged Context with Consensus Gating

arXiv:2602.20574v1 Announce Type: cross Abstract: We study self-distillation in settings where supervision is unreliable: there are no ground truth labels, verifiable rewards, or external graders to evaluate answers. We focus on document-grounded question answering with asymmetric context, where a single...

1 min 2 months ago

LOW Academic European Union

RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition

arXiv:2602.20735v1 Announce Type: cross Abstract: This paper presents the award-winning RMIT-ADM+S system for the Text-to-Text track of the NeurIPS 2025 MMU-RAG Competition. We introduce Routing-to-RAG (R2RAG), a research-focused retrieval-augmented generation (RAG) architecture composed of lightweight components that dynamically adapt the retrieval...

1 min 2 months ago

LOW Academic International

Multimodal MRI Report Findings Supervised Brain Lesion Segmentation with Substructures

arXiv:2602.20994v1 Announce Type: cross Abstract: Report-supervised (RSuper) learning seeks to alleviate the need for dense tumor voxel labels with constraints derived from radiology reports (e.g., volumes, counts, sizes, locations). In MRI studies of brain tumors, however, we often involve multi-parametric...

1 min 2 months ago

LOW Academic United States

Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

arXiv:2602.20175v1 Announce Type: new Abstract: We present an application of the tensor network generator-enhanced optimization (TN-GEO) framework to address the traveling salesman problem (TSP), a fundamental combinatorial optimization challenge. Our approach employs a tensor network Born machine based on automatically...

1 min 2 months ago

LOW Academic International

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

arXiv:2602.20197v1 Announce Type: new Abstract: Reinforcement Learning with verifiable rewards (RLVR) has emerged as a primary learning paradigm for enhancing the reasoning capabilities of multi-modal large language models (MLLMs). However, during RL training, the enormous state space of MLLM and...

1 min 2 months ago

LOW Academic United States

IMOVNO+: A Regional Partitioning and Meta-Heuristic Ensemble Framework for Imbalanced Multi-Class Learning

arXiv:2602.20199v1 Announce Type: new Abstract: Class imbalance, overlap, and noise degrade data quality, reduce model reliability, and limit generalization. Although widely studied in binary classification, these issues remain underexplored in multi-class settings, where complex inter-class relationships make minority-majority structures unclear...

1 min 2 months ago

LOW Academic International

Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis

arXiv:2602.20207v1 Announce Type: new Abstract: Knowledge editing in Large Language Models (LLMs) aims to update the model's prediction for a specific query to a desired target while preserving its behavior on all other inputs. This process typically involves two stages:...

1 min 2 months ago

LOW Academic International

Model Merging in the Essential Subspace

arXiv:2602.20208v1 Announce Type: new Abstract: Model merging aims to integrate multiple task-specific fine-tuned models derived from a shared pre-trained checkpoint into a single multi-task model without additional training. Despite extensive research, task interference remains a major obstacle that often undermines...

1 min 2 months ago

LOW Academic International

MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning

arXiv:2602.20223v1 Announce Type: new Abstract: Recently, TabPFN has gained attention as a foundation model for tabular data. However, it struggles to integrate heterogeneous modalities such as images and text, which are common in domains like healthcare and marketing, thereby limiting...

1 min 2 months ago

LOW Academic International

Uncertainty-Aware Delivery Delay Duration Prediction via Multi-Task Deep Learning

arXiv:2602.20271v1 Announce Type: new Abstract: Accurate delivery delay prediction is critical for maintaining operational efficiency and customer satisfaction across modern supply chains. Yet the increasing complexity of logistics networks, spanning multimodal transportation, cross-country routing, and pronounced regional variability, makes this...

1 min 2 months ago

LOW Academic International

The Truthfulness Spectrum Hypothesis

arXiv:2602.20273v1 Announce Type: new Abstract: Large language models (LLMs) have been reported to linearly encode truthfulness, yet recent work questions this finding's generality. We reconcile these views with the truthfulness spectrum hypothesis: the representational space contains directions ranging from broadly...

1 min 2 months ago

LOW Academic International

Learning to Solve Complex Problems via Dataset Decomposition

arXiv:2602.20296v1 Announce Type: new Abstract: Curriculum learning is a class of training strategies that organizes the data being exposed to a model by difficulty, gradually from simpler to more complex examples. This research explores a reverse curriculum generation approach that...

1 min 2 months ago

LOW Academic United States

Shape-informed cardiac mechanics surrogates in data-scarce regimes via geometric encoding and generative augmentation

arXiv:2602.20306v1 Announce Type: new Abstract: High-fidelity computational models of cardiac mechanics provide mechanistic insight into the heart function but are computationally prohibitive for routine clinical use. Surrogate models can accelerate simulations, but generalization across diverse anatomies is challenging, particularly in...

1 min 2 months ago

LOW Academic International

In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

arXiv:2602.20307v1 Announce Type: new Abstract: Time-series foundation models (TSFMs) have demonstrated strong generalization capabilities across diverse datasets and tasks. However, existing foundation models are typically pre-trained to enhance performance on specific tasks and often struggle to generalize to unseen tasks...

1 min 2 months ago

LOW Academic International

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

arXiv:2602.20309v1 Announce Type: new Abstract: Vision-language-action (VLA) models unify perception, language, and control for embodied agents but face significant challenges in practical deployment due to rapidly increasing compute and memory demands, especially as models scale to longer horizons and larger...

1 min 2 months ago

LOW Academic United States

Emergent Manifold Separability during Reasoning in Large Language Models

arXiv:2602.20338v1 Announce Type: new Abstract: Chain-of-Thought (CoT) prompting significantly improves reasoning in Large Language Models, yet the temporal dynamics of the underlying representation geometry remain poorly understood. We investigate these dynamics by applying Manifold Capacity Theory (MCT) to a compositional...

1 min 2 months ago
