Arbitration

LOW Academic International

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

arXiv:2603.13985v1 Announce Type: new Abstract: Pre-trained Large Language Model (LLM) exhibits broad capabilities, yet, for specific tasks or domains their attainment of higher accuracy and more reliable reasoning generally depends on post-training through Supervised Fine-Tuning (SFT) or Reinforcement Learning (RL)....

1 min 1 month ago

bit

LOW Academic International

LLM-MINE: Large Language Model based Alzheimer's Disease and Related Dementias Phenotypes Mining from Clinical Notes

arXiv:2603.13673v1 Announce Type: new Abstract: Accurate extraction of Alzheimer's Disease and Related Dementias (ADRD) phenotypes from electronic health records (EHR) is critical for early-stage detection and disease staging. However, this information is usually embedded in unstructured textual data rather than...

1 min 1 month ago

adr

LOW Academic International

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

arXiv:2603.13327v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated remarkable capabilities in tool use, reasoning, and code generation, yet single-agent systems exhibit fundamental limitations when confronted with complex research tasks demanding multi-source synthesis, adversarial verification, and personalized...

1 min 1 month ago

bit

LOW Academic International

Knowledge Distillation for Large Language Models

arXiv:2603.13765v1 Announce Type: new Abstract: We propose a resource-efficient framework for compressing large language models through knowledge distillation, combined with guided chain-of-thought reinforcement learning. Using Qwen 3B as the teacher and Qwen 0.5B as the student, we apply knowledge distillation...

1 min 1 month ago

bit

LOW Academic International

PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting

arXiv:2603.13818v1 Announce Type: new Abstract: Precipitation nowcasting is vital for flood warning, agricultural management, and emergency response, yet two bottlenecks persist: the prohibitive cost of modeling million-scale spatiotemporal tokens from multi-variate atmospheric fields, and the extreme long-tailed rainfall distribution where...

1 min 1 month ago

bit

LOW Academic International

Motivation in Large Language Models

arXiv:2603.14347v1 Announce Type: new Abstract: Motivation is a central driver of human behavior, shaping decisions, goals, and task performance. As large language models (LLMs) become increasingly aligned with human preferences, we ask whether they exhibit something akin to motivation. We...

1 min 1 month ago

bit

LOW Academic International

Exposing Long-Tail Safety Failures in Large Language Models through Efficient Diverse Response Sampling

arXiv:2603.14355v1 Announce Type: new Abstract: Safety tuning through supervised fine-tuning and reinforcement learning from human feedback has substantially improved the robustness of large language models (LLMs). However, it often suppresses rather than eliminates unsafe behaviors, leaving rare but critical failures...

1 min 1 month ago

bit

LOW Academic International

BiT-MCTS: A Theme-based Bidirectional MCTS Approach to Chinese Fiction Generation

arXiv:2603.14410v1 Announce Type: new Abstract: Generating long-form linear fiction from open-ended themes remains a major challenge for large language models, which frequently fail to guarantee global structure and narrative diversity when using premise-based or linear outlining approaches. We present BiT-MCTS,...

1 min 1 month ago

bit

LOW Academic International

FedUAF: Uncertainty-Aware Fusion with Reliability-Guided Aggregation for Multimodal Federated Sentiment Analysis

arXiv:2603.13291v1 Announce Type: new Abstract: Multimodal sentiment analysis in federated learning environments faces significant challenges due to missing modalities, heterogeneous data distributions, and unreliable client updates. Existing federated approaches often struggle to maintain robust performance under these practical conditions. In...

1 min 1 month ago

bit

LOW Academic International

Preventing Curriculum Collapse in Self-Evolving Reasoning Systems

arXiv:2603.13309v1 Announce Type: new Abstract: Self-evolving reasoning frameworks let LLMs improve their reasoning capabilities by iteratively generating and solving problems without external supervision, using verifiable rewards. Ideally, such systems are expected to explore a diverse problem space and propose new...

1 min 1 month ago

bit

LOW Academic International

Linear Predictability of Attention Heads in Large Language Models

arXiv:2603.13314v1 Announce Type: new Abstract: Large language model (LLM) inference is increasingly bottlenecked by the Key-Value (KV) cache, yet the fine-grained structure of attention-head activations remains poorly understood. We show that pretrained Transformers exhibit a pervasive inter-head linear structure: for...

1 min 1 month ago

bit

LOW Academic International

Residual Stream Analysis of Overfitting And Structural Disruptions

arXiv:2603.13318v1 Announce Type: new Abstract: Ensuring that large language models (LLMs) remain both helpful and harmless poses a significant challenge: fine-tuning on repetitive safety datasets, where unsafe prompts are paired with standard refusal templates, often leads to false refusals, in...

1 min 1 month ago

bit

LOW Academic International

AdaBox: Adaptive Density-Based Box Clustering with Parameter Generalization

arXiv:2603.13339v1 Announce Type: new Abstract: Density-based clustering algorithms like DBSCAN and HDBSCAN are foundational tools for discovering arbitrarily shaped clusters, yet their practical utility is undermined by acute hyperparameter sensitivity -- parameters tuned on one dataset frequently fail to transfer...

1 min 1 month ago

bit

LOW Academic International

Thermal Robustness of Retrieval in Dense Associative Memories: LSE vs LSR Kernels

arXiv:2603.13350v1 Announce Type: new Abstract: Understanding whether retrieval in dense associative memories survives thermal noise is essential for bridging zero-temperature capacity proofs with the finite-temperature conditions of practical inference and biological computation. We use Monte Carlo simulations to map the...

1 min 1 month ago

bit

LOW News International

Nvidia’s DLSS 5 uses generative AI to boost photorealism in video games, with ambitions beyond gaming

Nvidia’s new DLSS 5 uses generative AI and structured graphics data to make video games more realistic. CEO Jensen Huang says the approach could eventually spread to other industries.

1 min 1 month ago

bit

LOW Academic International

ODRL Policy Comparison Through Normalisation

arXiv:2603.12926v1 Announce Type: new Abstract: The ODRL language has become the standard for representing policies and regulations for digital rights. However its complexity is a barrier to its usage, which has caused many related theoretical and practical works to focus...

1 min 1 month ago

bit

LOW Academic International

Semantic Invariance in Agentic AI

arXiv:2603.13173v1 Announce Type: new Abstract: Large Language Models (LLMs) increasingly serve as autonomous reasoning agents in decision support, scientific problem-solving, and multi-agent coordination systems. However, deploying LLM agents in consequential applications requires assurance that their reasoning remains stable under semantically...

1 min 1 month ago

bit

LOW Academic International

AI Model Modulation with Logits Redistribution

arXiv:2603.12755v1 Announce Type: new Abstract: Large-scale models are typically adapted to meet the diverse requirements of model owners and users. However, maintaining multiple specialized versions of the model is inefficient. In response, we propose AIM, a novel model modulation paradigm...

1 min 1 month ago

bit

LOW Academic International

TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction

arXiv:2603.12500v1 Announce Type: cross Abstract: We present a Temporal Rule-Anchored Chain-of-Evidence (TRACE) on knowledge graphs for interpretable stock movement prediction that unifies symbolic relational priors, dynamic graph exploration, and LLM-guided decision making in a single end-to-end pipeline. The approach performs...

1 min 1 month ago

bit

LOW Academic International

LMEB: Long-horizon Memory Embedding Benchmark

arXiv:2603.12572v1 Announce Type: new Abstract: Memory embeddings are crucial for memory-augmented systems, such as OpenClaw, but their evaluation is underexplored in current text embedding benchmarks, which narrowly focus on traditional passage retrieval and fail to assess models' ability to handle...

1 min 1 month ago

bit

LOW Academic International

SteerRM: Debiasing Reward Models via Sparse Autoencoders

arXiv:2603.12795v1 Announce Type: new Abstract: Reward models (RMs) are critical components of alignment pipelines, yet they exhibit biases toward superficial stylistic cues, preferring better-presented responses over semantically superior ones. Existing debiasing methods typically require retraining or architectural modifications, while direct...

1 min 1 month ago

bit

LOW Academic International

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

arXiv:2603.12963v1 Announce Type: new Abstract: The widespread adoption of reinforcement learning-based alignment highlights the growing importance of reward models. Various benchmarks have been built to evaluate reward models in various domains and scenarios. However, a significant gap remains in assessing...

1 min 1 month ago

bit

LOW Academic International

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

arXiv:2603.11689v1 Announce Type: new Abstract: Frontier Multimodal Large Language Models (MLLMs) exhibit remarkable capabilities in Visual-Language Comprehension (VLC) tasks. However, they are often deployed as zero-shot solution to new tasks in a black-box manner. Validating and understanding the behavior of...

1 min 1 month ago

bit

LOW Academic International

Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning

arXiv:2603.11394v1 Announce Type: new Abstract: Patients and clinicians are increasingly using chatbots powered by large language models (LLMs) for healthcare inquiries. While state-of-the-art LLMs exhibit high performance on static diagnostic reasoning benchmarks, their efficacy across multi-turn conversations, which better reflect...

1 min 1 month ago

bit

LOW Academic International

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

arXiv:2603.11281v1 Announce Type: new Abstract: Medical question-answering benchmarks predominantly evaluate single-turn exchanges, failing to capture the iterative, clarification-seeking nature of real patient consultations. We introduce ThReadMed-QA, a benchmark of 2,437 fully-answered patient-physician conversation threads extracted from r/AskDocs, comprising 8,204 question-answer...

1 min 1 month ago

bit

LOW Academic International

Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents

arXiv:2603.11864v1 Announce Type: new Abstract: As AI agents are increasingly used in high-stakes domains like healthcare and law enforcement, aligning their behaviour with social, legal, ethical, empathetic, and cultural (SLEEC) norms has become a critical engineering challenge. While international frameworks...

1 min 1 month ago

enforcement

LOW Academic International

Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework

arXiv:2603.11455v1 Announce Type: new Abstract: This study examines users' behavioural intention to use OpenClaw through the Cognition--Affect--Conation (CAC) framework. The research investigates how cognitive perceptions of the system influence affective responses and subsequently shape behavioural intention. Enabling factors include perceived...

1 min 1 month ago

bit

LOW Academic International

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

arXiv:2603.11863v1 Announce Type: new Abstract: The saturation of high-quality pre-training data has shifted research focus toward evolutionary systems capable of continuously generating novel artifacts, leading to the success of AlphaEvolve. However, the progress of such systems is hindered by the...

1 min 1 month ago

bit

LOW Academic International

QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate

arXiv:2603.11650v1 Announce Type: new Abstract: The effectiveness upper bound of retrieval-augmented generation (RAG) is fundamentally constrained by the semantic integrity and information granularity of text chunks in its knowledge base. To address these challenges, this paper proposes QChunker, which restructures...

1 min 1 month ago

bit

LOW Academic International

Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge

arXiv:2603.11665v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have been widely adopted as MLLM-as-a-Judges due to their strong alignment with human judgment across various visual tasks. However, most existing judge models are optimized for single-task scenarios and struggle...

1 min 1 month ago

bit

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

LLM-MINE: Large Language Model based Alzheimer's Disease and Related Dementias Phenotypes Mining from Clinical Notes

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Knowledge Distillation for Large Language Models

PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting

Motivation in Large Language Models

Exposing Long-Tail Safety Failures in Large Language Models through Efficient Diverse Response Sampling

BiT-MCTS: A Theme-based Bidirectional MCTS Approach to Chinese Fiction Generation

FedUAF: Uncertainty-Aware Fusion with Reliability-Guided Aggregation for Multimodal Federated Sentiment Analysis

Preventing Curriculum Collapse in Self-Evolving Reasoning Systems

Linear Predictability of Attention Heads in Large Language Models

Residual Stream Analysis of Overfitting And Structural Disruptions

AdaBox: Adaptive Density-Based Box Clustering with Parameter Generalization

Thermal Robustness of Retrieval in Dense Associative Memories: LSE vs LSR Kernels

Nvidia’s DLSS 5 uses generative AI to boost photorealism in video games, with ambitions beyond gaming

ODRL Policy Comparison Through Normalisation

Semantic Invariance in Agentic AI

AI Model Modulation with Logits Redistribution

TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction

LMEB: Long-horizon Memory Embedding Benchmark

SteerRM: Debiasing Reward Models via Sparse Autoencoders

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents

Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate

Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.