Intellectual Property

LOW Academic International

Structure-Aware Distributed Backdoor Attacks in Federated Learning

arXiv:2603.03865v1 Announce Type: new Abstract: While federated learning protects data privacy, it also makes the model update process vulnerable to long-term stealthy perturbations. Existing studies on backdoor attacks in federated learning mainly focus on trigger design or poisoning strategies, typically...

1 min 1 month, 2 weeks ago

ip

LOW Academic United States

Believe Your Model: Distribution-Guided Confidence Calibration

arXiv:2603.03872v1 Announce Type: new Abstract: Large Reasoning Models have demonstrated remarkable performance with the advancement of test-time scaling techniques, which enhances prediction accuracy by generating multiple candidate responses and selecting the most reliable answer. While prior work has analyzed that...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

arXiv:2603.02684v1 Announce Type: new Abstract: Subtle and indirect hate speech remains an underexplored challenge in online safety research, particularly when harmful intent is embedded within misleading or manipulative narratives. Existing hate speech datasets primarily capture overt toxicity, underrepresenting the nuanced...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

arXiv:2603.02701v1 Announce Type: new Abstract: Optimizing communication topology is fundamental to the efficiency and effectiveness of Large Language Model (LLM)-based Multi-Agent Systems (MAS). While recent approaches utilize reinforcement learning to dynamically construct task-specific graphs, they typically rely on single-sample policy...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

arXiv:2603.02789v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) enhance the potential of natural language processing. However, their actual impact on document information extraction remains unclear. In particular, it is unclear whether an MLLM-only pipeline--while simpler--can truly match the...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs

arXiv:2603.02830v1 Announce Type: new Abstract: Predicting future student responses to questions is particularly valuable for educational learning platforms where it enables effective interventions. One of the key approaches to do this has been through the use of knowledge tracing (KT)...

1 min 1 month, 3 weeks ago

ip

LOW Academic United States

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

arXiv:2603.02865v1 Announce Type: new Abstract: Large vision-language models (LVLMs) demonstrate strong performance on diagram understanding benchmarks, yet they still struggle with understanding relationships between elements, particularly those represented by nodes and directed edges (e.g., arrows and lines). To investigate the...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

LaTeX Compilation: Challenges in the Era of LLMs

arXiv:2603.02873v1 Announce Type: new Abstract: As large language models (LLMs) increasingly assist scientific writing, limitations and the significant token cost of TeX become more and more visible. This paper analyzes TeX's fundamental defects in compilation and user experience design to...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

Eval4Sim: An Evaluation Framework for Persona Simulation

arXiv:2603.02876v1 Announce Type: new Abstract: Large Language Model (LLM) personas with explicit specifications of attributes, background, and behavioural tendencies are increasingly used to simulate human conversations for tasks such as user modeling, social reasoning, and behavioural analysis. Ensuring that persona-grounded...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

arXiv:2603.02909v1 Announce Type: new Abstract: Document-level event argument extraction (DEAE) is essential for knowledge acquisition, aiming to extract participants of events from documents.In the zero-shot setting, existing methods employ LLMs to generate synthetic data to address the challenge posed by...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

arXiv:2603.03054v1 Announce Type: new Abstract: Large language models are increasingly used for patient-facing medical assistance and clinical decision support, but adapting them to clinical dialogue often requires supervision derived from doctor-patient conversations that may contain sensitive information. Conventional supervised fine-tuning...

1 min 1 month, 3 weeks ago

ip

LOW Academic United States

TAO-Attack: Toward Advanced Optimization-Based Jailbreak Attacks for Large Language Models

arXiv:2603.03081v1 Announce Type: new Abstract: Large language models (LLMs) have achieved remarkable success across diverse applications but remain vulnerable to jailbreak attacks, where attackers craft prompts that bypass safety alignment and elicit unsafe responses. Among existing approaches, optimization-based attacks have...

1 min 1 month, 3 weeks ago

ip

LOW Academic European Union

UniSkill: A Dataset for Matching University Curricula to Professional Competencies

arXiv:2603.03134v1 Announce Type: new Abstract: Skill extraction and recommendation systems have been studied from recruiter, applicant, and education perspectives. While AI applications in job advertisements have received broad attention, deficiencies in the instructed skills side remain a challenge. In this...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

APRES: An Agentic Paper Revision and Evaluation System

arXiv:2603.03142v1 Announce Type: new Abstract: Scientific discoveries must be communicated clearly to realize their full potential. Without effective communication, even the most groundbreaking findings risk being overlooked or misunderstood. The primary way scientists communicate their work and receive feedback from...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

arXiv:2603.03205v1 Announce Type: new Abstract: Agentic language models operate in a fundamentally different safety regime than chat models: they must plan, call tools, and execute long-horizon actions where a single misstep, such as accessing files or entering credentials, can cause...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

Using Learning Progressions to Guide AI Feedback for Science Learning

arXiv:2603.03249v1 Announce Type: new Abstract: Generative artificial intelligence (AI) offers scalable support for formative feedback, yet most AI-generated feedback relies on task-specific rubrics authored by domain experts. While effective, rubric authoring is time-consuming and limits scalability across instructional contexts. Learning...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain

arXiv:2603.02218v1 Announce Type: cross Abstract: Large language models (LLMs) make it plausible to build systems that improve through self-evolving loops, but many existing proposals are better understood as self-play and often plateau quickly. A central failure mode is that the...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat

arXiv:2603.02227v1 Announce Type: cross Abstract: Can a transformer learn which attention entries matter during training? In principle, yes: attention distributions are highly concentrated, and a small gate network can identify the important entries post-hoc with near-perfect accuracy. In practice, barely....

1 min 1 month, 3 weeks ago

ip

LOW Academic International

HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval

arXiv:2603.02248v1 Announce Type: cross Abstract: Table-text retrieval aims to retrieve relevant tables and text to support open-domain question answering. Existing studies use either early or late fusion, but face limitations. Early fusion pre-aligns a table row with its associated passages,...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

arXiv:2603.02482v1 Announce Type: cross Abstract: Safety evaluation and red-teaming of large language models remain predominantly text-centric, and existing frameworks lack the infrastructure to systematically test whether alignment generalizes to audio, image, and video inputs. We present MUSE (Multimodal Unified Safety...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

FlashEvaluator: Expanding Search Space with Parallel Evaluation

arXiv:2603.02565v1 Announce Type: cross Abstract: The Generator-Evaluator (G-E) framework, i.e., evaluating K sequences from a generator and selecting the top-ranked one according to evaluator scores, is a foundational paradigm in tasks such as Recommender Systems (RecSys) and Natural Language Processing...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning

arXiv:2603.02637v1 Announce Type: cross Abstract: Modern machine learning (ML) workloads increasingly rely on GPUs, yet achieving high end-to-end performance remains challenging due to dependencies on both GPU kernel efficiency and host-side settings. Although LLM-based methods show promise on automated GPU...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

arXiv:2603.02215v1 Announce Type: new Abstract: Chemical reaction prediction is pivotal for accelerating drug discovery and synthesis planning. Despite advances in data-driven models, current approaches are hindered by an overemphasis on parameter and dataset scaling. Some methods coupled with evaluation techniques...

1 min 1 month, 3 weeks ago

nda

LOW Academic International

Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression

arXiv:2603.02217v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models scale capacity efficiently, but their massive parameter footprint creates a deployment-time memory bottleneck. We organize retraining-free MoE compression into three paradigms - Expert Pruning, Expert Editing, and Expert Merging - and show...

1 min 1 month, 3 weeks ago

nda

LOW Academic United States

MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation

arXiv:2603.02222v1 Announce Type: new Abstract: MedCalc-Bench is a widely used benchmark for evaluating LLM performance on clinical calculator tasks, with state-of-the-art direct prompting scores plateauing around 35% on the Verified split (HELM MedHELM leaderboard) and the best published approach-RL with...

1 min 1 month, 3 weeks ago

ip

LOW Academic United States

Characterizing and Predicting Wildfire Evacuation Behavior: A Dual-Stage ML Approach

arXiv:2603.02223v1 Announce Type: new Abstract: Wildfire evacuation behavior is highly variable and influenced by complex interactions among household resources, preparedness, and situational cues. Using a large-scale MTurk survey of residents in California, Colorado, and Oregon, this study integrates unsupervised and...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

arXiv:2603.02224v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has emerged as a parameter-efficient approach for adapting large pre-trained models, yet its behavior under continual learning remains poorly understood. We present a geometric theory characterizing catastrophic forgetting in LoRA through the...

1 min 1 month, 3 weeks ago

ip

LOW Academic European Union

Neural Paging: Learning Context Management Policies for Turing-Complete Agents

arXiv:2603.02228v1 Announce Type: new Abstract: The proof that Large Language Models (LLMs) augmented with external read-write memory constitute a computationally universal system has established the theoretical foundation for general-purpose agents. However, existing implementations face a critical bottleneck: the finite and...

1 min 1 month, 3 weeks ago

nda

LOW Academic European Union

Beyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback

arXiv:2603.02232v1 Announce Type: new Abstract: Reward modeling is crucial for aligning large language models with human preferences, yet current approaches lack a principled mathematical framework for leveraging ordinal preference data. When human annotators provide graded preferences on a Likert scale...

1 min 1 month, 3 weeks ago

ip

LOW Academic International

Adaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings

arXiv:2603.02233v1 Announce Type: new Abstract: Personalized Federated Learning (PFL) enables a collection of agents to collaboratively learn individual models without sharing raw data. We propose a new PFL approach in which each agent optimizes a weighted combination of all agents'...

1 min 1 month, 3 weeks ago

ip

Structure-Aware Distributed Backdoor Attacks in Federated Learning

Believe Your Model: Distribution-Guided Confidence Calibration

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

LaTeX Compilation: Challenges in the Era of LLMs

Eval4Sim: An Evaluation Framework for Persona Simulation

Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

TAO-Attack: Toward Advanced Optimization-Based Jailbreak Attacks for Large Language Models

UniSkill: A Dataset for Matching University Curricula to Professional Competencies

APRES: An Agentic Paper Revision and Evaluation System

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

Using Learning Progressions to Guide AI Feedback for Science Learning

Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat

HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

FlashEvaluator: Expanding Search Space with Parallel Evaluation

StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression

MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation

Characterizing and Predicting Wildfire Evacuation Behavior: A Dual-Stage ML Approach

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

Neural Paging: Learning Context Management Policies for Turing-Complete Agents

Beyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback

Adaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.