Using Learning Progressions to Guide AI Feedback for Science Learning
arXiv:2603.03249v1 Announce Type: new Abstract: Generative artificial intelligence (AI) offers scalable support for formative feedback, yet most AI-generated feedback relies on task-specific rubrics authored by domain experts. While effective, rubric authoring is time-consuming and limits scalability across instructional contexts. Learning...
MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models
arXiv:2603.02482v1 Announce Type: cross Abstract: Safety evaluation and red-teaming of large language models remain predominantly text-centric, and existing frameworks lack the infrastructure to systematically test whether alignment generalizes to audio, image, and video inputs. We present MUSE (Multimodal Unified Safety...
Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs
arXiv:2603.02556v1 Announce Type: cross Abstract: Reasoning has emerged as a key capability of large language models. In linguistic tasks, this capability can be enhanced by self-improving techniques that refine reasoning paths for subsequent finetuning. However, extending these language-based self-improving approaches...
MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation
arXiv:2603.02222v1 Announce Type: new Abstract: MedCalc-Bench is a widely used benchmark for evaluating LLM performance on clinical calculator tasks, with state-of-the-art direct prompting scores plateauing around 35% on the Verified split (HELM MedHELM leaderboard) and the best published approach--RL with...
Neural Paging: Learning Context Management Policies for Turing-Complete Agents
arXiv:2603.02228v1 Announce Type: new Abstract: The proof that Large Language Models (LLMs) augmented with external read-write memory constitute a computationally universal system has established the theoretical foundation for general-purpose agents. However, existing implementations face a critical bottleneck: the finite and...
Concept Heterogeneity-aware Representation Steering
arXiv:2603.02237v1 Announce Type: new Abstract: Representation steering offers a lightweight mechanism for controlling the behavior of large language models (LLMs) by intervening on internal activations at inference time. Most existing methods rely on a single global steering direction, typically obtained...
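For context on the single global steering direction the abstract contrasts against, the following is a minimal sketch of representation steering with one fixed direction, added to a layer's hidden states via a forward hook at inference time. The layer index, scaling coefficient, and hook placement are illustrative assumptions, not this paper's concept-heterogeneity-aware method.

```python
# Minimal sketch of single-direction representation steering (illustrative only):
# shift one transformer layer's hidden states along a fixed vector at inference.
import torch

def make_steering_hook(direction: torch.Tensor, alpha: float = 4.0):
    """Return a forward hook that shifts hidden states along `direction`."""
    unit = direction / direction.norm()

    def hook(module, inputs, output):
        # Many decoder layers return a tuple; hidden states come first.
        hidden = output[0] if isinstance(output, tuple) else output
        steered = hidden + alpha * unit.to(hidden.dtype).to(hidden.device)
        if isinstance(output, tuple):
            return (steered,) + output[1:]
        return steered

    return hook

# Hypothetical usage with a Hugging Face-style decoder (layer index and alpha
# are assumptions):
#   handle = model.model.layers[15].register_forward_hook(
#       make_steering_hook(steer_vec, alpha=4.0))
#   ... run generation ...
#   handle.remove()
```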
Quantum-Inspired Fine-Tuning for Few-Shot AIGC Detection via Phase-Structured Reparameterization
arXiv:2603.02281v1 Announce Type: new Abstract: Recent studies show that quantum neural networks (QNNs) generalize well in few-shot regimes. To extend this advantage to large-scale tasks, we propose Q-LoRA, a quantum-enhanced fine-tuning scheme that integrates lightweight QNNs into the low-rank adaptation...
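As background for the low-rank adaptation that Q-LoRA plugs its quantum modules into, here is a minimal sketch of a plain LoRA layer (frozen base weight plus a trainable low-rank update). The rank, scaling, and initialization are generic defaults; the phase-structured quantum reparameterization itself is not reproduced here.

```python
# Sketch of plain LoRA: y = W x + (alpha/r) * B A x, with only A and B trained.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base                                  # frozen pretrained layer
        self.base.weight.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(nn.Linear(512, 512), rank=8)
print(layer(torch.randn(2, 512)).shape)                   # torch.Size([2, 512])
```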
EdgeFLow: Serverless Federated Learning via Sequential Model Migration in Edge Networks
arXiv:2603.02562v1 Announce Type: new Abstract: Federated Learning (FL) has emerged as a transformative distributed learning paradigm in the era of Internet of Things (IoT), reconceptualizing data processing methodologies. However, FL systems face significant communication bottlenecks due to inevitable client-server data...
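To make "sequential model migration" concrete, the toy sketch below passes a single model from client to client, training it locally at each stop with no central aggregation server. The visiting order, local objective, and hyperparameters are assumptions for illustration and are not EdgeFLow's actual protocol.

```python
# Toy serverless FL: the model migrates across clients and is updated in place.
import numpy as np

def local_sgd(w, X, y, lr=0.01, steps=20):
    """A few local gradient steps of linear regression on one client's data."""
    for _ in range(steps):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w = w - lr * grad
    return w

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])
clients = []
for _ in range(4):                       # four edge clients with private data
    X = rng.normal(size=(50, 2))
    clients.append((X, X @ true_w + 0.1 * rng.normal(size=50)))

w = np.zeros(2)
for _ in range(3):                       # migration rounds over the edge network
    for X, y in clients:                 # model moves client to client, no server
        w = local_sgd(w, X, y)
print(w)                                 # approaches [2, -1]
```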
LLM-Bootstrapped Targeted Finding Guidance for Factual MLLM-based Medical Report Generation
arXiv:2603.00426v1 Announce Type: new Abstract: Automatic medical report generation with Multimodal Large Language Models (MLLMs) frequently suffers from factual instability, which may manifest as the omission of findings or the inclusion of inaccurate information, thereby constraining...
CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging
arXiv:2603.00573v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance on diverse downstream and domain-specific tasks via parameter-efficient fine-tuning (PEFT). However, existing PEFT methods, particularly MoE-LoRA architectures, suffer from limited parameter efficiency and coarse-grained adaptation due to the...
Super Research: Answering Highly Complex Questions with Large Language Models through Super Deep and Super Wide Research
arXiv:2603.00582v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated proficiency in Deep Research or Wide Search, their capacity to solve highly complex questions--those requiring long-horizon planning, massive evidence gathering, and synthesis across heterogeneous sources--remains largely unexplored. We...
SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs
arXiv:2603.00669v1 Announce Type: new Abstract: Sustainability disclosure standards (e.g., GRI, SASB, TCFD, IFRS S2) are comprehensive yet lengthy, terminology-dense, and highly cross-referential, hindering structured analysis and downstream use. We present SSKG Hub (Sustainability Standards Knowledge Graph Hub), a research prototype...
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
arXiv:2603.00889v1 Announce Type: new Abstract: Large Language Models (LLMs) have recently exhibited remarkable reasoning capabilities, largely enabled by supervised fine-tuning (SFT)- and reinforcement learning (RL)-based post-training on high-quality reasoning data. However, reproducing and extending these capabilities in open and scalable...
KVSlimmer: Theoretical Insights and Practical Optimizations for Asymmetric KV Merging
arXiv:2603.00907v1 Announce Type: new Abstract: The growing computational and memory demands of the Key-Value (KV) cache significantly constrain Large Language Models (LLMs). While KV merging has emerged as a promising solution, existing methods that rely on empirical...
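As a concrete (and deliberately simplistic) illustration of KV merging, the sketch below compresses a KV cache by averaging fixed-size groups of adjacent key/value pairs. The uniform-mean merge rule and grouping are assumptions for illustration, not KVSlimmer's criterion or its asymmetric treatment of keys and values.

```python
# Toy KV-cache compression: merge groups of adjacent positions by averaging.
import torch

def merge_kv(keys: torch.Tensor, values: torch.Tensor, group: int = 2):
    """keys, values: [seq_len, head_dim]; returns tensors with ~seq_len/group rows."""
    seq_len = keys.shape[0]
    usable = (seq_len // group) * group
    k = keys[:usable].reshape(-1, group, keys.shape[-1]).mean(dim=1)
    v = values[:usable].reshape(-1, group, values.shape[-1]).mean(dim=1)
    if usable < seq_len:                 # keep trailing positions unmerged
        k = torch.cat([k, keys[usable:]], dim=0)
        v = torch.cat([v, values[usable:]], dim=0)
    return k, v

k, v = merge_kv(torch.randn(9, 64), torch.randn(9, 64), group=2)
print(k.shape, v.shape)                  # torch.Size([5, 64]) torch.Size([5, 64])
```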
Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains
arXiv:2603.00924v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used for medical entity extraction, yet their confidence scores are often miscalibrated, limiting safe deployment in clinical settings. We present a conformal prediction framework that provides finite-sample coverage guarantees...
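For readers who want the coverage mechanism spelled out, here is a minimal split-conformal calibration sketch: a threshold is chosen from held-out nonconformity scores so that, under exchangeability, new examples are covered with probability at least 1 - alpha. The scores and alpha below are placeholders; the paper's risk-control procedure for entity extraction may differ.

```python
# Split conformal calibration: pick the score threshold at the
# ceil((n+1)(1-alpha))/n empirical quantile of calibration scores.
import numpy as np

def conformal_threshold(cal_scores: np.ndarray, alpha: float = 0.1) -> float:
    n = len(cal_scores)
    q = np.ceil((n + 1) * (1 - alpha)) / n
    return float(np.quantile(cal_scores, min(q, 1.0), method="higher"))

# Stand-in nonconformity scores, e.g., 1 - model confidence per extracted entity.
cal_scores = np.random.default_rng(0).uniform(size=500)
tau = conformal_threshold(cal_scores, alpha=0.1)
print(tau)   # at test time, keep candidates with score <= tau for ~90% coverage
```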
How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning
arXiv:2603.01070v1 Announce Type: new Abstract: Solving complex geometric problems inherently requires interleaved reasoning: a tight alternation between constructing diagrams and performing logical deductions. Although recent Multimodal Large Language Models (MLLMs) have demonstrated strong capabilities in visual generation and plotting, we...
StaTS: Spectral Trajectory Schedule Learning for Adaptive Time Series Forecasting with Frequency Guided Denoiser
arXiv:2603.00037v1 Announce Type: new Abstract: Diffusion models have been used for probabilistic time series forecasting and show strong potential. However, fixed noise schedules often produce intermediate states that are hard to invert and a terminal state that deviates from the...
Attn-QAT: 4-Bit Attention With Quantization-Aware Training
arXiv:2603.00040v1 Announce Type: new Abstract: Achieving reliable 4-bit attention is a prerequisite for end-to-end FP4 computation on emerging FP4-capable GPUs, yet attention remains the main obstacle due to FP4's tiny dynamic range and attention's heavy-tailed activations. This paper presents the...
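As background on the quantization-aware-training building block, the sketch below applies "fake" 4-bit quantization in the forward pass and a straight-through estimator in the backward pass. It uses symmetric integer quantization for simplicity; FP4 number formats and Attn-QAT's attention-specific scheme are not reproduced here.

```python
# Fake 4-bit quantization with a straight-through estimator (generic QAT sketch).
import torch

class FakeQuant4(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        scale = x.abs().amax().clamp(min=1e-8) / 7.0    # signed 4-bit range [-8, 7]
        q = torch.clamp(torch.round(x / scale), -8, 7)
        return q * scale

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output                               # pass gradients straight through

x = torch.randn(4, 16, requires_grad=True)
y = FakeQuant4.apply(x)
y.sum().backward()
print(x.grad.sum())   # gradients flow as if quantization were the identity
```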
REMIND: Rethinking Medical High-Modality Learning under Missingness--A Long-Tailed Distribution Perspective
arXiv:2603.00046v1 Announce Type: new Abstract: Medical multi-modal learning is critical for integrating information from a large set of diverse modalities. However, when leveraging a high number of modalities in real clinical applications, it is often impractical to obtain full-modality observations...
Knowledge-guided generative surrogate modeling for high-dimensional design optimization under scarce data
arXiv:2603.00052v1 Announce Type: new Abstract: Surrogate models are widely used in mechanical design and manufacturing process optimization, where high-fidelity computational models may be unavailable or prohibitively expensive. Their effectiveness, however, is often limited by data scarcity, as purely data-driven surrogates...
Engineering FAIR Privacy-preserving Applications that Learn Histories of Disease
arXiv:2603.00181v1 Announce Type: new Abstract: A recent report on "Learning the natural history of human disease with generative transformers" created an opportunity to assess the engineering challenge of delivering user-facing Generative AI applications in privacy-sensitive domains. The application of these...
CoPeP: Benchmarking Continual Pretraining for Protein Language Models
arXiv:2603.00253v1 Announce Type: new Abstract: Protein language models (pLMs) have recently gained significant attention for their ability to uncover relationships between sequence, structure, and function from evolutionary statistics, thereby accelerating therapeutic drug discovery. These models learn from large protein databases...
Exact and Asymptotically Complete Robust Verifications of Neural Networks via Quantum Optimization
arXiv:2603.00408v1 Announce Type: new Abstract: Deep neural networks (DNNs) enable high performance across domains but remain vulnerable to adversarial perturbations, limiting their use in safety-critical settings. Here, we introduce two quantum-optimization-based models for robust verification that reduce the combinatorial burden...
Weight Updates as Activation Shifts: A Principled Framework for Steering
arXiv:2603.00425v1 Announce Type: new Abstract: Activation steering promises to be an extremely parameter-efficient form of adaptation, but its effectiveness depends on critical design choices -- such as intervention location and parameterization -- that currently rely on empirical heuristics rather than...
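The basic identity behind reading a weight update as an activation shift can be checked numerically: a rank-1 update Delta_W = u v^T changes a linear layer's output by u * (v . x), i.e., an input-dependent shift along a fixed direction. The sketch below verifies only this standard identity; it is not the paper's full framework or its analysis of intervention location and parameterization.

```python
# Numerical check: (W + u v^T) x == W x + u * (v . x)
import torch

d_in, d_out = 16, 8
W = torch.randn(d_out, d_in)
u = torch.randn(d_out)        # output-space steering direction
v = torch.randn(d_in)         # input-space read-out direction
x = torch.randn(d_in)

y_weight_update = (W + torch.outer(u, v)) @ x      # apply the rank-1 weight update
y_activation_shift = W @ x + u * (v @ x)           # equivalent activation-space shift
print(torch.allclose(y_weight_update, y_activation_shift, atol=1e-5))   # True
```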
ROKA: Robust Knowledge Unlearning against Adversaries
arXiv:2603.00436v1 Announce Type: new Abstract: The need for machine unlearning is critical for data privacy, yet existing methods often cause Knowledge Contamination by unintentionally damaging related knowledge. Such degraded model performance after unlearning has recently been leveraged for new...
TRIZ-RAGNER: A Retrieval-Augmented Large Language Model for TRIZ-Aware Named Entity Recognition in Patent-Based Contradiction Mining
arXiv:2602.23656v1 Announce Type: new Abstract: TRIZ-based contradiction mining is a fundamental task in patent analysis and systematic innovation, as it enables the identification of improving and worsening technical parameters that drive inventive problem solving. However, existing approaches largely rely on...
GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models
arXiv:2602.23826v1 Announce Type: new Abstract: We present GLUScope, an open-source tool for analyzing neurons in Transformer-based language models, intended for interpretability researchers. We focus on more recent models than previous tools do; specifically we consider gated activation functions such as...
The Astonishing Ability of Large Language Models to Parse Jabberwockified Language
arXiv:2602.23928v1 Announce Type: new Abstract: We show that large language models (LLMs) have an astonishing ability to recover meaning from severely degraded English texts--texts in which content words have been randomly substituted by nonsense strings, e.g., "At the ghybe...
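A toy version of the stimulus construction described above (randomly replacing content words with pronounceable nonsense strings while keeping function words) might look like the following; the function-word list and nonsense generator are illustrative assumptions, not the paper's procedure.

```python
# Toy "jabberwockifier": keep function words, replace everything else with nonsense.
import random

FUNCTION_WORDS = {"the", "a", "an", "of", "in", "on", "at", "and", "or",
                  "to", "is", "was", "were", "by", "with", "that", "it"}

def nonsense_word(length: int, rng: random.Random) -> str:
    vowels, consonants = "aeiou", "bcdfghjklmnprstvwz"
    return "".join(rng.choice(vowels if i % 2 else consonants) for i in range(length))

def jabberwockify(sentence: str, seed: int = 0) -> str:
    rng = random.Random(seed)
    out = []
    for w in sentence.split():
        out.append(w if w.lower() in FUNCTION_WORDS
                   else nonsense_word(max(3, len(w)), rng))
    return " ".join(out)

print(jabberwockify("The borogoves were mimsy in the wabe"))
```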
CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning
arXiv:2602.24142v1 Announce Type: new Abstract: Mobile Agents can autonomously execute user instructions, which requires hybrid-capabilities reasoning, including screen summary, subtask planning, action decision and action function. However, existing agents struggle to achieve both decoupled enhancement and balanced integration of these...
MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games
arXiv:2602.24188v1 Announce Type: new Abstract: We present a scalable methodology for evaluating language models in multi-turn interactions, using a suite of collaborative games that require effective communication about private information. This enables an interactive scaling analysis, in which a fixed...