PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents
arXiv:2603.03296v1 Announce Type: cross Abstract: Long-term memory is essential for large language model (LLM) agents operating in complex environments, yet existing memory designs are either task-specific and non-transferable, or task-agnostic but less effective due to low task-relevance and context explosion...
TATRA: Training-Free Instance-Adaptive Prompting Through Rephrasing and Aggregation
arXiv:2603.03298v1 Announce Type: cross Abstract: Large Language Models (LLMs) have improved substantially in alignment, yet their behavior remains highly sensitive to prompt phrasing. This brittleness has motivated automated prompt engineering, but most existing methods (i) require a task-specific training set, (ii)...
From Exact Hits to Close Enough: Semantic Caching for LLM Embeddings
arXiv:2603.03301v1 Announce Type: cross Abstract: The rapid adoption of large language models (LLMs) has created demand for faster responses and lower costs. Semantic caching, reusing semantically similar requests via their embeddings, addresses this need but breaks classic cache assumptions and...
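The mechanism named in this abstract, reusing responses across semantically similar requests, can be illustrated with a minimal sketch; the class name, the 0.92 threshold, and the flat linear scan are illustrative assumptions, not the paper's design:

```python
import numpy as np

class SemanticCache:
    """Minimal semantic cache: serve a stored response when a new
    request's embedding is close enough to a cached request's."""

    def __init__(self, threshold=0.92):
        self.threshold = threshold  # cosine-similarity cutoff (tunable)
        self.keys = []              # unit-norm request embeddings
        self.values = []            # cached responses

    def lookup(self, emb):
        if not self.keys:
            return None
        emb = emb / np.linalg.norm(emb)
        sims = np.stack(self.keys) @ emb      # cosine similarity
        best = int(np.argmax(sims))
        return self.values[best] if sims[best] >= self.threshold else None

    def insert(self, emb, response):
        self.keys.append(emb / np.linalg.norm(emb))
        self.values.append(response)
```

Unlike an exact-hit cache, the hit condition here is a similarity threshold, which is exactly what breaks the classic cache assumptions the abstract alludes to.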
Draft-Conditioned Constrained Decoding for Structured Generation in LLMs
arXiv:2603.03305v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used to generate executable outputs, JSON objects, and API calls, where a single syntax error can make the output unusable. Constrained decoding enforces validity token-by-token via masking and renormalization,...
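The masking-and-renormalization step this abstract refers to is standard and easy to sketch. The code below illustrates that generic mechanism only, not the paper's draft-conditioning contribution; the function name and the source of `valid_token_ids` are assumptions:

```python
import numpy as np

def constrained_step(logits, valid_token_ids):
    """One step of constrained decoding: mask tokens that would violate
    the target format, then renormalize over the remaining ones.

    `valid_token_ids` would come from a grammar or automaton tracking
    which continuations keep the output (e.g. JSON) syntactically valid.
    """
    mask = np.full_like(logits, -np.inf)
    mask[valid_token_ids] = 0.0
    masked = logits + mask
    probs = np.exp(masked - masked.max())   # softmax over valid tokens only
    return probs / probs.sum()
```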
Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation
arXiv:2603.03306v1 Announce Type: cross Abstract: The recently presented Token-Oriented Object Notation (TOON) aims to replace JSON as a serialization format for passing structured data to LLMs with significantly reduced token usage. While TOON shows solid accuracy in LLM comprehension, there is a...
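Why a tabular format can cut tokens relative to JSON is easy to see: JSON repeats every key in every record. The sketch below contrasts JSON with a generic header-plus-rows layout; it is deliberately not TOON's actual grammar, just the shape of the saving:

```python
import json

records = [{"id": i, "name": f"user{i}", "active": True} for i in range(3)]

as_json = json.dumps(records)
# Header-plus-rows sketch of the idea behind token-lean formats
# (illustrative only -- NOT actual TOON syntax): fields stated once.
as_tabular = "id,name,active\n" + "\n".join(
    f'{r["id"]},{r["name"]},{r["active"]}' for r in records
)
print(len(as_json), len(as_tabular))  # repeated keys make JSON longer
```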
Automated Concept Discovery for LLM-as-a-Judge Preference Analysis
arXiv:2603.03319v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used as scalable evaluators of model outputs, but their preference judgments exhibit systematic biases and can diverge from human evaluations. Prior work on LLM-as-a-judge has largely focused on a...
Discern Truth from Falsehood: Reducing Over-Refusal via Contrastive Refinement
arXiv:2603.03323v1 Announce Type: cross Abstract: Large language models (LLMs) aligned for safety often suffer from over-refusal, the tendency to reject seemingly toxic but benign prompts by misclassifying them as toxic. This behavior undermines models' helpfulness and restricts usability in sensitive...
Controlling Chat Style in Language Models via Single-Direction Editing
arXiv:2603.03324v1 Announce Type: cross Abstract: Controlling stylistic attributes in large language models (LLMs) remains challenging, with existing approaches relying on either prompt engineering or post-training alignment. This paper investigates this challenge through the lens of representation engineering, testing the hypothesis...
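A common representation-engineering recipe for the "single direction" hypothesis is to take the difference of mean activations between contrastive prompt sets and add it back at inference. The sketch below follows that recipe under the assumption, not confirmed by the truncated abstract, that the paper's editing works similarly:

```python
import numpy as np

def style_direction(acts_styled, acts_neutral):
    """Estimate one style direction as the difference of mean hidden
    states over styled vs. neutral prompts (rows = examples)."""
    d = acts_styled.mean(axis=0) - acts_neutral.mean(axis=0)
    return d / np.linalg.norm(d)

def edit(hidden, direction, alpha=1.0):
    """Shift a hidden state along the direction; a negative alpha
    suppresses the style instead of amplifying it."""
    return hidden + alpha * direction
```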
IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference
arXiv:2603.03325v1 Announce Type: cross Abstract: Large language models (LLMs) have become integral to modern Human-AI collaboration workflows, where accurately understanding user intent serves as a crucial step for generating satisfactory responses. Context-aware intent understanding, which involves inferring user intentions from...
Controllable and explainable personality sliders for LLMs at inference time
arXiv:2603.03326v1 Announce Type: cross Abstract: Aligning Large Language Models (LLMs) with specific personas typically relies on expensive and monolithic Supervised Fine-Tuning (SFT) or RLHF. While effective, these methods require training distinct models for every target personality profile. Inference-time activation steering...
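Read as activation steering, a "slider" is just a user-set coefficient on a per-trait direction. The sketch below is one hypothetical reading of that interface; the trait names and the linear combination are illustrative assumptions:

```python
import numpy as np

def apply_sliders(hidden, directions, sliders):
    """Steer a hidden state with one coefficient per personality trait.

    directions: dict trait -> unit vector in activation space
    sliders:    dict trait -> strength in [-1, 1], set at inference time
    """
    out = np.array(hidden, copy=True)
    for trait, weight in sliders.items():
        out += weight * directions[trait]
    return out

# e.g. apply_sliders(h, dirs, {"extraversion": 0.8, "agreeableness": -0.3})
```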
How LLMs Cite and Why It Matters: A Cross-Model Audit of Reference Fabrication in AI-Assisted Academic Writing and Methods to Detect Phantom Citations
arXiv:2603.03299v1 Announce Type: new Abstract: Large language models (LLMs) have been noted to fabricate scholarly citations, yet the scope of this behavior across providers, domains, and prompting conditions remains poorly quantified. We present one of the largest citation hallucination audits...
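One simple detector in the spirit of the abstract's "methods to detect phantom citations" is to check whether a cited string resolves to anything in a bibliographic database. The sketch below queries the public Crossref API; the score threshold is an illustrative assumption, and the paper's actual detectors may differ:

```python
import requests

def looks_real(citation: str, min_score: float = 60.0) -> bool:
    """Heuristic phantom-citation check: ask Crossref whether anything
    bibliographically similar to the cited string exists.

    A low top-result relevance score suggests the reference may be
    fabricated; the threshold here is illustrative, not from the paper.
    """
    r = requests.get(
        "https://api.crossref.org/works",
        params={"query.bibliographic": citation, "rows": 1},
        timeout=10,
    )
    items = r.json()["message"]["items"]
    return bool(items) and items[0].get("score", 0.0) >= min_score
```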
Fragile Thoughts: How Large Language Models Handle Chain-of-Thought Perturbations
arXiv:2603.03332v1 Announce Type: new Abstract: Chain-of-Thought (CoT) prompting has emerged as a foundational technique for eliciting reasoning from Large Language Models (LLMs), yet the robustness of this approach to corruptions in intermediate reasoning steps remains poorly understood. This paper presents...
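The truncated abstract does not specify the perturbation taxonomy, but the kind of corruption being probed can be sketched with two simple operators, dropping or reordering intermediate reasoning steps:

```python
import random

def perturb_cot(steps, mode="drop", seed=0):
    """Apply a simple corruption to a chain-of-thought for robustness
    probing: drop a random step, or shuffle the step order."""
    rng = random.Random(seed)
    steps = list(steps)
    if mode == "drop" and len(steps) > 1:
        steps.pop(rng.randrange(len(steps)))
    elif mode == "shuffle":
        rng.shuffle(steps)
    return steps
```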
Tracing Pharmacological Knowledge In Large Language Models
arXiv:2603.03407v1 Announce Type: new Abstract: Large language models (LLMs) have shown strong empirical performance across pharmacology and drug discovery tasks, yet the internal mechanisms by which they encode pharmacological knowledge remain poorly understood. In this work, we investigate how drug-group...
A theoretical model of dynamical grammatical gender shifting based on set-valued set function
arXiv:2603.03510v1 Announce Type: new Abstract: This study investigates the diverse characteristics of nouns, focusing on both semantic (e.g., countable/uncountable) and morphosyntactic (e.g., masculine/feminine) distinctions. We explore inter-word variations for gender markers in noun morphology. Grammatical gender shift is a widespread...
AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis
arXiv:2603.03378v1 Announce Type: new Abstract: Large language model (LLM) agents offer a promising data-driven approach to automating Site Reliability Engineering (SRE), yet their enterprise deployment is constrained by three challenges: restricted access to proprietary data, unsafe action execution under permission-governed...
Heterogeneous Time Constants Improve Stability in Equilibrium Propagation
arXiv:2603.03402v1 Announce Type: new Abstract: Equilibrium propagation (EP) is a biologically plausible alternative to backpropagation for training neural networks. However, existing EP models use a uniform scalar time step dt, which corresponds biologically to a membrane time constant that is...
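The contrast the abstract draws, one shared time constant versus a per-neuron distribution, can be seen in a toy relaxation. This is leaky-integrator dynamics only, not a full equilibrium-propagation implementation, and the tau range is an illustrative assumption:

```python
import numpy as np

def relax(x0, target, tau, dt=0.01, steps=500):
    """Leaky relaxation dx/dt = (target - x) / tau toward a fixed point,
    with tau a per-neuron vector rather than a single scalar."""
    x = np.array(x0, dtype=float, copy=True)
    for _ in range(steps):
        x += dt * (target - x) / tau  # heterogeneous taus mix timescales
    return x

tau = np.random.uniform(0.5, 2.0, size=128)  # heterogeneous time constants
```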
Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning
arXiv:2603.03480v1 Announce Type: new Abstract: We study reinforcement learning with delayed state observation, where the agent observes the current state after some random number of time steps. We propose an algorithm that combines the augmentation method and the upper confidence...
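The "augmentation method" mentioned in the abstract is the standard trick of acting on the last observed state plus the actions taken since it. A minimal wrapper sketch follows; it assumes a gym-style environment that returns `None` while the observation is still in flight, which is an assumption of this sketch:

```python
from collections import deque

class DelayedObsWrapper:
    """Augmented state for delayed observations:
    (last observed state, actions taken since that observation)."""

    def __init__(self, env, max_delay):
        self.env = env
        self.pending = deque(maxlen=max_delay)

    def reset(self):
        self.last_obs = self.env.reset()
        self.pending.clear()
        return (self.last_obs, ())

    def step(self, action):
        self.pending.append(action)
        obs, reward, done, info = self.env.step(action)
        if obs is not None:            # delayed observation finally arrives
            self.last_obs = obs
            self.pending.clear()
        return (self.last_obs, tuple(self.pending)), reward, done, info
```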
Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory
arXiv:2603.03511v1 Announce Type: new Abstract: We aim to learn wavefunctions simulated by time-dependent density functional theory (TDDFT), which can be efficiently represented as linear combination coefficients of atomic orbitals. In real-time TDDFT, the electronic wavefunctions of a molecule evolve over...
Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction
arXiv:2603.03531v1 Announce Type: new Abstract: Accurate prediction of terrestrial ecosystem carbon fluxes (e.g., CO$_2$, GPP, and CH$_4$) is essential for understanding the global carbon cycle and managing its impacts. However, prediction remains challenging due to strong spatiotemporal heterogeneity: ecosystem flux...
Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs
arXiv:2603.03538v1 Announce Type: new Abstract: Large language models with chain-of-thought generation have demonstrated great potential for producing complex mathematical proofs. However, their reasoning can often go astray, leading to increasing interest in formal and learned verifiers. A major challenge in...
NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training
arXiv:2603.03597v1 Announce Type: new Abstract: The rapid progress of large language models (LLMs) is increasingly constrained by memory and deployment costs, motivating compression methods for practical deployment. Many state-of-the-art compression pipelines leverage the low-rank structure of trained weight matrices, a...
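The low-rank connection the abstract points to is concrete: the nuclear norm, the sum of singular values, is the standard convex surrogate for rank, so keeping it small during training should make post-hoc truncation cheap. A NumPy sketch of both quantities, as background only and not the NuMuon optimizer itself:

```python
import numpy as np

def nuclear_norm(W):
    """Sum of singular values: a convex proxy for rank, so constraining
    it during training encourages compressible weight matrices."""
    return np.linalg.svd(W, compute_uv=False).sum()

def low_rank_compress(W, r):
    """Post-hoc rank-r truncation, the kind of compression a small
    nuclear norm is meant to make cheap in accuracy."""
    U, S, Vh = np.linalg.svd(W, full_matrices=False)
    return (U[:, :r] * S[:r]) @ Vh[:r, :]
```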
Riemannian Optimization in Modular Systems
arXiv:2603.03610v1 Announce Type: new Abstract: Understanding how systems built out of modular components can be jointly optimized is an important problem in biology, engineering, and machine learning. The backpropagation algorithm is one such solution and has been instrumental in the...
Why Are Linear RNNs More Parallelizable?
arXiv:2603.03612v1 Announce Type: new Abstract: The community is increasingly exploring linear RNNs (LRNNs) as language models, motivated by their expressive power and parallelizability. While prior work establishes the expressivity benefits of LRNNs over transformers, it is unclear what makes LRNNs...
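The parallelizability in question comes from the fact that the linear recurrence h_t = a_t * h_{t-1} + b_t composes associatively, so it can be evaluated by a parallel prefix scan in O(log T) depth. A sketch of the associative operator and the sequential reference, as general background rather than the paper's specific analysis:

```python
import numpy as np

def combine(left, right):
    """Associative operator underlying linear-RNN parallelization.
    An element (a, b) represents the affine map h -> a*h + b; composing
    two such maps is again affine, so the sequence reduces with a
    parallel prefix-scan tree instead of a sequential loop."""
    a1, b1 = left
    a2, b2 = right
    return a1 * a2, a2 * b1 + b2

def scan(a, b):
    """Sequential reference: h_t = a_t * h_{t-1} + b_t with h_{-1} = 0."""
    h = np.zeros_like(b)
    h[0] = b[0]
    for t in range(1, len(b)):
        h[t] = a[t] * h[t - 1] + b[t]
    return h
```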
Extending Neural Operators: Robust Handling of Functions Beyond the Training Set
arXiv:2603.03621v1 Announce Type: new Abstract: We develop a rigorous framework for extending neural operators to handle out-of-distribution input functions. We leverage kernel approximation techniques and provide theory for characterizing the input-output function spaces in terms of Reproducing Kernel Hilbert Spaces...
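A minimal instance of the RKHS machinery the abstract invokes: fit a sampled input function by kernel ridge regression, where the coefficient quadratic form approximates the RKHS norm and can serve as a crude out-of-distribution score. The kernel choice, gamma, and the OOD use are illustrative assumptions, not the paper's framework:

```python
import numpy as np

def rbf(x, y, gamma=1.0):
    """Gaussian (RBF) kernel matrix between 1-D sample locations."""
    return np.exp(-gamma * (x[:, None] - y[None, :]) ** 2)

def rkhs_fit(x, fx, gamma=1.0, lam=1e-6):
    """Represent a sampled input function in an RBF RKHS via kernel
    ridge regression; alpha^T K alpha approximates the RKHS norm of
    the fit, usable as a rough out-of-distribution score."""
    K = rbf(x, x, gamma)
    alpha = np.linalg.solve(K + lam * np.eye(len(x)), fx)
    return alpha, float(alpha @ K @ alpha)
```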
Adaptive Sensing of Continuous Physical Systems for Machine Learning
arXiv:2603.03650v1 Announce Type: new Abstract: Physical dynamical systems can be viewed as natural information processors: their dynamics preserve, transform, and disperse input information. This perspective motivates learning not only from data generated by such systems, but also how to measure...
Graph Negative Feedback Bias Correction Framework for Adaptive Heterophily Modeling
arXiv:2603.03662v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) have emerged as a powerful framework for processing graph-structured data. However, conventional GNNs and their variants are inherently limited by the homophily assumption, leading to degradation in performance on heterophilic graphs....
JANUS: Structured Bidirectional Generation for Guaranteed Constraints and Analytical Uncertainty
arXiv:2603.03748v1 Announce Type: new Abstract: High-stakes synthetic data generation faces a fundamental Quadrilemma: achieving Fidelity to the original distribution, Control over complex logical constraints, Reliability in uncertainty estimation, and Efficiency in computational cost -- simultaneously. State-of-the-art Deep Generative Models (CTGAN,...
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
arXiv:2603.03756v1 Announce Type: new Abstract: While large language models (LLMs) show promise in scientific discovery, existing research focuses on inference or feedback-driven training, leaving the direct modeling of the generative reasoning process, $P(\text{hypothesis}|\text{background})$ ($P(h|b)$), unexplored. We demonstrate that directly training...
Relational In-Context Learning via Synthetic Pre-training with Structural Prior
arXiv:2603.03805v1 Announce Type: new Abstract: Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce and structurally heterogeneous, making...
Believe Your Model: Distribution-Guided Confidence Calibration
arXiv:2603.03872v1 Announce Type: new Abstract: Large Reasoning Models have demonstrated remarkable performance with the advancement of test-time scaling techniques, which enhance prediction accuracy by generating multiple candidate responses and selecting the most reliable answer. While prior work has analyzed that...
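The selection scheme this abstract describes, generating several candidates and picking the most reliable one, has a simple majority-vote baseline whose vote share doubles as a naive confidence score; presumably the paper's distribution-guided calibration aims to improve on exactly this kind of estimate:

```python
from collections import Counter

def select_with_confidence(answers):
    """Pick the modal answer among sampled candidates; its empirical
    frequency serves as a crude, typically miscalibrated, confidence."""
    counts = Counter(answers)
    answer, votes = counts.most_common(1)[0]
    return answer, votes / len(answers)

# e.g. select_with_confidence(["42", "42", "41", "42"]) -> ("42", 0.75)
```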