Tax Law

LOW Academic International

Me, Myself, and $\pi$ : Evaluating and Explaining LLM Introspection

arXiv:2603.20276v1 Announce Type: new Abstract: A hallmark of human intelligence is Introspection: the ability to assess and reason about one's own cognitive processes. Introspection has emerged as a promising but contested capability in large language models (LLMs). However, current evaluations often...

1 min 3 weeks, 4 days ago

tax

LOW Conference European Union

NeurIPS 2026 Evaluations & Datasets Track Call for Papers

6 min 3 weeks, 4 days ago

audit

LOW Academic European Union

Graph of States: Solving Abductive Tasks with Large Language Models

arXiv:2603.21250v1 Announce Type: new Abstract: Logical reasoning encompasses deduction, induction, and abduction. However, while Large Language Models (LLMs) have effectively mastered the former two, abductive reasoning remains significantly underexplored. Existing frameworks, predominantly designed for static deductive tasks, fail to generalize...

1 min 3 weeks, 4 days ago

deduction

LOW Academic United States

ReLaMix: Residual Latency-Aware Mixing for Delay-Robust Financial Time-Series Forecasting

arXiv:2603.20869v1 Announce Type: new Abstract: Financial time-series forecasting in real-world high-frequency markets is often hindered by delayed or partially stale observations caused by asynchronous data acquisition and transmission latency. To better reflect such practical conditions, we investigate a simulated delay...

1 min 3 weeks, 4 days ago

vat

LOW Academic United States

Enhancing Safety of Large Language Models via Embedding Space Separation

arXiv:2603.20206v1 Announce Type: new Abstract: Large language models (LLMs) have achieved impressive capabilities, yet ensuring their safety against harmful prompts remains a critical challenge. Recent work has revealed that the latent representations (embeddings) of harmful and safe queries in LLMs...

1 min 3 weeks, 4 days ago

vat

LOW Conference European Union

NeurIPS Main Track Handbook

11 min 3 weeks, 4 days ago

vat

LOW Academic United States

GMPilot: An Expert AI Agent For FDA cGMP Compliance

arXiv:2603.20815v1 Announce Type: new Abstract: The pharmaceutical industry is facing challenges with quality management such as high costs of compliance, slow responses and disjointed knowledge. This paper presents GMPilot, a domain-specific AI agent that is designed to support FDA cGMP...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

Compression is all you need: Modeling Mathematics

arXiv:2603.20396v1 Announce Type: new Abstract: Human mathematics (HM), the mathematics humans discover and value, is a vanishingly small subset of formal mathematics (FM), the totality of all valid deductions. We argue that HM is distinguished by its compressibility through hierarchically...

1 min 3 weeks, 4 days ago

deduction

LOW Academic International

Reasoning Traces Shape Outputs but Models Won't Say So

arXiv:2603.20620v1 Announce Type: new Abstract: Can we trust the reasoning traces that large reasoning models (LRMs) produce? We investigate whether these traces faithfully reflect what drives model outputs, and whether models will honestly report their influence. We introduce Thought Injection,...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

A Modular LLM Framework for Explainable Price Outlier Detection

arXiv:2603.20636v1 Announce Type: new Abstract: Detecting product price outliers is important for retail and e-commerce stores as erroneous or unexpectedly high prices adversely affect competitiveness, revenue, and consumer trust. Classical techniques offer simple thresholds while ignoring the rich semantic relationships...

1 min 3 weeks, 4 days ago

audit

LOW Academic International

PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

arXiv:2603.20673v1 Announce Type: new Abstract: Retrieval-augmented language models can retrieve relevant evidence yet still commit to answers before explicitly checking whether the retrieved context supports the conclusion. We present PAVE (Premise-Grounded Answer Validation and Editing), an inference-time validation layer for...

1 min 3 weeks, 4 days ago

audit

LOW Academic European Union

The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing

arXiv:2603.20795v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as knowledge bases, but keeping them up to date requires targeted knowledge editing (KE). However, it remains unclear how edits are implemented inside the model once applied. In...

1 min 3 weeks, 4 days ago

vat

LOW Academic United States

RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

arXiv:2603.20799v1 Announce Type: new Abstract: Reinforcement learning from verifiable rewards (RLVR) stimulates the thinking processes of large language models (LLMs), substantially enhancing their reasoning abilities on verifiable tasks. It is often assumed that similar gains should transfer to general question...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

BenchBench: Benchmarking Automated Benchmark Generation

arXiv:2603.20807v1 Announce Type: new Abstract: Benchmarks are the de facto standard for tracking progress in large language models (LLMs), yet static test sets can rapidly saturate, become vulnerable to contamination, and are costly to refresh. Scalable evaluation of open-ended items...

1 min 3 weeks, 4 days ago

audit

LOW Academic United States

LLM Router: Prefill is All You Need

arXiv:2603.20895v1 Announce Type: new Abstract: LLMs often share comparable benchmark accuracies, but their complementary performance across task subsets suggests that an Oracle router--a theoretical selector with perfect foresight--can significantly surpass standalone model accuracy by navigating model-specific strengths. While current routers...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

The Hidden Puppet Master: A Theoretical and Real-World Account of Emotional Manipulation in LLMs

arXiv:2603.20907v1 Announce Type: new Abstract: As users increasingly turn to LLMs for practical and personal advice, they become vulnerable to being subtly steered toward hidden incentives misaligned with their own interests. Prior works have benchmarked persuasion and manipulation detection, but...

1 min 3 weeks, 4 days ago

tax

LOW Academic United States

Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

arXiv:2603.20957v1 Announce Type: new Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store copies of training data. They further rely on safety alignment strategies via RLHF, system prompts, and output filters to block...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

Reading Between the Lines: How Electronic Nonverbal Cues shape Emotion Decoding

arXiv:2603.21038v1 Announce Type: new Abstract: As text-based computer-mediated communication (CMC) increasingly structures everyday interaction, a central question re-emerges with new urgency: How do users reconstruct nonverbal expression in environments where embodied cues are absent? This paper provides a systematic, theory-driven...

1 min 3 weeks, 4 days ago

tax

LOW Academic International

MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

arXiv:2603.20295v1 Announce Type: new Abstract: Uncovering causal structures from observational data is crucial for understanding complex systems and making informed decisions. While reinforcement learning (RL) has shown promise in identifying these structures in the form of a directed acyclic graph...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

Transformer-Based Predictive Maintenance for Risk-Aware Instrument Calibration

arXiv:2603.20297v1 Announce Type: new Abstract: Accurate calibration is essential for instruments whose measurements must remain traceable, reliable, and compliant over long operating periods. Fixed-interval programs are easy to administer, but they ignore that instruments drift at different rates under different...

1 min 3 weeks, 4 days ago

vat

LOW Academic European Union

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

arXiv:2603.20315v1 Announce Type: new Abstract: (a) Many air quality forecasting studies report gains from machine learning, but evaluations often use static chronological splits and omit persistence baselines, so the operational added value under routine updating is unclear. (b) Using 2,350...

1 min 3 weeks, 4 days ago

vat

LOW Academic United States

Interpretable Multiple Myeloma Prognosis with Observational Medical Outcomes Partnership Data

arXiv:2603.20341v1 Announce Type: new Abstract: Machine learning (ML) promises better clinical decision-making, yet opaque model behavior limits the adoption in healthcare. We propose two novel regularization techniques for ensuring the interpretability of ML models trained on real-world data. In particular,...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

CAMA: Exploring Collusive Adversarial Attacks in c-MARL

arXiv:2603.20390v1 Announce Type: new Abstract: Cooperative multi-agent reinforcement learning (c-MARL) has been widely deployed in real-world applications, such as social robots, embodied intelligence, UAV swarms, etc. Nevertheless, many adversarial attacks still exist to threaten various c-MARL systems. At present, the...

1 min 3 weeks, 4 days ago

vat

LOW Academic International

Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

arXiv:2603.20406v1 Announce Type: new Abstract: We investigate whether independently trained language models converge to geometrically compatible latent representations, and whether this compatibility can be exploited to correct model behavior at inference time without any weight updates. We learn a linear...

1 min 3 weeks, 4 days ago

vat

LOW Academic European Union

SDE-Driven Spatio-Temporal Hypergraph Neural Networks for Irregular Longitudinal fMRI Connectome Modeling in Alzheimer's Disease

arXiv:2603.20452v1 Announce Type: new Abstract: Longitudinal neuroimaging is essential for modeling disease progression in Alzheimer's disease (AD), yet irregular sampling and missing visits pose substantial challenges for learning reliable temporal representations. To address this challenge, we propose SDE-HGNN, a stochastic...

1 min 3 weeks, 4 days ago

vat

LOW Academic European Union

From Data to Laws: Neural Discovery of Conservation Laws Without False Positives

arXiv:2603.20474v1 Announce Type: new Abstract: Conservation laws are fundamental to understanding dynamical systems, but discovering them from data remains challenging due to parameter variation, non-polynomial invariants, local minima, and false positives on chaotic systems. We introduce NGCG, a neural-symbolic pipeline...

1 min 3 weeks, 4 days ago

vat

LOW Academic European Union

RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization

arXiv:2603.20527v1 Announce Type: new Abstract: Preconditioned adaptive methods have gained significant attention for training deep neural networks, as they capture rich curvature information of the loss landscape. The central challenge in this field lies in balancing preconditioning effectiveness with...

1 min 3 weeks, 4 days ago

vat

LOW Academic United States

LJ-Bench: Ontology-Based Benchmark for U.S. Crime

arXiv:2603.20572v1 Announce Type: new Abstract: The potential of Large Language Models (LLMs) to provide harmful information remains a significant concern due to the vast breadth of illegal queries they may encounter. Unfortunately, existing benchmarks only focus on a handful of types...

1 min 3 weeks, 4 days ago

tax

LOW Academic International

Optimal low-rank stochastic gradient estimation for LLM training

arXiv:2603.20632v1 Announce Type: new Abstract: Large language model (LLM) training is often bottlenecked by memory constraints and stochastic gradient noise in extremely high-dimensional parameter spaces. Motivated by empirical evidence that many LLM gradient matrices are effectively low-rank during training, we...

1 min 3 weeks, 4 days ago

vat

LOW Academic European Union

CFNN: Continued Fraction Neural Network

arXiv:2603.20634v1 Announce Type: new Abstract: Accurately characterizing non-linear functional manifolds with singularities is a fundamental challenge in scientific computing. While Multi-Layer Perceptrons (MLPs) dominate, their spectral bias hinders resolving high-curvature features without excessive parameters. We introduce Continued Fraction Neural Networks...

1 min 3 weeks, 4 days ago

vat
