International Law

LOW Academic International

User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

arXiv:2603.20939v1 Announce Type: new Abstract: Large language models are increasingly used as personal assistants, yet most lack a persistent user model, forcing users to repeatedly restate preferences across sessions. We propose Vector-Adapted Retrieval Scoring (VARS), a pipeline-agnostic, frozen-backbone framework that...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Reading Between the Lines: How Electronic Nonverbal Cues shape Emotion Decoding

arXiv:2603.21038v1 Announce Type: new Abstract: As text-based computer-mediated communication (CMC) increasingly structures everyday interaction, a central question re-emerges with new urgency: How do users reconstruct nonverbal expression in environments where embodied cues are absent? This paper provides a systematic, theory-driven...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

arXiv:2603.20295v1 Announce Type: new Abstract: Uncovering causal structures from observational data is crucial for understanding complex systems and making informed decisions. While reinforcement learning (RL) has shown promise in identifying these structures in the form of a directed acyclic graph...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Collaborative Adaptive Curriculum for Progressive Knowledge Distillation

arXiv:2603.20296v1 Announce Type: new Abstract: Recent advances in collaborative knowledge distillation have demonstrated cutting-edge performance for resource-constrained distributed multimedia learning scenarios. However, achieving such competitiveness requires addressing a fundamental mismatch: high-dimensional teacher knowledge complexity versus heterogeneous client learning capacities, which...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms

arXiv:2603.20333v1 Announce Type: new Abstract: Modern autonomous multi-agent systems combine heterogeneous learning mechanisms operating at different timescales. An open question remains: can one formally guarantee that coupled dynamics of such mechanisms stay within the admissible operational regime? This paper studies...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX

arXiv:2603.20335v1 Announce Type: new Abstract: The Interest Public Group ARRONAX's C70XP cyclotron, used for radioisotope production for medical and research applications, relies on complex and costly systems that are prone to failures, leading to operational disruptions. In this context, this...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Graph-Aware Text-Only Backdoor Poisoning for Text-Attributed Graphs

arXiv:2603.20339v1 Announce Type: new Abstract: Many learning systems now use graph data in which each node also contains text, such as papers with abstracts or users with posts. Because these texts often come from open platforms, an attacker may be...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

The Multiverse of Time Series Machine Learning: an Archive for Multivariate Time Series Classification

arXiv:2603.20352v1 Announce Type: new Abstract: Time series machine learning (TSML) is a growing research field that spans a wide range of tasks. The popularity of established tasks such as classification, clustering, and extrinsic regression has, in part, been driven by...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

CAMA: Exploring Collusive Adversarial Attacks in c-MARL

arXiv:2603.20390v1 Announce Type: new Abstract: Cooperative multi-agent reinforcement learning (c-MARL) has been widely deployed in real-world applications, such as social robots, embodied intelligence, UAV swarms, etc. Nevertheless, many adversarial attacks still exist to threaten various c-MARL systems. At present, the...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning

arXiv:2603.20392v1 Announce Type: new Abstract: Probabilistic circuit (PC) structure learning is hampered by greedy algorithms that make irreversible, locally optimal decisions. We propose SymCircuit, which replaces greedy search with a learned generative policy trained via entropy-regularized reinforcement learning. Instantiating the...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

arXiv:2603.20397v1 Announce Type: new Abstract: The key-value (KV) cache is a foundational optimization in Transformer-based large language models (LLMs), eliminating redundant recomputation of past token representations during autoregressive generation. However, its memory footprint scales linearly with context length, imposing critical...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

arXiv:2603.20406v1 Announce Type: new Abstract: We investigate whether independently trained language models converge to geometrically compatible latent representations, and whether this compatibility can be exploited to correct model behavior at inference time without any weight updates. We learn a linear...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Data-driven discovery of roughness descriptors for surface characterization and intimate contact modeling of unidirectional composite tapes

arXiv:2603.20418v1 Announce Type: new Abstract: Unidirectional tapes surface roughness determines the evolution of the degree of intimate contact required for ensuring the thermoplastic molecular diffusion and the associated inter-tapes consolidation during manufacturing of composite structures. However, usual characterization of rough...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv:2603.20492v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse applications, yet their deployment remains challenging due to substantial computational costs, memory requirements, and energy consumption. Recent empirical studies have demonstrated that no single efficiency...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Does This Gradient Spark Joy?

arXiv:2603.20526v1 Announce Type: new Abstract: Policy gradient computes a backward pass for every sample, even though the backward pass is expensive and most samples carry little learning value. The Delightful Policy Gradient (DG) provides a forward-pass signal of learning value:...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Towards Practical Multimodal Hospital Outbreak Detection

arXiv:2603.20536v1 Announce Type: new Abstract: Rapid identification of outbreaks in hospitals is essential for controlling pathogens with epidemic potential. Although whole genome sequencing (WGS) remains the gold standard in outbreak investigations, its substantial costs and turnaround times limit its feasibility...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Understanding Behavior Cloning with Action Quantization

arXiv:2603.20538v1 Announce Type: new Abstract: Behavior cloning is a fundamental paradigm in machine learning, enabling policy learning from expert demonstrations across robotics, autonomous driving, and generative models. Autoregressive models like transformer have proven remarkably effective, from large language models (LLMs)...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

RECLAIM: Cyclic Causal Discovery Amid Measurement Noise

arXiv:2603.20585v1 Announce Type: new Abstract: Uncovering causal relationships is a fundamental problem across science and engineering. However, most existing causal discovery methods assume acyclicity and direct access to the system variables -- assumptions that fail to hold in many real-world...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv:2603.20586v1 Announce Type: new Abstract: As long-context language modeling becomes increasingly important, the cost of maintaining and attending to large Key/Value (KV) caches grows rapidly, becoming a major bottleneck in both training and inference. While prior works such as Multi-Query...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Bayesian Learning in Episodic Zero-Sum Games

arXiv:2603.20604v1 Announce Type: new Abstract: We study Bayesian learning in episodic, finite-horizon zero-sum Markov games with unknown transition and reward models. We investigate a posterior algorithm in which each player maintains a Bayesian posterior over the game model, independently samples...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Beyond Token Eviction: Mixed-Dimension Budget Allocation for Efficient KV Cache Compression

arXiv:2603.20616v1 Announce Type: new Abstract: Key-value (KV) caching is widely used to accelerate transformer inference, but its memory cost grows linearly with input length, limiting long-context deployment. Existing token eviction methods reduce memory by discarding less important tokens, which can...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Breaking the $O(\sqrt{T})$ Cumulative Constraint Violation Barrier while Achieving $O(\sqrt{T})$ Static Regret in Constrained Online Convex Optimization

arXiv:2603.20671v1 Announce Type: new Abstract: The problem of constrained online convex optimization is considered, where at each round, once a learner commits to an action $x_t \in \mathcal{X} \subset \mathbb{R}^d$, a convex loss function $f_t$ and a convex constraint function...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Centrality-Based Pruning for Efficient Echo State Networks

arXiv:2603.20684v1 Announce Type: new Abstract: Echo State Networks (ESNs) are a reservoir computing framework widely used for nonlinear time-series prediction. However, despite their effectiveness, the randomly initialized reservoir often contains redundant nodes, leading to unnecessary computational overhead and reduced efficiency....

1 min 3 weeks, 4 days ago

ear

LOW Academic International

OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation

arXiv:2603.20777v1 Announce Type: new Abstract: Robust semantic segmentation is crucial for safe autonomous driving, yet deployed models remain vulnerable to black-box adversarial attacks when target weights are unknown. Most existing approaches either craft image-wide perturbations or optimize patches for a...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

Achieving $\widetilde{O}(1/\epsilon)$ Sample Complexity for Bilinear Systems Identification under Bounded Noises

arXiv:2603.20819v1 Announce Type: new Abstract: This paper studies finite-sample set-membership identification for discrete-time bilinear systems under bounded symmetric log-concave disturbances. Compared with existing finite-sample results for linear systems and related analyses under stronger noise assumptions, we consider the more challenging...

1 min 3 weeks, 4 days ago

ear

LOW Academic International

GeoChallenge: A Multi-Answer Multiple-Choice Benchmark for Geometric Reasoning with Diagrams

arXiv:2603.19252v1 Announce Type: cross Abstract: Evaluating the symbolic reasoning of large language models (LLMs) calls for geometry benchmarks that require multi-step proofs grounded in both text and diagrams. However, existing benchmarks are often limited in scale and rarely provide visually...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

arXiv:2603.19579v1 Announce Type: new Abstract: Multi-objective reinforcement learning (MORL) provides an effective solution for decision-making problems involving conflicting objectives. However, achieving high-quality approximations to the Pareto policy set remains challenging, especially in complex tasks with continuous or high-dimensional state-action space....

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Pitfalls in Evaluating Interpretability Agents

arXiv:2603.20101v1 Announce Type: new Abstract: Automated interpretability systems aim to reduce the need for human labor and scale analysis to increasingly large models and diverse tasks. Recent efforts toward this goal leverage large language models (LLMs) at increasing levels of...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Teaching an Agent to Sketch One Part at a Time

arXiv:2603.19500v1 Announce Type: new Abstract: We develop a method for producing vector sketches one part at a time. To do this, we train a multi-modal language model-based agent using a novel multi-turn process-reward reinforcement learning following supervised fine-tuning. Our approach...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Hyperagents

arXiv:2603.19461v1 Announce Type: new Abstract: Self-improving AI systems aim to reduce reliance on human engineering by learning to improve their own learning and problem-solving processes. Existing approaches to self-improvement rely on fixed, handcrafted meta-level mechanisms, fundamentally limiting how fast such...

1 min 3 weeks, 5 days ago

ear

User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Reading Between the Lines: How Electronic Nonverbal Cues shape Emotion Decoding

MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

Collaborative Adaptive Curriculum for Progressive Knowledge Distillation

Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms

Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX

Graph-Aware Text-Only Backdoor Poisoning for Text-Attributed Graphs

The Multiverse of Time Series Machine Learning: an Archive for Multivariate Time Series Classification

CAMA: Exploring Collusive Adversarial Attacks in c-MARL

SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

Data-driven discovery of roughness descriptors for surface characterization and intimate contact modeling of unidirectional composite tapes

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

Does This Gradient Spark Joy?

Towards Practical Multimodal Hospital Outbreak Detection

Understanding Behavior Cloning with Action Quantization

RECLAIM: Cyclic Causal Discovery Amid Measurement Noise

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

Bayesian Learning in Episodic Zero-Sum Games

Beyond Token Eviction: Mixed-Dimension Budget Allocation for Efficient KV Cache Compression

Breaking the $O(\sqrt{T})$ Cumulative Constraint Violation Barrier while Achieving $O(\sqrt{T})$ Static Regret in Constrained Online Convex Optimization

Centrality-Based Pruning for Efficient Echo State Networks

OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation

Achieving $\widetilde{O}(1/\epsilon)$ Sample Complexity for Bilinear Systems Identification under Bounded Noises

GeoChallenge: A Multi-Answer Multiple-Choice Benchmark for Geometric Reasoning with Diagrams

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

Pitfalls in Evaluating Interpretability Agents

Teaching an Agent to Sketch One Part at a Time

Hyperagents

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.