Labor & Employment

LOW Academic European Union

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

arXiv:2604.05834v1 Announce Type: new Abstract: Multimodal contrastive learning is increasingly enriched by going beyond image-text pairs. Among recent contrastive methods, Symile is a strong approach for this challenge because its multiplicative interaction objective captures higher-order cross-modal dependence. Yet, we find...

1 min 1 week, 2 days ago

ada

LOW Academic European Union

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

arXiv:2604.04986v1 Announce Type: new Abstract: Model-free deep reinforcement learning (DRL) methods suffer from poor sample efficiency. To overcome this limitation, this work introduces an adaptive reduced-order-model (ROM)-based reinforcement learning framework for active flow control. In contrast to conventional actor--critic architectures,...

1 min 1 week, 2 days ago

ada

LOW Law Review United States

Shadow Derivatives: The Quiet Propertization of AI Learning

Introduction Artificial intelligence (AI) systems learn. In today’s AI markets, durable advantage comes less from any single output than from the learning that accumulates through training, fine-tuning, and downstream feedback loops.[1] Each interaction, correction, and deployment contributes incrementally to improved...

1 min 1 week, 2 days ago

ada

LOW Academic International

Simulating the Evolution of Alignment and Values in Machine Intelligence

arXiv:2604.05274v1 Announce Type: new Abstract: Model alignment is currently applied in a vacuum, evaluated primarily through standardised benchmark performance. The purpose of this study is to examine the effects of alignment on populations of models through time. We focus on...

1 min 1 week, 2 days ago

ada

LOW Academic International

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

arXiv:2604.04983v1 Announce Type: new Abstract: We present Territory Paint Wars, a minimal competitive multi-agent reinforcement learning environment implemented in Unity, and use it to systematically investigate failure modes of Proximal Policy Optimisation (PPO) under self-play. A first agent trained for...

1 min 1 week, 2 days ago

ada

LOW News United States

What oral arguments and opinion authorships can actually tell us

Empirical SCOTUS is a recurring series by Adam Feldman that looks at Supreme Court data, primarily in the form of opinions and oral arguments, to provide insights into the justices’ decision making and […]The postWhat oral arguments and opinion authorships...

1 min 1 week, 3 days ago

ada

LOW Law Review International

First Ideas

2 min 1 week, 3 days ago

termination

LOW Academic International

CAWN: Continuous Acoustic Wave Networks for Autoregressive Language Modeling

arXiv:2604.04250v1 Announce Type: new Abstract: Modern Large Language Models (LLMs) rely on Transformer self-attention, which scales quadratically with sequence length. Recent linear-time alternatives, like State Space Models (SSMs), often suffer from signal degradation over extended contexts. We introduce the Continuous...

1 min 1 week, 3 days ago

ada

LOW Academic European Union

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

arXiv:2604.04215v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) are emerging as a compelling alternative to dominant autoregressive models, replacing strictly sequential token generation with iterative denoising and parallel generation dynamics. However, their open-source ecosystem remains fragmented across model...

1 min 1 week, 3 days ago

ada

LOW Academic International

Scaling DPPs for RAG: Density Meets Diversity

arXiv:2604.03240v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding generation in external knowledge, yielding relevance responses that are aligned with factual evidence and evolving corpora. Standard RAG pipelines construct context through relevance ranking, performing...

1 min 1 week, 3 days ago

ada

LOW Academic International

Improving Model Performance by Adapting the KGE Metric to Account for System Non-Stationarity

arXiv:2604.03906v1 Announce Type: new Abstract: Geoscientific systems tend to be characterized by pronounced temporal non-stationarity, arising from seasonal and climatic variability in hydrometeorological drivers, and from natural and anthropogenic changes to land use and cover. As has been pointed out,...

1 min 1 week, 3 days ago

ada

LOW Academic International

Apparent Age Estimation: Challenges and Outcomes

arXiv:2604.03335v1 Announce Type: new Abstract: Apparent age estimation is a valuable tool for business personalization, yet current models frequently exhibit demographic biases. We review prior works on the DEX method by applying distribution learning techniques such as Mean-Variance Loss (MVL)...

1 min 1 week, 3 days ago

ada

LOW Academic International

Researchers waste 80% of LLM annotation costs by classifying one text at a time

arXiv:2604.03684v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly being used for text classification across the social sciences, yet researchers overwhelmingly classify one text per variable per prompt. Coding 100,000 texts on four variables requires 400,000 API calls....

1 min 1 week, 3 days ago

ada

LOW Academic International

Shorter, but Still Trustworthy? An Empirical Study of Chain-of-Thought Compression

arXiv:2604.04120v1 Announce Type: new Abstract: Long chain-of-thought (Long-CoT) reasoning models have motivated a growing body of work on compressing reasoning traces to reduce inference cost, yet existing evaluations focus almost exclusively on task accuracy and token savings. Trustworthiness properties, whether...

1 min 1 week, 3 days ago

ada

LOW Academic United States

RUQuant: Towards Refining Uniform Quantization for Large Language Models

arXiv:2604.04013v1 Announce Type: new Abstract: The increasing size and complexity of large language models (LLMs) have raised significant challenges in deployment efficiency, particularly under resource constraints. Post-training quantization (PTQ) has emerged as a practical solution by compressing models without requiring...

1 min 1 week, 3 days ago

ada

LOW Academic International

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

arXiv:2604.03388v1 Announce Type: new Abstract: When deploying large language models (LLMs) to safety-critical applications, uncertainty quantification (UQ) is of utmost importance to self-assess the reliability of the LLM-based decisions. However, such decisions typically suffer from overconfidence, particularly after parameter-efficient fine-tuning...

1 min 1 week, 3 days ago

ada

LOW Academic United States

Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

arXiv:2604.04064v1 Announce Type: new Abstract: Small language models (SLMs) in the 100M-10B parameter range increasingly power production systems, yet whether they possess the internal emotion representations recently discovered in frontier models remains unknown. We present the first comparative analysis of...

1 min 1 week, 3 days ago

ada

LOW Academic International

Don't Blink: Evidence Collapse during Multimodal Reasoning

arXiv:2604.04207v1 Announce Type: new Abstract: Reasoning VLMs can become more accurate while progressively losing visual grounding as they think. This creates task-conditional danger zones where low-entropy predictions are confident but ungrounded, a failure mode text-only monitoring cannot detect. Evaluating three...

1 min 1 week, 3 days ago

ada

LOW Academic International

When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling

arXiv:2604.03562v1 Announce Type: new Abstract: Adaptive reward design for deep reinforcement learning (DRL) in multi-beam LEO satellite scheduling is motivated by the intuition that regime-aware reward weights should outperform static ones. We systematically test this intuition and uncover a switching-stability...

1 min 1 week, 3 days ago

ada

LOW Academic International

AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

arXiv:2604.03925v1 Announce Type: new Abstract: Large language models struggle to accumulate evidence across multiple rounds of user interaction, failing to update their beliefs in a manner consistent with Bayesian inference. Existing solutions require fine-tuning on sensitive user interaction data, limiting...

1 min 1 week, 3 days ago

ada

LOW Academic United States

Embedding Enhancement via Fine-Tuned Language Models for Learner-Item Cognitive Modeling

arXiv:2604.04088v1 Announce Type: new Abstract: Learner-item cognitive modeling plays a central role in the web-based online intelligent education system by enabling cognitive diagnosis (CD) across diverse online educational scenarios. Although ID embedding remains the mainstream approach in cognitive modeling due...

1 min 1 week, 3 days ago

ada

LOW Academic International

MultiPress: A Multi-Agent Framework for Interpretable Multimodal News Classification

arXiv:2604.03586v1 Announce Type: new Abstract: With the growing prevalence of multimodal news content, effective news topic classification demands models capable of jointly understanding and reasoning over heterogeneous data such as text and images. Existing methods often process modalities independently or...

1 min 1 week, 3 days ago

labor

LOW Academic United States

Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents

arXiv:2604.04131v1 Announce Type: new Abstract: Large language model agents that use external tools are often implemented through reactive execution, in which reasoning is repeatedly recomputed after each observation, increasing latency and sensitivity to error propagation. This work introduces Profile--Then--Reason (PTR),...

1 min 1 week, 3 days ago

ada

LOW Academic United States

Solar-VLM: Multimodal Vision-Language Models for Augmented Solar Power Forecasting

arXiv:2604.04145v1 Announce Type: new Abstract: Photovoltaic (PV) power forecasting plays a critical role in power system dispatch and market participation. Because PV generation is highly sensitive to weather conditions and cloud motion, accurate forecasting requires effective modeling of complex spatiotemporal...

1 min 1 week, 3 days ago

ada

LOW Academic International

LightThinker++: From Reasoning Compression to Memory Management

arXiv:2604.03679v1 Announce Type: new Abstract: Large language models (LLMs) excel at complex reasoning, yet their efficiency is limited by the surging cognitive overhead of long thought traces. In this paper, we propose LightThinker, a method that enables LLMs to dynamically...

1 min 1 week, 3 days ago

ada

LOW Academic European Union

Neural Operators for Multi-Task Control and Adaptation

arXiv:2604.03449v1 Announce Type: new Abstract: Neural operator methods have emerged as powerful tools for learning mappings between infinite-dimensional function spaces, yet their potential in optimal control remains largely unexplored. We focus on multi-task control problems, whose solution is a mapping...

1 min 1 week, 3 days ago

ada

LOW Academic International

Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation

arXiv:2604.03592v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models exhibit striking performance disparities across languages, yet the internal mechanisms driving these gaps remain poorly understood. In this work, we conduct a systematic analysis of expert routing patterns in MoE models, revealing...

1 min 1 week, 3 days ago

ada

LOW Academic United States

Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents

arXiv:2604.04157v1 Announce Type: new Abstract: Theory of Mind (ToM) -- the ability to model others' mental states -- is fundamental to human social cognition. Whether large language models (LLMs) can develop ToM has been tested exclusively through static vignettes, leaving...

1 min 1 week, 3 days ago

ada

LOW Academic International

BWTA: Accurate and Efficient Binarized Transformer by Algorithm-Hardware Co-design

arXiv:2604.03957v1 Announce Type: new Abstract: Ultra low-bit quantization brings substantial efficiency for Transformer-based models, but the accuracy degradation and limited GPU support hinder its wide usage. In this paper, we analyze zero-point distortion in binarization and propose a Binary Weights...

1 min 1 week, 3 days ago

ada

LOW Academic International

Why Attend to Everything? Focus is the Key

arXiv:2604.03260v1 Announce Type: new Abstract: We introduce Focus, a method that learns which token pairs matter rather than approximating all of them. Learnable centroids assign tokens to groups; distant attention is restricted to same-group pairs while local attention operates at...

1 min 1 week, 3 days ago

ada

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

Shadow Derivatives: The Quiet Propertization of AI Learning

Simulating the Evolution of Alignment and Values in Machine Intelligence

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

What oral arguments and opinion authorships can actually tell us

First Ideas

CAWN: Continuous Acoustic Wave Networks for Autoregressive Language Modeling

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Scaling DPPs for RAG: Density Meets Diversity

Improving Model Performance by Adapting the KGE Metric to Account for System Non-Stationarity

Apparent Age Estimation: Challenges and Outcomes

Researchers waste 80% of LLM annotation costs by classifying one text at a time

Shorter, but Still Trustworthy? An Empirical Study of Chain-of-Thought Compression

RUQuant: Towards Refining Uniform Quantization for Large Language Models

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

Don't Blink: Evidence Collapse during Multimodal Reasoning

When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling

AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

Embedding Enhancement via Fine-Tuned Language Models for Learner-Item Cognitive Modeling

MultiPress: A Multi-Agent Framework for Interpretable Multimodal News Classification

Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents

Solar-VLM: Multimodal Vision-Language Models for Augmented Solar Power Forecasting

LightThinker++: From Reasoning Compression to Memory Management

Neural Operators for Multi-Task Control and Adaptation

Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation

Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents

BWTA: Accurate and Efficient Binarized Transformer by Algorithm-Hardware Co-design

Why Attend to Everything? Focus is the Key

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.