Labor & Employment

LOW Academic European Union

Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration

arXiv:2603.18417v1 Announce Type: new Abstract: Sparse attention mechanisms promise to break the quadratic bottleneck of long-context transformers, yet production adoption remains limited by a critical usability gap: optimal hyperparameters vary substantially across layers and models, and current methods (e.g., SpargeAttn)...

1 min 4 weeks ago

ada

LOW News European Union

Musk’s tactic of blaming users for Grok sex images may be foiled by EU law

Planned EU ban on nudify apps would likely force Musk to make Grok less "spicy."

1 min 4 weeks, 1 day ago

union

LOW Academic European Union

Theoretical Foundations of Latent Posterior Factors: Formal Guarantees for Multi-Evidence Reasoning

arXiv:2603.15674v1 Announce Type: new Abstract: We present a complete theoretical characterization of Latent Posterior Factors (LPF), a principled framework for aggregating multiple heterogeneous evidence items in probabilistic prediction tasks. Multi-evidence reasoning arises pervasively in high-stakes domains including healthcare diagnosis, financial...

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics

arXiv:2603.16148v1 Announce Type: new Abstract: We ask whether a pure spiking backbone can learn large-scale language modeling from random initialization, without Transformer distillation. We introduce NeuronSpark, a 0.9B-parameter SNN language model trained with next-token prediction and surrogate gradients. The model...

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure

arXiv:2603.15643v1 Announce Type: new Abstract: Green Stormwater Infrastructure (GSI) systems, such as permeable pavement, rain gardens, and bioretention facilities, require continuous inspection and maintenance to ensure long-term performance. However, domain knowledge about GSI is often scattered across municipal manuals, regulatory...

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

Tackling Over-smoothing on Hypergraphs: A Ricci Flow-guided Neural Diffusion Approach

arXiv:2603.15696v1 Announce Type: new Abstract: Hypergraph neural networks (HGNNs) have demonstrated strong capabilities in modeling complex higher-order relationships. However, existing HGNNs often suffer from over-smoothing as the number of layers increases and lack effective control over message passing among nodes....

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning

arXiv:2603.15797v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated exceptional logical reasoning capabilities but frequently struggle with the continuous spatiotemporal dynamics governed by Partial Differential Equations (PDEs), often resulting in non-physical hallucinations. Existing approaches typically resort to costly,...

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

Generative Inverse Design with Abstention via Diagonal Flow Matching

arXiv:2603.15925v1 Announce Type: new Abstract: Inverse design aims to find design parameters $x$ achieving target performance $y^*$. Generative approaches learn bidirectional mappings between designs and labels, enabling diverse solution sampling. However, standard conditional flow matching (CFM), when adapted to inverse...

1 min 4 weeks, 2 days ago

ada

LOW Academic European Union

DyACE: Dynamic Algorithm Co-evolution for Online Automated Heuristic Design with Large Language Model

arXiv:2603.13344v1 Announce Type: new Abstract: The prevailing paradigm in Automated Heuristic Design (AHD) typically relies on the assumption that a single, fixed algorithm can effectively navigate the shifting dynamics of a combinatorial search. This static approach often proves inadequate for...

1 min 1 month ago

ada

LOW Academic European Union

The ARC of Progress towards AGI: A Living Survey of Abstraction and Reasoning

arXiv:2603.13372v1 Announce Type: new Abstract: The Abstraction and Reasoning Corpus (ARC-AGI) has become a key benchmark for fluid intelligence in AI. This survey presents the first cross-generation analysis of 82 approaches across three benchmark versions and the ARC Prize 2024-2025...

1 min 1 month ago

ada

LOW Academic European Union

ICaRus: Identical Cache Reuse for Efficient Multi Model Inference

arXiv:2603.13281v1 Announce Type: new Abstract: Multi model inference has recently emerged as a prominent paradigm, particularly in the development of agentic AI systems. However, in such scenarios, each model must maintain its own Key-Value (KV) cache for the identical prompt,...

1 min 1 month ago

ada

LOW Academic European Union

Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning

arXiv:2603.12290v1 Announce Type: cross Abstract: Scholarly web is a vast network of knowledge connected by citations. However, this system is increasingly compromised by miscitation, where references do not support or even contradict the claims they are cited for. Current miscitation...

1 min 1 month ago

labor

LOW Academic European Union

A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning

arXiv:2603.12304v1 Announce Type: cross Abstract: This paper introduces a novel optimization framework that fundamentally integrates the Minimum Description Length (MDL) principle into the training dynamics of deep neural networks. Moving beyond its conventional role as a model selection criterion, we...

1 min 1 month ago

ada

LOW Academic European Union

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

arXiv:2603.12269v1 Announce Type: cross Abstract: Early-exit deep neural networks enable adaptive inference by terminating computation when sufficient confidence is achieved, reducing cost for edge AI accelerators in resource-constrained settings. Existing methods, however, rely on suboptimal exit policies, ignore input difficulty,...

1 min 1 month ago

ada

LOW Academic European Union

ActTail: Global Activation Sparsity in Large Language Models

arXiv:2603.12272v1 Announce Type: new Abstract: Activation sparsity is a promising approach for accelerating large language model (LLM) inference by reducing computation and memory movement. However, existing activation sparsity methods typically apply uniform sparsity across projections, ignoring the heterogeneous statistical properties...

1 min 1 month ago

ada

LOW Academic European Union

Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback

arXiv:2603.12471v1 Announce Type: new Abstract: Effective personalized feedback is critical to students' literacy development. Though LLM-powered tools now promise to automate such feedback at scale, LLMs are not language-neutral: they privilege standard academic English and reproduce social stereotypes, raising concerns...

1 min 1 month ago

ada

LOW Academic European Union

98$\times$ Faster LLM Routing Without a Dedicated GPU: Flash Attention, Prompt Compression, and Near-Streaming for the vLLM Semantic Router

arXiv:2603.12646v1 Announce Type: new Abstract: System-level routers that intercept LLM requests for safety classification, domain routing, and PII detection must be both fast and operationally lightweight: they should add minimal latency to every request, yet not require a dedicated GPU...

1 min 1 month ago

ada

LOW Academic European Union

Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

arXiv:2603.12983v1 Announce Type: new Abstract: Error Span Detection (ESD) is a crucial subtask in Machine Translation (MT) evaluation, aiming to identify the location and severity of translation errors. While fine-tuning models on human-annotated data improves ESD performance, acquiring such data...

1 min 1 month ago

ada

LOW Academic European Union

Interpretable Semantic Gradients in SSD: A PCA Sweep Approach and a Case Study on AI Discourse

arXiv:2603.13038v1 Announce Type: new Abstract: Supervised Semantic Differential (SSD) is a mixed quantitative-interpretive method that models how text meaning varies with continuous individual-difference variables by estimating a semantic gradient in an embedding space and interpreting its poles through clustering and...

1 min 1 month ago

labor

LOW Academic European Union

Lyapunov Stable Graph Neural Flow

arXiv:2603.12557v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) are highly vulnerable to adversarial perturbations in both topology and features, making the learning of robust representations a critical challenge. In this work, we bridge GNNs with control theory to introduce...

1 min 1 month ago

ada

LOW Academic European Union

When Drafts Evolve: Speculative Decoding Meets Online Learning

arXiv:2603.12617v1 Announce Type: new Abstract: Speculative decoding has emerged as a widely adopted paradigm for accelerating large language model inference, where a lightweight draft model rapidly generates candidate tokens that are then verified in parallel by a larger target model....

1 min 1 month ago

ada

LOW Academic European Union

Adaptive Diffusion Posterior Sampling for Data and Model Fusion of Complex Nonlinear Dynamical Systems

arXiv:2603.12635v1 Announce Type: new Abstract: High-fidelity numerical simulations of chaotic, high dimensional nonlinear dynamical systems are computationally expensive, necessitating the development of efficient surrogate models. Most surrogate models for such systems are deterministic, for example when neural operators are involved....

1 min 1 month ago

ada

LOW Academic European Union

Semi-Synthetic Parallel Data for Translation Quality Estimation: A Case Study of Dataset Building for an Under-Resourced Language Pair

arXiv:2603.11743v1 Announce Type: new Abstract: Quality estimation (QE) plays a crucial role in machine translation (MT) workflows, as it serves to evaluate generated outputs that have no reference translations and to determine whether human post-editing or full retranslation is necessary....

1 min 1 month ago

ada

LOW Academic European Union

High-resolution weather-guided surrogate modeling for data-efficient cross-location building energy prediction

arXiv:2603.11121v1 Announce Type: new Abstract: Building design optimization often depends on physics-based simulation tools such as EnergyPlus, which, although accurate, are computationally expensive and slow. Surrogate models provide a faster alternative, yet most are location-specific, and even weather-informed variants require...

1 min 1 month ago

ada

LOW Academic European Union

Algorithmic Capture, Computational Complexity, and Inductive Bias of Infinite Transformers

arXiv:2603.11161v1 Announce Type: new Abstract: We formally define Algorithmic Capture (i.e., ``grokking'' an algorithm) as the ability of a neural network to generalize to arbitrary problem sizes ($T$) with controllable error and minimal sample adaptation, distinguishing true algorithmic learning from...

1 min 1 month ago

ada

LOW Academic European Union

Reference-Guided Machine Unlearning

arXiv:2603.11210v1 Announce Type: new Abstract: Machine unlearning aims to remove the influence of specific data from trained models while preserving general utility. Existing approximate unlearning methods often rely on performance-degradation heuristics, such as loss maximization or random labeling. However, these...

1 min 1 month ago

ada

LOW Academic European Union

UniHetCO: A Unified Heterogeneous Representation for Multi-Problem Learning in Unsupervised Neural Combinatorial Optimization

arXiv:2603.11456v1 Announce Type: new Abstract: Unsupervised neural combinatorial optimization (NCO) offers an appealing alternative to supervised approaches by training learning-based solvers without ground-truth solutions, directly minimizing instance objectives and constraint violations. Yet for graph node subset-selection problems (e.g., Maximum Clique...

1 min 1 month ago

ada

LOW Academic European Union

PoultryLeX-Net: Domain-Adaptive Dual-Stream Transformer Architecture for Large-Scale Poultry Stakeholder Modeling

arXiv:2603.09991v1 Announce Type: cross Abstract: The rapid growth of the global poultry industry, driven by rising demand for affordable animal protein, has intensified public discourse surrounding production practices, housing, management, animal welfare, and supply-chain transparency. Social media platforms such as...

1 min 1 month ago

ada

LOW Academic European Union

Fine-Tune, Don't Prompt, Your Language Model to Identify Biased Language in Clinical Notes

arXiv:2603.10004v1 Announce Type: new Abstract: Clinical documentation can contain emotionally charged language with stigmatizing or privileging valences. We present a framework for detecting and classifying such language as stigmatizing, privileging, or neutral. We constructed a curated lexicon of biased terms...

1 min 1 month ago

ada

LOW Academic European Union

A Principle-Driven Adaptive Policy for Group Cognitive Stimulation Dialogue for Elderly with Cognitive Impairment

arXiv:2603.10034v1 Announce Type: new Abstract: Cognitive impairment is becoming a major public health challenge. Cognitive Stimulation Therapy (CST) is an effective intervention for cognitive impairment, but traditional methods are difficult to scale, and existing digital systems struggle with group dialogues...

1 min 1 month ago

ada

Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration

Musk’s tactic of blaming users for Grok sex images may be foiled by EU law

Theoretical Foundations of Latent Posterior Factors: Formal Guarantees for Multi-Evidence Reasoning

NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics

GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure

Tackling Over-smoothing on Hypergraphs: A Ricci Flow-guided Neural Diffusion Approach

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning

Generative Inverse Design with Abstention via Diagonal Flow Matching

DyACE: Dynamic Algorithm Co-evolution for Online Automated Heuristic Design with Large Language Model

The ARC of Progress towards AGI: A Living Survey of Abstraction and Reasoning

ICaRus: Identical Cache Reuse for Efficient Multi Model Inference

Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning

A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

ActTail: Global Activation Sparsity in Large Language Models

Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback

98$\times$ Faster LLM Routing Without a Dedicated GPU: Flash Attention, Prompt Compression, and Near-Streaming for the vLLM Semantic Router

Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

Interpretable Semantic Gradients in SSD: A PCA Sweep Approach and a Case Study on AI Discourse

Lyapunov Stable Graph Neural Flow

When Drafts Evolve: Speculative Decoding Meets Online Learning

Adaptive Diffusion Posterior Sampling for Data and Model Fusion of Complex Nonlinear Dynamical Systems

Semi-Synthetic Parallel Data for Translation Quality Estimation: A Case Study of Dataset Building for an Under-Resourced Language Pair

High-resolution weather-guided surrogate modeling for data-efficient cross-location building energy prediction

Algorithmic Capture, Computational Complexity, and Inductive Bias of Infinite Transformers

Reference-Guided Machine Unlearning

UniHetCO: A Unified Heterogeneous Representation for Multi-Problem Learning in Unsupervised Neural Combinatorial Optimization

PoultryLeX-Net: Domain-Adaptive Dual-Stream Transformer Architecture for Large-Scale Poultry Stakeholder Modeling

Fine-Tune, Don't Prompt, Your Language Model to Identify Biased Language in Clinical Notes

A Principle-Driven Adaptive Policy for Group Cognitive Stimulation Dialogue for Elderly with Cognitive Impairment

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.