AI & Technology Law

LOW Academic International

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

arXiv:2604.05546v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) enable sophisticated reasoning over images and videos, yet their inference is hindered by a systemic efficiency barrier known as visual token dominance. This overhead is driven by a multi-regime interplay between...

1 min 1 week, 2 days ago

ai algorithm

LOW Academic International

CODESTRUCT: Code Agents over Structured Action Spaces

arXiv:2604.05407v1 Announce Type: new Abstract: LLM-based code agents treat repositories as unstructured text, applying edits through brittle string matching that frequently fails due to formatting drift or ambiguous patterns. We propose reframing the codebase as a structured action space where...

1 min 1 week, 2 days ago

ai llm

LOW Academic United States

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

arXiv:2604.05172v1 Announce Type: new Abstract: Large language model (LLM) agents are increasingly deployed to automate productivity tasks (e.g., email, scheduling, document management), but evaluating them on live services is risky due to potentially irreversible changes. Existing benchmarks rely on simplified...

1 min 1 week, 2 days ago

ai llm

LOW Academic International

Multi-Drafter Speculative Decoding with Alignment Feedback

arXiv:2604.05417v1 Announce Type: new Abstract: Speculative decoding (SD) accelerates large language model (LLM) inference by using a smaller model to draft future tokens, which are then verified by the target LLM. This preserves generation quality by accepting only aligned tokens....

1 min 1 week, 2 days ago

ai llm

LOW News United States

I can’t help rooting for tiny open source AI model maker Arcee

Arcee is a tiny 26-person U.S. startup that built a high-performing, massive, open source LLM. And it's gaining popularity with OpenClaw users.

1 min 1 week, 3 days ago

ai llm

LOW Law Review United States

The Higher Education Accommodation Mistake

1 min 1 week, 3 days ago

ai llm

LOW Academic International

AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals in Government Services

arXiv:2604.03672v1 Announce Type: new Abstract: Government agencies worldwide face growing volumes of citizen appeals, with electronic submissions increasing significantly over recent years. Traditional manual processing averages 20 minutes per appeal with only 67% classification accuracy, creating significant bottlenecks in public...

1 min 1 week, 3 days ago

ai deep learning

LOW Academic United States

The Format Tax

arXiv:2604.03616v1 Announce Type: new Abstract: Asking a large language model to respond in JSON should be a formatting choice, not a capability tax. Yet we find that structured output requirements -- JSON, XML, LaTeX, Markdown -- substantially degrade reasoning and...

1 min 1 week, 3 days ago

ai bias

LOW Academic International

Cultural Authenticity: Comparing LLM Cultural Representations to Native Human Expectations

arXiv:2604.03493v1 Announce Type: new Abstract: Cultural representation in Large Language Model (LLM) outputs has primarily been evaluated through the proxies of cultural diversity and factual accuracy. However, a crucial gap remains in assessing cultural alignment: the degree to which generated...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

LightThinker++: From Reasoning Compression to Memory Management

arXiv:2604.03679v1 Announce Type: new Abstract: Large language models (LLMs) excel at complex reasoning, yet their efficiency is limited by the surging cognitive overhead of long thought traces. In this paper, we propose LightThinker, a method that enables LLMs to dynamically...

1 min 1 week, 3 days ago

ai llm

LOW Academic European Union

Evolutionary Search for Automated Design of Uncertainty Quantification Methods

arXiv:2604.03473v1 Announce Type: new Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on domain knowledge and heuristics, limiting their scalability and generality. We apply LLM-powered evolutionary search to automatically discover unsupervised UQ methods...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Automated Analysis of Global AI Safety Initiatives: A Taxonomy-Driven LLM Approach

arXiv:2604.03533v1 Announce Type: new Abstract: We present an automated crosswalk framework that compares an AI safety policy document pair under a shared taxonomy of activities. Using the activity categories defined in Activity Map on AI Safety as fixed aspects, the...

1 min 1 week, 3 days ago

ai llm

LOW Academic United States

Position: Science of AI Evaluation Requires Item-level Benchmark Data

arXiv:2604.03244v1 Announce Type: new Abstract: AI evaluations have become the primary evidence for deploying generative AI systems across high-stakes domains. However, current evaluation paradigms often exhibit systemic validity failures. These issues, ranging from unjustified design choices to misaligned metrics, remain...

1 min 1 week, 3 days ago

ai generative ai

LOW Academic International

I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation

arXiv:2604.03904v1 Announce Type: new Abstract: Large language models (LLMs) frequently produce confident but incorrect answers, partly because common binary scoring conventions reward answering over honestly expressing uncertainty. We study whether prompt-only interventions -- explicitly announcing reward schemes for answer-versus-abstain decisions...

1 min 1 week, 3 days ago

ai llm

LOW Academic United States

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

arXiv:2604.03922v1 Announce Type: new Abstract: Selecting LLM-generated code candidates using LLM-generated tests is challenging because the tests themselves may be incorrect. Existing methods either treat all tests equally or rely on ad-hoc heuristics to filter unreliable tests. Yet determining test...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning

arXiv:2604.03893v1 Announce Type: new Abstract: Breakthroughs in frontier theory often depend on the combination of concrete diagrammatic notations with rigorous logic. While multimodal large language models (MLLMs) show promise in general scientific tasks, current benchmarks often focus on local information...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference

arXiv:2604.03950v1 Announce Type: new Abstract: Transformer-based large language models (LLMs) have demonstrated remarkable performance across a wide range of real-world tasks, but their inference cost remains prohibitively high due to the quadratic complexity of attention and the memory bandwidth limitations...

1 min 1 week, 3 days ago

ai llm

LOW Academic United States

PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training

arXiv:2604.03675v1 Announce Type: new Abstract: In agentic search, large language models (LLMs) are trained to perform multi-turn retrieval and reasoning for complex tasks such as multi-hop question answering (QA). However, current search-based Reinforcement Learning (RL) methods suffer from two core...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Single-agent vs. Multi-agents for Automated Video Analysis of On-Screen Collaborative Learning Behaviors

arXiv:2604.03631v1 Announce Type: new Abstract: On-screen learning behavior provides valuable insights into how students seek, use, and create information during learning. Analyzing on-screen behavioral engagement is essential for capturing students' cognitive and collaborative processes. The recent development of Vision Language...

1 min 1 week, 3 days ago

ai autonomous

LOW Academic International

Selective Forgetting for Large Reasoning Models

arXiv:2604.03571v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) generate structured chains of thought (CoTs) before producing final answers, making them especially vulnerable to knowledge leakage through intermediate reasoning steps. Yet, the memorization of sensitive information in the training data...

1 min 1 week, 3 days ago

ai llm

LOW Academic European Union

Structural Rigidity and the 57-Token Predictive Window: A Physical Framework for Inference-Layer Governability in Large Language Models

arXiv:2604.03524v1 Announce Type: new Abstract: Current AI safety relies on behavioral monitoring and post-training alignment, yet empirical measurement shows these approaches produce no detectable pre-commitment signal in a majority of instruction-tuned models tested. We present an energy-based governance framework connecting...

1 min 1 week, 3 days ago

ai autonomous

LOW Academic International

Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty

arXiv:2604.04182v1 Announce Type: new Abstract: Non-stationary environments require agents to revise previously learned action values when contingencies change. We treat large language models (LLMs) as sequential decision policies in a two-option probabilistic reversal-learning task with three latent states and switch...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Explainable Model Routing for Agentic Workflows

arXiv:2604.03527v1 Announce Type: new Abstract: Modern agentic workflows decompose complex tasks into specialized subtasks and route them to diverse models to minimize cost without sacrificing quality. However, current routing architectures focus exclusively on performance optimization, leaving underlying trade-offs between model...

1 min 1 week, 3 days ago

ai algorithm

LOW Academic European Union

When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression

arXiv:2604.03557v1 Announce Type: new Abstract: Reasoning hallucinations in large language models (LLMs) often appear as fluent yet unsupported conclusions that violate either the given context or underlying factual knowledge. Although such failures are widely observed, the mechanisms by which decoder-only...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Beyond Retrieval: Modeling Confidence Decay and Deterministic Agentic Platforms in Generative Engine Optimization

arXiv:2604.03656v1 Announce Type: new Abstract: Generative Engine Optimization (GEO) is rapidly reshaping digital marketing paradigms in the era of Large Language Models (LLMs). However, current GEO strategies predominantly rely on Retrieval-Augmented Generation (RAG), which inherently suffers from probabilistic hallucinations and...

1 min 1 week, 3 days ago

ai llm

LOW Academic European Union

Neural Global Optimization via Iterative Refinement from Noisy Samples

arXiv:2604.03614v1 Announce Type: new Abstract: Global optimization of black-box functions from noisy samples is a fundamental challenge in machine learning and scientific computing. Traditional methods such as Bayesian Optimization often converge to local minima on multi-modal functions, while gradient-free methods...

1 min 1 week, 3 days ago

ai machine learning

LOW Academic United States

Algebraic Diversity: Group-Theoretic Spectral Estimation from Single Observations

arXiv:2604.03634v1 Announce Type: new Abstract: We prove that temporal averaging over multiple observations can be replaced by algebraic group action on a single observation for second-order statistical estimation. A General Replacement Theorem establishes conditions under which a group-averaged estimator from...

1 min 1 week, 3 days ago

ai llm

LOW Academic European Union

Improving Feasibility via Fast Autoencoder-Based Projections

arXiv:2604.03489v1 Announce Type: new Abstract: Enforcing complex (e.g., nonconvex) operational constraints is a critical challenge in real-world learning and control systems. However, existing methods struggle to efficiently enforce general classes of constraints. To address this, we propose a novel data-driven...

1 min 1 week, 3 days ago

ai neural network

LOW Academic International

Are Arabic Benchmarks Reliable? QIMMA's Quality-First Approach to LLM Evaluation

arXiv:2604.03395v1 Announce Type: new Abstract: We present QIMMA, a quality-assured Arabic LLM leaderboard that places systematic benchmark validation at its core. Rather than aggregating existing resources as-is, QIMMA applies a multi-model assessment pipeline combining automated LLM judgment with human review...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Representational Collapse in Multi-Agent LLM Committees: Measurement and Diversity-Aware Consensus

arXiv:2604.03809v1 Announce Type: new Abstract: Multi-agent LLM committees replicate the same model under different role prompts and aggregate outputs by majority vote, implicitly assuming that agents contribute complementary evidence. We embed each agent's chain-of-thought rationale and measure pairwise similarity: across...

1 min 1 week, 3 days ago

ai llm

