VERT: Reliable LLM Judges for Radiology Report Evaluation
arXiv:2604.03376v1 Announce Type: new Abstract: Current literature on radiology report evaluation has focused primarily on designing LLM-based metrics and fine-tuning small models for chest X-rays. However, it remains unclear whether these approaches are robust when applied to reports from other...
k-Maximum Inner Product Attention for Graph Transformers and the Expressive Power of GraphGPS
arXiv:2604.03815v1 Announce Type: new Abstract: Graph transformers have shown promise in overcoming limitations of traditional graph neural networks, such as oversquashing and difficulties in modelling long-range dependencies. However, their application to large-scale graphs is hindered by the quadratic memory and...
The limits of bio-molecular modeling with large language models: a cross-scale evaluation
arXiv:2604.03361v1 Announce Type: new Abstract: The modeling of bio-molecular systems across molecular scales remains a central challenge in scientific research. Large language models (LLMs) are increasingly applied to bio-molecular discovery, yet systematic evaluation across multi-scale biological problems and rigorous assessment...
SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression
arXiv:2604.03258v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated impressive capabilities across various tasks, but the billion-scale parameters pose deployment challenges. Although existing methods attempt to reduce the scale of LLMs, they require either special hardware support or...
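The abstract is truncated, but low-rank decomposition as a compression primitive can be sketched generically. The following is an illustrative truncated-SVD factorization of a weight matrix, not SoLA's specific method (which combines it with soft activation sparsity):

```python
import numpy as np

def low_rank_compress(W: np.ndarray, rank: int):
    """Approximate W (d_out x d_in) by two factors A @ B of the given rank."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # d_out x rank, singular values folded in
    B = Vt[:rank, :]             # rank x d_in
    return A, B

rng = np.random.default_rng(0)
W = rng.standard_normal((256, 256))
A, B = low_rank_compress(W, rank=32)
# Storage drops from 256*256 parameters to 2*256*32 (a 4x reduction here).
print(A.shape, B.shape)
```

Truncated SVD gives the best rank-k approximation in Frobenius norm; the trade-off is that attainable ranks depend on how much of the spectrum the layer's weights concentrate in.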
CresOWLve: Benchmarking Creative Problem-Solving Over Real-World Knowledge
arXiv:2604.03374v1 Announce Type: new Abstract: Creative problem-solving requires combining multiple cognitive abilities, including logical reasoning, lateral thinking, analogy-making, and commonsense knowledge, to discover insights that connect seemingly unrelated pieces of information. However, most existing benchmarks for large language models (LLMs)...
CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection
arXiv:2604.04174v1 Announce Type: new Abstract: The proliferation of fake news across diverse domains highlights critical limitations in current detection systems, which often exhibit narrow domain specificity and poor generalization. Existing cross-domain approaches face two key challenges: (1) reliance on labelled...
PolySwarm: A Multi-Agent Large Language Model Framework for Prediction Market Trading and Latency Arbitrage
arXiv:2604.03888v1 Announce Type: new Abstract: This paper presents PolySwarm, a novel multi-agent large language model (LLM) framework designed for real-time prediction market trading and latency arbitrage on decentralized platforms such as Polymarket. PolySwarm deploys a swarm of 50 diverse LLM...
Automated Conjecture Resolution with Formal Verification
arXiv:2604.03789v1 Announce Type: new Abstract: Recent advances in large language models have significantly improved their ability to perform mathematical reasoning, extending from elementary problem solving to increasingly capable performance on research-level problems. However, reliably solving and verifying such problems remains...
Predict, Don't React: Value-Based Safety Forecasting for LLM Streaming
arXiv:2604.03962v1 Announce Type: new Abstract: In many practical LLM deployments, a single guardrail is used for both prompt and response moderation. Prompt moderation operates on fully observed text, whereas streaming response moderation requires safety decisions to be made over partial...
Representational Collapse in Multi-Agent LLM Committees: Measurement and Diversity-Aware Consensus
arXiv:2604.03809v1 Announce Type: new Abstract: Multi-agent LLM committees replicate the same model under different role prompts and aggregate outputs by majority vote, implicitly assuming that agents contribute complementary evidence. We embed each agent's chain-of-thought rationale and measure pairwise similarity: across...
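The diagnostic the abstract describes, embedding each agent's rationale and measuring pairwise similarity, can be sketched as a mean pairwise cosine similarity over agents (a hypothetical minimal version; the paper's exact measurement may differ):

```python
import numpy as np

def mean_pairwise_cosine(embeddings: np.ndarray) -> float:
    """Mean cosine similarity over all distinct pairs of row vectors (n x d)."""
    X = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = X @ X.T                    # n x n cosine-similarity matrix
    iu = np.triu_indices(len(X), k=1) # distinct pairs only
    return float(sims[iu].mean())

# Toy check: identical rationale embeddings give similarity 1.0,
# the fully collapsed case where majority vote adds no information.
same = np.tile(np.array([[1.0, 2.0, 3.0]]), (4, 1))
print(mean_pairwise_cosine(same))
```

A value near 1.0 across role prompts would indicate the committee is not contributing the complementary evidence that majority voting implicitly assumes.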
NativeTernary: A Self-Delimiting Binary Encoding with Unary Run-Length Hierarchy Markers for Ternary Neural Network Weights, Structured Data, and General Computing Infrastructure
arXiv:2604.03336v1 Announce Type: new Abstract: BitNet b1.58 (Ma et al., 2024) demonstrates that large language models can operate entirely on ternary weights {-1, 0, +1}, yet no native binary wire format exists for such models. NativeTernary closes this gap. We...
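NativeTernary's actual format (self-delimiting, with unary run-length hierarchy markers) is specified in the paper. As background on why a dedicated wire format matters at all, a plain fixed-width packing already fits five trits per byte, since 3^5 = 243 <= 256, i.e. 1.6 bits per weight versus 8 bits for one-byte-per-weight storage. A minimal sketch of that baseline packing:

```python
def pack_trits(weights):
    """Pack ternary weights {-1, 0, +1} five per byte (3**5 = 243 <= 256)."""
    trits = [w + 1 for w in weights]        # map {-1,0,+1} -> {0,1,2}
    trits += [0] * (-len(trits) % 5)        # pad to a multiple of 5
    out = bytearray()
    for i in range(0, len(trits), 5):
        v = 0
        for t in reversed(trits[i:i + 5]):  # little-endian base-3 digits
            v = v * 3 + t
        out.append(v)
    return bytes(out)

def unpack_trits(data, n):
    """Inverse of pack_trits; n is the original weight count (drops padding)."""
    vals = []
    for b in data:
        for _ in range(5):
            vals.append(b % 3 - 1)
            b //= 3
    return vals[:n]

w = [-1, 0, 1, 1, -1, 0, 0, 1]
assert unpack_trits(pack_trits(w), len(w)) == w
```

Unlike this fixed-width baseline, a self-delimiting encoding needs no out-of-band length field, which is presumably part of the gap the paper closes.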
DRAFT: Task Decoupled Latent Reasoning for Agent Safety
arXiv:2604.03242v1 Announce Type: new Abstract: The advent of tool-using LLM agents shifts safety monitoring from output moderation to auditing long, noisy interaction trajectories, where risk-critical evidence is sparse-making standard binary supervision poorly suited for credit assignment. To address this, we...
RUQuant: Towards Refining Uniform Quantization for Large Language Models
arXiv:2604.04013v1 Announce Type: new Abstract: The increasing size and complexity of large language models (LLMs) have raised significant challenges in deployment efficiency, particularly under resource constraints. Post-training quantization (PTQ) has emerged as a practical solution by compressing models without requiring...
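The uniform-quantization baseline that PTQ methods like this refine can be sketched as symmetric per-tensor rounding (an illustrative textbook version, not RUQuant's refinement):

```python
import numpy as np

def uniform_quantize(x: np.ndarray, bits: int = 4):
    """Symmetric uniform quantization to integers in [-2^(b-1), 2^(b-1)-1]."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(x).max() / qmax   # one scale per tensor
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal(1024).astype(np.float32)
q, s = uniform_quantize(w, bits=4)
err = np.abs(dequantize(q, s) - w).max()
# Rounding error is bounded by half a quantization step, scale / 2.
assert err <= s / 2 + 1e-6
```

The per-tensor max-abs scale is exactly what outliers break, since one large value inflates the step size for every other weight; finer-grained scales or outlier handling are the usual refinements.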
Researchers waste 80% of LLM annotation costs by classifying one text at a time
arXiv:2604.03684v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly being used for text classification across the social sciences, yet researchers overwhelmingly classify one text per variable per prompt. Coding 100,000 texts on four variables requires 400,000 API calls....
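The call-count arithmetic behind the abstract's example is straightforward: one text per variable per prompt costs 100,000 x 4 = 400,000 calls, while batching texts and variables into each prompt divides that. A sketch of the counting (the chosen batch sizes are illustrative; real savings also depend on prompt overhead and accuracy under batching):

```python
import math

def api_calls(n_texts, n_vars, texts_per_prompt=1, vars_per_prompt=1):
    """API calls needed to code n_texts on n_vars at the given batch sizes."""
    return (math.ceil(n_texts / texts_per_prompt)
            * math.ceil(n_vars / vars_per_prompt))

baseline = api_calls(100_000, 4)  # one text, one variable per call
batched = api_calls(100_000, 4, texts_per_prompt=20, vars_per_prompt=4)
print(baseline, batched)  # 400000 vs 5000 calls under these assumptions
```

Call count shrinks multiplicatively in both batch dimensions; the paper's headline 80% figure presumably reflects token-level costs rather than raw call counts.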
Adaptive Threshold-Driven Continuous Greedy Method for Scalable Submodular Optimization
arXiv:2604.03419v1 Announce Type: new Abstract: Submodular maximization under matroid constraints is a fundamental problem in combinatorial optimization with applications in sensing, data summarization, active learning, and resource allocation. While the Sequential Greedy (SG) algorithm achieves only a $\frac{1}{2}$-approximation due to...
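The Sequential Greedy baseline the abstract mentions, whose 1/2-approximation holds for general matroid constraints, can be illustrated on the simplest special case: max coverage under a cardinality-k constraint (where greedy in fact achieves the stronger 1 - 1/e guarantee). A minimal sketch:

```python
def greedy_max_coverage(sets, k):
    """Sequential greedy for max coverage, a monotone submodular objective,
    under a cardinality-k constraint (a uniform matroid)."""
    covered, chosen = set(), []
    for _ in range(k):
        best = max(range(len(sets)),
                   key=lambda i: -1 if i in chosen else len(sets[i] - covered))
        if not sets[best] - covered:
            break  # no remaining marginal gain
        chosen.append(best)
        covered |= sets[best]
    return chosen, covered

sets = [{1, 2, 3}, {3, 4}, {4, 5, 6}, {1, 6}]
chosen, covered = greedy_max_coverage(sets, k=2)
print(chosen, covered)  # picks indices 0 then 2, covering all six elements
```

Each iteration takes the element with the largest marginal gain; diminishing returns (submodularity) is what makes this myopic rule provably competitive.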
ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation
arXiv:2604.03922v1 Announce Type: new Abstract: Selecting LLM-generated code candidates using LLM-generated tests is challenging because the tests themselves may be incorrect. Existing methods either treat all tests equally or rely on ad-hoc heuristics to filter unreliable tests. Yet determining test...
LLM-Agent-based Social Simulation for Attitude Diffusion
arXiv:2604.03898v1 Announce Type: new Abstract: This paper introduces discourse_simulator, an open-source framework that combines LLMs with agent-based modelling. It offers a new way to simulate how public attitudes toward immigration change over time in response to salient events like protests,...
Evolutionary Search for Automated Design of Uncertainty Quantification Methods
arXiv:2604.03473v1 Announce Type: new Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on domain knowledge and heuristics, limiting their scalability and generality. We apply LLM-powered evolutionary search to automatically discover unsupervised UQ methods...
Self-Execution Simulation Improves Coding Models
arXiv:2604.03253v1 Announce Type: new Abstract: A promising research direction in enabling LLMs to generate consistently correct code involves addressing their inability to properly estimate program execution, particularly for code they generate. In this work, we demonstrate that Code LLMs can...
Single-agent vs. Multi-agents for Automated Video Analysis of On-Screen Collaborative Learning Behaviors
arXiv:2604.03631v1 Announce Type: new Abstract: On-screen learning behavior provides valuable insights into how students seek, use, and create information during learning. Analyzing on-screen behavioral engagement is essential for capturing students' cognitive and collaborative processes. The recent development of Vision Language...
LightThinker++: From Reasoning Compression to Memory Management
arXiv:2604.03679v1 Announce Type: new Abstract: Large language models (LLMs) excel at complex reasoning, yet their efficiency is limited by the surging cognitive overhead of long thought traces. In this paper, we propose LightThinker, a method that enables LLMs to dynamically...
I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation
arXiv:2604.03904v1 Announce Type: new Abstract: Large language models (LLMs) frequently produce confident but incorrect answers, partly because common binary scoring conventions reward answering over honestly expressing uncertainty. We study whether prompt-only interventions -- explicitly announcing reward schemes for answer-versus-abstain decisions...
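The incentive logic behind answer-versus-abstain reward schemes reduces to an expected-value comparison: with reward r_c for a correct answer, r_w for a wrong one, and r_a for abstaining, answering is rational at confidence p iff p*r_c + (1-p)*r_w >= r_a, i.e. above the threshold p* = (r_a - r_w) / (r_c - r_w). A sketch with illustrative reward values (not the paper's specific schemes):

```python
def should_answer(p, r_correct=1.0, r_wrong=-1.0, r_abstain=0.0):
    """Answer iff the expected score of answering beats abstaining."""
    return p * r_correct + (1 - p) * r_wrong >= r_abstain

def threshold(r_correct=1.0, r_wrong=-1.0, r_abstain=0.0):
    """Confidence p* above which answering is optimal."""
    return (r_abstain - r_wrong) / (r_correct - r_wrong)

print(threshold())              # 0.5 under symmetric +1/-1 scoring
print(threshold(r_wrong=-3.0))  # 0.75: harsher wrong-answer penalties raise the bar
```

Binary 1/0 scoring makes the threshold 0, so a rational model always guesses; announcing a scheme with r_wrong < r_abstain is what creates any incentive to abstain.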
Algebraic Diversity: Group-Theoretic Spectral Estimation from Single Observations
arXiv:2604.03634v1 Announce Type: new Abstract: We prove that temporal averaging over multiple observations can be replaced by algebraic group action on a single observation for second-order statistical estimation. A General Replacement Theorem establishes conditions under which a group-averaged estimator from...
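One familiar instance of trading temporal averaging for a group action, offered here only as intuition and not as the paper's General Replacement Theorem: for a shift-invariant process, averaging the rank-one covariance of a single observation over the cyclic shift group yields a circulant (shift-invariant) second-order estimate.

```python
import numpy as np

def cyclic_group_covariance(x: np.ndarray) -> np.ndarray:
    """Average the rank-one covariance of one observation over all cyclic
    shifts, producing a circulant (shift-invariant) estimate."""
    n = len(x)
    acc = np.zeros((n, n))
    for s in range(n):
        xs = np.roll(x, s)
        acc += np.outer(xs, xs)
    return acc / n

x = np.array([1.0, 2.0, 0.0, -1.0])
C = cyclic_group_covariance(x)
# Circulant structure: C[i, j] depends only on (i - j) mod n, so shifting
# both axes by one leaves the matrix unchanged.
assert np.allclose(C, np.roll(np.roll(C, 1, axis=0), 1, axis=1))
```

The group orbit of one sample plays the role that independent repeated observations play in ordinary sample-covariance estimation.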
AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals in Government Services
arXiv:2604.03672v1 Announce Type: new Abstract: Government agencies worldwide face growing volumes of citizen appeals, with electronic submissions increasing significantly over recent years. Traditional manual processing averages 20 minutes per appeal with only 67% classification accuracy, creating significant bottlenecks in public...
From Model-Based Screening to Data-Driven Surrogates: A Multi-Stage Workflow for Exploring Stochastic Agent-Based Models
arXiv:2604.03350v1 Announce Type: new Abstract: Systematic exploration of Agent-Based Models (ABMs) is challenged by the curse of dimensionality and their inherent stochasticity. We present a multi-stage pipeline integrating the systematic design of experiments with machine learning surrogates. Using a predator-prey...
Delayed Homomorphic Reinforcement Learning for Environments with Delayed Feedback
arXiv:2604.03641v1 Announce Type: new Abstract: Reinforcement learning in real-world systems is often accompanied by delayed feedback, which breaks the Markov assumption and impedes both learning and control. Canonical state augmentation approaches cause state-space explosion, which introduces a severe sample-complexity...
Beauty in the Eye of AI: Aligning LLMs and Vision Models with Human Aesthetics in Network Visualization
arXiv:2604.03417v1 Announce Type: new Abstract: Network visualization has traditionally relied on heuristic metrics, such as stress, under the assumption that optimizing them leads to aesthetic and informative layouts. However, no single metric consistently produces the most effective results. A data-driven...
When Models Know More Than They Say: Probing Analogical Reasoning in LLMs
arXiv:2604.03877v1 Announce Type: new Abstract: Analogical reasoning is a core cognitive faculty essential for narrative understanding. While LLMs perform well when surface and structural cues align, they struggle in cases where an analogy is not apparent on the surface but...
When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression
arXiv:2604.03557v1 Announce Type: new Abstract: Reasoning hallucinations in large language models (LLMs) often appear as fluent yet unsupported conclusions that violate either the given context or underlying factual knowledge. Although such failures are widely observed, the mechanisms by which decoder-only...
From Plausible to Causal: Counterfactual Semantics for Policy Evaluation in Simulated Online Communities
arXiv:2604.03920v1 Announce Type: new Abstract: LLM-based social simulations can generate believable community interactions, enabling ``policy wind tunnels'' where governance interventions are tested before deployment. But believability is not causality. Claims like ``intervention $A$ reduces escalation'' require causal semantics that current...