Litigation

LOW Academic International

BenchBrowser -- Collecting Evidence for Evaluating Benchmark Validity

arXiv:2603.18019v1 Announce Type: new Abstract: Do language model benchmarks actually measure what practitioners intend them to ? High-level metadata is too coarse to convey the granular reality of benchmarks: a "poetry" benchmark may never test for haikus, while "instruction-following" benchmarks...

1 min 4 weeks, 1 day ago

evidence

LOW Academic International

Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression

arXiv:2603.18426v1 Announce Type: new Abstract: What happens when multiple compression methods are combined-does the order in which they are applied matter? Joint model compression has emerged as a powerful strategy to achieve higher efficiency by combining multiple methods such as...

1 min 4 weeks, 1 day ago

standing

LOW Academic International

MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

arXiv:2603.18577v1 Announce Type: new Abstract: Text-guided image editors can now manipulate authentic medical scans with high fidelity, enabling lesion implantation/removal that threatens clinical trust and safety. Existing defenses are inadequate for healthcare. Medical detectors are largely black-box, while MLLM-based explainers...

1 min 4 weeks, 1 day ago

evidence

LOW Academic United States

The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

arXiv:2603.18294v1 Announce Type: new Abstract: Background: Clinical trials rely on transparent inclusion criteria to ensure generalizability. In contrast, benchmarks validating health-related large language models (LLMs) rarely characterize the "patient" or "query" populations they contain. Without defined composition, aggregate performance metrics...

1 min 4 weeks, 1 day ago

trial

LOW Academic International

Large-Scale Analysis of Political Propaganda on Moltbook

arXiv:2603.18349v1 Announce Type: new Abstract: We present an NLP-based study of political propaganda on Moltbook, a Reddit-style platform for AI agents. To enable large-scale analysis, we develop LLM-based classifiers to detect political propaganda, validated against expert annotation (Cohen's $\kappa$= 0.64-0.74)....

1 min 4 weeks, 1 day ago

evidence

LOW Academic International

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

arXiv:2603.18013v1 Announce Type: new Abstract: Large language models (LLMs) demonstrate the capacity to reconstruct and trace learned content from their training data under specific elicitation conditions, yet this capability does not manifest in standard generation contexts. This empirical observational study...

1 min 4 weeks, 1 day ago

standing

LOW Academic United States

How Psychological Learning Paradigms Shaped and Constrained Artificial Intelligence

arXiv:2603.18203v1 Announce Type: new Abstract: The dominant paradigms of artificial intelligence were shaped by learning theories from psychology: behaviorism inspired reinforcement learning, cognitivism gave rise to deep learning and memory-augmented architectures, and constructivism influenced curriculum learning and compositional approaches. This...

1 min 4 weeks, 1 day ago

standing

LOW Academic International

UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference

arXiv:2603.18446v1 Announce Type: new Abstract: Long-context inference remains challenging for large language models due to attention dilution and out-of-distribution degradation. Context selection mitigates this limitation by attending to a subset of key-value cache entries, yet most methods allocate a fixed...

1 min 4 weeks, 1 day ago

evidence

LOW Academic International

DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units

arXiv:2603.18612v1 Announce Type: new Abstract: We introduce DiscoPhon, a multilingual benchmark for evaluating unsupervised phoneme discovery from discrete speech units. DiscoPhon covers 6 dev and 6 test languages, chosen to span a wide range of phonemic contrasts. Given only 10...

1 min 4 weeks, 1 day ago

discovery

LOW Academic South Korea

Mi:dm K 2.5 Pro

arXiv:2603.18788v1 Announce Type: new Abstract: The evolving LLM landscape requires capabilities beyond simple text generation, prioritizing multi-step reasoning, long-context understanding, and agentic workflows. This shift challenges existing models in enterprise environments, especially in Korean-language and domain-specific scenarios where scaling is...

1 min 4 weeks, 1 day ago

standing

LOW Academic International

Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo

arXiv:2603.18873v1 Announce Type: new Abstract: Popular language learning applications such as Duolingo use large language models (LLMs) to generate lessons for its users. Most lessons focus on general real-world scenarios such as greetings, ordering food, or asking directions, with limited...

1 min 4 weeks, 1 day ago

standing

LOW Academic European Union

A Human-in/on-the-Loop Framework for Accessible Text Generation

arXiv:2603.18879v1 Announce Type: new Abstract: Plain Language and Easy-to-Read formats in text simplification are essential for cognitive accessibility. Yet current automatic simplification and evaluation pipelines remain largely automated, metric-driven, and fail to reflect user comprehension or normative standards. This paper...

1 min 4 weeks, 1 day ago

evidence

LOW Academic International

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

arXiv:2603.19008v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by grounding generation in external, non-parametric knowledge. However, when a task requires choosing among competing options, simply grounding generation in broadly relevant context is often insufficient to...

1 min 4 weeks, 1 day ago

evidence

LOW Academic United States

Frayed RoPE and Long Inputs: A Geometric Perspective

arXiv:2603.18017v1 Announce Type: new Abstract: Rotary Positional Embedding (RoPE) is a widely adopted technique for encoding position in language models, which, while effective, causes performance breakdown when input length exceeds training length. Prior analyses assert (rightly) that long inputs cause...

1 min 4 weeks, 1 day ago

standing

LOW Academic International

Towards Differentiating Between Failures and Domain Shifts in Industrial Data Streams

arXiv:2603.18032v1 Announce Type: new Abstract: Anomaly and failure detection methods are crucial in identifying deviations from normal system operational conditions, which allows for actions to be taken in advance, usually preventing more serious damages. Long-lasting deviations indicate failures, while sudden,...

1 min 4 weeks, 1 day ago

trial

LOW Academic International

Quotient Geometry and Persistence-Stable Metrics for Swarm Configurations

arXiv:2603.18041v1 Announce Type: new Abstract: Swarm and constellation reconfiguration can be viewed as motion of an unordered point configuration in an ambient space. Here, we provide persistence-stable, symmetry-invariant geometric representations for comparing and monitoring multi-agent configuration data. We introduce a...

1 min 4 weeks, 1 day ago

motion

LOW Academic International

AGRI-Fidelity: Evaluating the Reliability of Listenable Explanations for Poultry Disease Detection

arXiv:2603.18247v1 Announce Type: new Abstract: Existing XAI metrics measure faithfulness for a single model, ignoring model multiplicity where near-optimal classifiers rely on different or spurious acoustic cues. In noisy farm environments, stationary artifacts such as ventilation noise can produce explanations...

1 min 4 weeks, 1 day ago

discovery

LOW Academic United States

MolRGen: A Training and Evaluation Setting for De Novo Molecular Generation with Reasonning Models

arXiv:2603.18256v1 Announce Type: new Abstract: Recent advances in reasoning-based large language models (LLMs) have demonstrated substantial improvements in complex problem-solving tasks. Motivated by these advances, several works have explored the application of reasoning LLMs to drug discovery and molecular design....

1 min 4 weeks, 1 day ago

discovery

LOW Academic International

Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning

arXiv:2603.18257v1 Announce Type: new Abstract: Selecting relevant state dimensions in the presence of confounded distractors is a causal identification problem: observational statistics alone cannot reliably distinguish dimensions that correlate with actions from those that actions cause. We formalize this as...

1 min 4 weeks, 1 day ago

discovery

LOW Academic United States

Enactor: From Traffic Simulators to Surrogate World Models

arXiv:2603.18266v1 Announce Type: new Abstract: Traffic microsimulators are widely used to evaluate road network performance under various ``what-if" conditions. However, the behavior models controlling the actions of the actors are overly simplistic and fails to capture realistic actor-actor interactions. Deep...

1 min 4 weeks, 1 day ago

standing

LOW Academic United States

Path-Constrained Mixture-of-Experts

arXiv:2603.18297v1 Announce Type: new Abstract: Sparse Mixture-of-Experts (MoE) architectures enable efficient scaling by activating only a subset of parameters for each input. However, conventional MoE routing selects each layer's experts independently, creating N^L possible expert paths -- for N experts...

1 min 4 weeks, 1 day ago

standing

LOW Academic International

Escaping Offline Pessimism: Vector-Field Reward Shaping for Safe Frontier Exploration

arXiv:2603.18326v1 Announce Type: new Abstract: While offline reinforcement learning provides reliable policies for real-world deployment, its inherent pessimism severely restricts an agent's ability to explore and collect novel data online. Drawing inspiration from safe reinforcement learning, exploring near the boundary...

1 min 4 weeks, 1 day ago

motion

LOW Academic International

Epistemic Generative Adversarial Networks

arXiv:2603.18348v1 Announce Type: new Abstract: Generative models, particularly Generative Adversarial Networks (GANs), often suffer from a lack of output diversity, frequently generating similar samples rather than a wide range of variations. This paper introduces a novel generalization of the GAN...

1 min 4 weeks, 1 day ago

evidence

LOW Academic International

FlowMS: Flow Matching for De Novo Structure Elucidation from Mass Spectra

arXiv:2603.18397v1 Announce Type: new Abstract: Mass spectrometry (MS) stands as a cornerstone analytical technique for molecular identification, yet de novo structure elucidation from spectra remains challenging due to the combinatorial complexity of chemical space and the inherent ambiguity of spectral...

1 min 4 weeks, 1 day ago

discovery

LOW Academic European Union

Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration

arXiv:2603.18417v1 Announce Type: new Abstract: Sparse attention mechanisms promise to break the quadratic bottleneck of long-context transformers, yet production adoption remains limited by a critical usability gap: optimal hyperparameters vary substantially across layers and models, and current methods (e.g., SpargeAttn)...

1 min 4 weeks, 1 day ago

discovery

LOW Academic United States

Seeking Universal Shot Language Understanding Solutions

arXiv:2603.18448v1 Announce Type: new Abstract: Shot language understanding (SLU) is crucial for cinematic analysis but remains challenging due to its diverse cinematographic dimensions and subjective expert judgment. While vision-language models (VLMs) have shown strong ability in general visual understanding, recent...

1 min 4 weeks, 1 day ago

standing

LOW News European Union

Jeff Bezos reportedly wants $100 billion to buy and transform old manufacturing firms with AI

The Amazon magnate has a new project centered around acquiring industrial firms and revamping them with AI technology.

1 min 4 weeks, 1 day ago

trial

LOW Law Review United States

Whatever Did Happen to the Antitrust Movement?

ARTICLE Whatever Did Happen to the Antitrust Movement? Herbert Hovenkamp* Antitrust in the United States today is caught between its pursuit of technical rules designed to define and implement defensible economic goals, and increasingly political calls for a new antitrust...

1 min 4 weeks, 1 day ago

trial

LOW Academic United States

Federated Multi Agent Deep Learning and Neural Networks for Advanced Distributed Sensing in Wireless Networks

arXiv:2603.16881v1 Announce Type: new Abstract: Multi-agent deep learning (MADL), including multi-agent deep reinforcement learning (MADRL), distributed/federated training, and graph-structured neural networks, is becoming a unifying framework for decision-making and inference in wireless systems where sensing, communication, and computing are tightly...

1 min 4 weeks, 2 days ago

trial

LOW Academic International

Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models

arXiv:2603.17044v1 Announce Type: new Abstract: Unified multimodal models share a language model backbone for both understanding and generating images. Can DPO align both capabilities simultaneously? We present the first systematic study of this question, applying DPO to Janus-Pro at 1B...

1 min 4 weeks, 2 days ago

standing

BenchBrowser -- Collecting Evidence for Evaluating Benchmark Validity

Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression

MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

Large-Scale Analysis of Political Propaganda on Moltbook

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

How Psychological Learning Paradigms Shaped and Constrained Artificial Intelligence

UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference

DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units

Mi:dm K 2.5 Pro

Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo

A Human-in/on-the-Loop Framework for Accessible Text Generation

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

Frayed RoPE and Long Inputs: A Geometric Perspective

Towards Differentiating Between Failures and Domain Shifts in Industrial Data Streams

Quotient Geometry and Persistence-Stable Metrics for Swarm Configurations

AGRI-Fidelity: Evaluating the Reliability of Listenable Explanations for Poultry Disease Detection

MolRGen: A Training and Evaluation Setting for De Novo Molecular Generation with Reasonning Models

Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning

Enactor: From Traffic Simulators to Surrogate World Models

Path-Constrained Mixture-of-Experts

Escaping Offline Pessimism: Vector-Field Reward Shaping for Safe Frontier Exploration

Epistemic Generative Adversarial Networks

FlowMS: Flow Matching for De Novo Structure Elucidation from Mass Spectra

Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration

Seeking Universal Shot Language Understanding Solutions

Jeff Bezos reportedly wants $100 billion to buy and transform old manufacturing firms with AI

Whatever Did Happen to the Antitrust Movement?

Federated Multi Agent Deep Learning and Neural Networks for Advanced Distributed Sensing in Wireless Networks

Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.