International Law

LOW Academic International

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery

arXiv:2604.02346v1 Announce Type: cross Abstract: Large language models (LLMs) are in the ascendancy for research in drug discovery, offering unprecedented opportunities to reshape drug research by accelerating hypothesis generation, optimizing candidate prioritization, and enabling more scalable and cost-effective drug discovery...

1 min 1 week, 4 days ago

LOW Academic International

Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis

arXiv:2604.02359v1 Announce Type: cross Abstract: General-purpose Large Language Models (LLMs) are becoming widely adopted by people for mental health support. Yet emerging evidence suggests there are significant risks associated with high-frequency use, particularly for individuals suffering from psychosis, as LLMs...

1 min 1 week, 4 days ago

LOW Academic International

Causal-Audit: A Framework for Risk Assessment of Assumption Violations in Time-Series Causal Discovery

arXiv:2604.02488v1 Announce Type: new Abstract: Time-series causal discovery methods rely on assumptions such as stationarity, regular sampling, and bounded temporal dependence. When these assumptions are violated, structure learning can produce confident but misleading causal graphs without warning. We introduce Causal-Audit,...

1 min 1 week, 4 days ago

LOW Academic International

Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation

arXiv:2604.03174v1 Announce Type: new Abstract: Large language models (LLMs) encode vast world knowledge in their parameters, yet they remain fundamentally limited by static knowledge, finite context windows, and weakly structured causal reasoning. This survey provides a unified account of augmentation...

1 min 1 week, 4 days ago

LOW Academic International

Audio Spatially-Guided Fusion for Audio-Visual Navigation

arXiv:2604.02389v1 Announce Type: cross Abstract: Audio-visual Navigation refers to an agent utilizing visual and auditory information in complex 3D environments to accomplish target localization and path planning, thereby achieving autonomous navigation. The core challenge of this task lies in the...

1 min 1 week, 4 days ago

LOW Academic International

Verbalizing LLMs' assumptions to explain and control sycophancy

arXiv:2604.03058v1 Announce Type: new Abstract: LLMs can be socially sycophantic, affirming users when they ask questions like "am I in the wrong?" rather than providing genuine assessment. We hypothesize that this behavior arises from incorrect assumptions about the user, like...

1 min 1 week, 4 days ago

LOW Academic International

Dynamic Mask Enhanced Intelligent Multi-UAV Deployment for Urban Vehicular Networks

arXiv:2604.02358v1 Announce Type: cross Abstract: Vehicular Ad Hoc Networks (VANETs) play a crucial role in realizing vehicle-road collaboration and intelligent transportation. However, urban VANETs often face challenges such as frequent link disconnections and subnet fragmentation, which hinder reliable connectivity. To...

1 min 1 week, 4 days ago

LOW Academic International

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

arXiv:2604.02355v1 Announce Type: new Abstract: Combining Chain-of-Thought (CoT) with Reinforcement Learning (RL) improves text-to-image (T2I) generation, yet the underlying interaction between CoT's exploration and RL's optimization remains unclear. We present a systematic entropy-based analysis that yields three key insights: (1)...

1 min 1 week, 4 days ago

LOW Academic International

Mitigating Data Scarcity in Spaceflight Applications for Offline Reinforcement Learning Using Physics-Informed Deep Generative Models

arXiv:2604.02438v1 Announce Type: new Abstract: The deployment of reinforcement learning (RL)-based controllers on physical systems is often limited by poor generalization to real-world scenarios, known as the simulation-to-reality (sim-to-real) gap. This gap is particularly challenging in spaceflight, where real-world training...

1 min 1 week, 4 days ago

LOW Academic International

YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

arXiv:2604.02378v1 Announce Type: new Abstract: Forecasting startup success is notoriously difficult, partly because meaningful outcomes, such as exits, large funding rounds, and sustained revenue growth, are rare and can take years to materialize. As a result, signals are sparse and...

1 min 1 week, 4 days ago

LOW Academic International

DeltaLogic: Minimal Premise Edits Reveal Belief-Revision Failures in Logical Reasoning Models

arXiv:2604.02733v1 Announce Type: new Abstract: Reasoning benchmarks typically evaluate whether a model derives the correct answer from a fixed premise set, but they under-measure a closely related capability that matters in dynamic environments: belief revision under minimal evidence change. We...

1 min 1 week, 4 days ago

LOW Academic International

Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization

arXiv:2604.02528v1 Announce Type: new Abstract: The new Specifications for the National Bridge Inventory (SNBI), in effect from 2022, emphasize the use of element-level condition states (CS) for risk-based bridge management. Instead of a general component rating, element-level condition data use...

1 min 1 week, 4 days ago

LOW Academic International

An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages

arXiv:2604.02596v1 Announce Type: new Abstract: In-context learning (ICL) allows large language models (LLMs) to adapt to new tasks from a few examples, making it promising for languages underrepresented in pre-training. Recent work on many-shot ICL suggests that modern LLMs can...

1 min 1 week, 4 days ago

LOW Academic International

Matrix Profile for Time-Series Anomaly Detection: A Reproducible Open-Source Benchmark on TSB-AD

arXiv:2604.02445v1 Announce Type: new Abstract: Matrix Profile (MP) methods are an interpretable and scalable family of distance-based methods for time-series anomaly detection, but strong benchmark performance still depends on design choices beyond a vanilla nearest-neighbor profile. This technical report documents...

1 min 1 week, 4 days ago

LOW Academic International

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

arXiv:2604.02947v1 Announce Type: new Abstract: Computer-use agents extend language models from text generation to persistent action over tools, files, and execution environments. Unlike chat systems, they maintain state across interactions and translate intermediate outputs into concrete actions. This creates a...

1 min 1 week, 4 days ago

LOW Academic International

CharTool: Tool-Integrated Visual Reasoning for Chart Understanding

arXiv:2604.02794v1 Announce Type: new Abstract: Charts are ubiquitous in scientific and financial literature for presenting structured data. However, chart reasoning remains challenging for multimodal large language models (MLLMs) due to the lack of high-quality training data, as well as the...

1 min 1 week, 4 days ago

LOW Academic International

I must delete the evidence: AI Agents Explicitly Cover up Fraud and Violent Crime

arXiv:2604.02500v1 Announce Type: new Abstract: As ongoing research explores the ability of AI agents to be insider threats and act against company interests, we showcase the abilities of such agents to act against human well being in service of corporate...

1 min 1 week, 4 days ago

LOW Academic International

Do Audio-Visual Large Language Models Really See and Hear?

arXiv:2604.02605v1 Announce Type: new Abstract: Audio-Visual Large Language Models (AVLLMs) are emerging as unified interfaces to multimodal perception. We present the first mechanistic interpretability study of AVLLMs, analyzing how audio and visual features evolve and fuse through different layers of...

1 min 1 week, 4 days ago

LOW Academic International

Modeling and Controlling Deployment Reliability under Temporal Distribution Shift

arXiv:2604.02351v1 Announce Type: new Abstract: Machine learning models deployed in non-stationary environments are exposed to temporal distribution shift, which can erode predictive reliability over time. While common mitigation strategies such as periodic retraining and recalibration aim to preserve performance, they...

1 min 1 week, 4 days ago

LOW Academic International

Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits

arXiv:2604.02527v1 Announce Type: new Abstract: The recent advancement of Large Language Models (LLMs) offers new opportunities to generate user preference data to warm-start bandits. Recent studies on contextual bandits with LLM initialization (CBLI) have shown that these synthetic priors can...

1 min 1 week, 4 days ago

LOW Academic International

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

arXiv:2604.02340v1 Announce Type: new Abstract: Recent advances in masked diffusion language models (MDLMs) narrow the quality gap to autoregressive LMs, but their sampling remains expensive because generation requires many full-sequence denoising passes with a large Transformer and, unlike autoregressive decoding,...

1 min 1 week, 4 days ago

LOW Academic International

Valence-Arousal Subspace in LLMs: Circular Emotion Geometry and Multi-Behavioral Control

arXiv:2604.03147v1 Announce Type: new Abstract: We present a method to identify a valence-arousal (VA) subspace within large language model representations. From 211k emotion-labeled texts, we derive emotion steering vectors, then learn VA axes as linear combinations of their top PCA...

1 min 1 week, 4 days ago

LOW Academic International

Detecting and Correcting Reference Hallucinations in Commercial LLMs and Deep Research Agents

arXiv:2604.03173v1 Announce Type: new Abstract: Large language models and deep research agents supply citation URLs to support their claims, yet the reliability of these citations has not been systematically measured. We address six research questions about citation URL validity using...

1 min 1 week, 4 days ago

LOW Academic International

BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition

arXiv:2604.02904v1 Announce Type: new Abstract: In this article, we present a gold-standard benchmark dataset for Biomedical Urdu Named Entity Recognition (BioUNER), developed by crawling health-related articles from online Urdu news portals, medical prescriptions, and hospital health blogs and websites. After...

1 min 1 week, 4 days ago

LOW Academic International

Do We Need Frontier Models to Verify Mathematical Proofs?

arXiv:2604.02450v1 Announce Type: new Abstract: Advances in training, post-training, and inference-time methods have enabled frontier reasoning models to win gold medals in math competitions and settle challenging open problems. Gaining trust in the responses of these models requires that natural...

1 min 1 week, 4 days ago

LOW News International

Trump ignores biggest reasons his AI data center buildout is failing

Nearly 50% of data center projects delayed as China holds key to power infrastructure.

1 min 2 weeks ago

LOW Law Review International

The Enumerated-Rights Reading of the Privileges or Immunities Clause: A Response to Barnett and Bernick

Article by Kurt T. Lash. In 1871, John Bingham explained the meaning of the Fourteenth Amendment’s Privileges or Immunities Clause, a clause Bingham himself drafted and had...

1 min 2 weeks ago

LOW Academic International

HippoCamp: Benchmarking Contextual Agents on Personal Computers

arXiv:2604.01221v1 Announce Type: new Abstract: We present HippoCamp, a new benchmark designed to evaluate agents' capabilities on multimodal file management. Unlike existing agent benchmarks that focus on tasks like web interaction, tool use, or software automation in generic settings, HippoCamp...

1 min 2 weeks, 1 day ago

LOW Academic International

A Safety-Aware Role-Orchestrated Multi-Agent LLM Framework for Behavioral Health Communication Simulation

arXiv:2604.00249v1 Announce Type: new Abstract: Single-agent large language model (LLM) systems struggle to simultaneously support diverse conversational functions and maintain safety in behavioral health communication. We propose a safety-aware, role-orchestrated multi-agent LLM framework designed to simulate supportive behavioral health dialogue...

1 min 2 weeks, 1 day ago

LOW Academic International

RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning

arXiv:2604.00790v1 Announce Type: new Abstract: While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as competitive programming (CP), existing methods predominantly focus on single-attempt settings, overlooking their capacity for iterative refinement. In this paper, we...

1 min 2 weeks, 1 day ago
