Understanding the Challenges in Iterative Generative Optimization with LLMs
arXiv:2603.23994v1 Announce Type: new Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, workflows, or prompts) using execution feedback. It is a promising approach to building self-improving agents, yet it remains brittle in practice: despite...
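For intuition, the generate-execute-refine loop at the heart of generative optimization can be sketched as below; llm_improve and run_artifact are hypothetical stand-ins for an LLM call and an execution harness, not interfaces from the paper.

    # Minimal sketch of an iterative generative-optimization loop.
    # `llm_improve` and `run_artifact` are hypothetical stand-ins, not
    # the paper's interfaces.
    def optimize(artifact: str, llm_improve, run_artifact, max_iters: int = 10) -> str:
        best, best_score = artifact, float("-inf")
        for _ in range(max_iters):
            score, feedback = run_artifact(artifact)  # execute, collect feedback
            if score > best_score:
                best, best_score = artifact, score
            artifact = llm_improve(artifact, feedback)  # revise given feedback
        return best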
Lagrangian Relaxation Score-based Generation for Mixed Integer Linear Programming
arXiv:2603.24033v1 Announce Type: new Abstract: Predict-and-search (PaS) methods have shown promise for accelerating mixed-integer linear programming (MILP) solving. However, existing approaches typically assume variable independence and rely on deterministic single-point predictions, which limits solution diversity and often necessitates extensive downstream search...
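For background, the usual predict-and-search recipe (the baseline this paper revisits, not its proposed method) thresholds predicted per-variable marginals to fix confident binaries and leaves the rest to the solver; a rough sketch with hypothetical inputs:

    # Rough sketch of the standard PaS step: fix binary variables whose
    # predicted marginal probability is near 0 or 1, leave the rest free
    # for the solver's search. `marginals` is a hypothetical model output.
    def partial_assignment(marginals: dict[str, float], threshold: float = 0.9) -> dict[str, int]:
        fixed = {}
        for var, p in marginals.items():
            if p >= threshold:
                fixed[var] = 1
            elif p <= 1.0 - threshold:
                fixed[var] = 0
        return fixed

    partial_assignment({"x1": 0.97, "x2": 0.50, "x3": 0.02})
    # -> {"x1": 1, "x3": 0}; x2 is left to downstream search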
The AI skills gap is here, says AI company, and power users are pulling ahead
Anthropic finds AI isn’t replacing jobs yet, but early data shows growing inequality as experienced users gain an edge, raising concerns about future displacement and workforce divides.
Lucid Bots raises $20M to keep up with demand for its window-washing drones
Lucid Bots has seen demand accelerate over the last year for its window-cleaning drones and power-washing robots.
RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue
arXiv:2603.23346v1 Announce Type: new Abstract: Real-time spoken dialogue systems face a fundamental tension between latency and response quality. End-to-end speech-to-speech (S2S) models respond immediately and naturally handle turn-taking, backchanneling, and interruption, but produce semantically weaker outputs. Cascaded pipelines (ASR ->...
CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models
arXiv:2603.22846v1 Announce Type: new Abstract: Embodied Visual Tracking (EVT), a core dynamic task in embodied intelligence, requires an agent to precisely follow a language-specified target. Yet most existing methods rely on single-agent imitation learning, suffering from costly expert data and...
Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning
arXiv:2603.22942v1 Announce Type: new Abstract: Translating Natural Language to SQL (NL2SQL) remains a critical bottleneck for the democratization of data in enterprises. Although Large Language Models (LLMs) such as Gemini 2.5 have demonstrated impressive zero-shot capabilities, their high inference...
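The fine-tuning format the title implies pairs a question and schema with an intermediate reasoning trace before the SQL; a hypothetical example record (field names are illustrative, not the paper's schema):

    # Hypothetical shape of one chain-of-thought fine-tuning example for
    # NL2SQL; field names are illustrative, not the paper's schema.
    example = {
        "question": "How many orders were placed in 2024?",
        "schema": "orders(id INT, placed_at DATE, total REAL)",
        "chain_of_thought": "The question asks for a row count over the "
                            "orders table restricted to 2024, so filter "
                            "on placed_at before counting.",
        "sql": "SELECT COUNT(*) FROM orders "
               "WHERE placed_at BETWEEN '2024-01-01' AND '2024-12-31';",
    }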
MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models
arXiv:2603.23085v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have enabled interpretable medical diagnosis by integrating visual perception with linguistic reasoning. Yet, existing medical chain-of-thought (CoT) models lack explicit mechanisms to represent and enforce causal reasoning, leaving them vulnerable to spurious...
MuQ-Eval: An Open-Source Per-Sample Quality Metric for AI Music Generation Evaluation
arXiv:2603.22677v1 Announce Type: new Abstract: Distributional metrics such as Fréchet Audio Distance cannot score individual music clips and correlate poorly with human judgments, while the only per-sample learned metric achieving high human correlation is closed-source. We introduce MUQ-EVAL, an open-source...
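For reference, Fréchet Audio Distance compares Gaussian fits of two sets of clip embeddings, which is exactly why it cannot score a single clip; the standard computation (the baseline metric, not MuQ-Eval's) looks like this:

    # Standard Fréchet Audio Distance: compares Gaussian fits of two
    # *sets* of clip embeddings, hence no per-sample score. This is the
    # baseline metric, not MuQ-Eval's.
    import numpy as np
    from scipy.linalg import sqrtm

    def fad(emb_real: np.ndarray, emb_gen: np.ndarray) -> float:
        mu_r, mu_g = emb_real.mean(axis=0), emb_gen.mean(axis=0)
        cov_r = np.cov(emb_real, rowvar=False)
        cov_g = np.cov(emb_gen, rowvar=False)
        covmean = sqrtm(cov_r @ cov_g).real  # matrix square root
        return float(((mu_r - mu_g) ** 2).sum()
                     + np.trace(cov_r + cov_g - 2.0 * covmean))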
Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment
arXiv:2603.23114v1 Announce Type: new Abstract: A human's moral decision depends heavily on the context. Yet research on LLM morality has largely studied fixed scenarios. We address this gap by introducing Contextual MoralChoice, a dataset of moral dilemmas with systematic contextual...
The Efficiency Attenuation Phenomenon: A Computational Challenge to the Language of Thought Hypothesis
arXiv:2603.22312v1 Announce Type: new Abstract: This paper computationally investigates whether thought requires a language-like format, as posited by the Language of Thought (LoT) hypothesis. We introduce the "AI Private Language" thought experiment: if two artificial agents develop an efficient, inscrutable...
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
arXiv:2603.22608v1 Announce Type: new Abstract: Users often rely on Large Language Models (LLMs) for processing multiple documents or performing analysis over a number of instances. For example, analysing the overall sentiment of a set of movie reviews requires an LLM...
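The setup under study contrasts packing many instances into one context against one call per instance; a hypothetical sketch, with llm standing in for a model call:

    # Two ways to process N instances; the paper studies how quality
    # changes with instance count and context length in the joint
    # setting. `llm` is a hypothetical model-call function.
    def classify_jointly(llm, reviews: list[str]) -> str:
        numbered = "\n".join(f"{i + 1}. {r}" for i, r in enumerate(reviews))
        return llm("Classify the sentiment of each review:\n" + numbered)

    def classify_separately(llm, reviews: list[str]) -> list[str]:
        return [llm("Classify the sentiment of this review:\n" + r)
                for r in reviews]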
TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs
arXiv:2603.22293v1 Announce Type: new Abstract: Search-augmented large language models (LLMs) trained with reinforcement learning (RL) have achieved strong results on open-domain question answering (QA), but training remains a significant challenge. The optimization is often unstable due to sparse rewards...
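The title suggests potential-based shaping; in the classic form (Ng, Harada, and Russell, 1999), a term F(s, s') = gamma * Phi(s') - Phi(s) is added to the sparse task reward without changing the optimal policy. A turn-level sketch, with phi a hypothetical information-potential function:

    # Classic potential-based shaping applied per dialogue/search turn:
    # shaped reward = task reward + gamma * phi(s') - phi(s).
    # `phi` is a hypothetical information-potential function; the
    # paper's own shaping design may differ.
    def shaped_turn_reward(phi, state, next_state, task_reward: float,
                           gamma: float = 1.0) -> float:
        return task_reward + gamma * phi(next_state) - phi(state)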
MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing
arXiv:2603.22289v1 Announce Type: new Abstract: Knowledge Tracing (KT) models students' evolving knowledge states to predict future performance, serving as a foundation for personalized education. While traditional deep learning models achieve high accuracy, they often lack interpretability. Large Language Models (LLMs)...
How Utilitarian Are OpenAI's Models Really? Replicating and Reinterpreting Pfeffer, Krügel, and Uhl (2025)
arXiv:2603.22730v1 Announce Type: new Abstract: Pfeffer, Krügel, and Uhl (2025) report that OpenAI's reasoning model o1-mini produces more utilitarian responses to the trolley problem and footbridge dilemma than the non-reasoning model GPT-4o. I replicate their study with four current OpenAI...
Explanation Generation for Contradiction Reconciliation with LLMs
arXiv:2603.22735v1 Announce Type: new Abstract: Existing NLP work commonly treats contradictions as errors to be resolved by choosing which statements to accept or discard. Yet a key aspect of human reasoning in social interactions and professional domains is the ability...
LLM Olympiad: Why Model Evaluation Needs a Sealed Exam
arXiv:2603.23292v1 Announce Type: new Abstract: Benchmarks and leaderboards are how NLP most often communicates progress, but in the LLM era they are increasingly easy to misread. Scores can reflect benchmark-chasing, hidden evaluation choices, or accidental exposure to test content --...
Towards Automated Community Notes Generation with Large Vision Language Models for Combating Contextual Deception
arXiv:2603.22453v1 Announce Type: new Abstract: Community Notes have emerged as an effective crowd-sourced mechanism for combating online deception on social media platforms. However, their reliance on human contributors limits both timeliness and scalability. In this work, we study the...
Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
arXiv:2603.22473v1 Announce Type: new Abstract: Hybrid language models combining attention with state space models (SSMs) or linear attention offer improved efficiency, but whether both components are genuinely utilized remains unclear. We present a functional component ablation framework applied to two...
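A common way to ablate a component functionally is to silence its output with a forward hook and measure the downstream drop; a minimal PyTorch sketch (the module path in the usage note is hypothetical):

    # Functional ablation via a PyTorch forward hook: replace a
    # submodule's output with zeros and re-run evaluation.
    import torch
    from torch import nn

    def ablate(module: nn.Module):
        """Zero a submodule's output; returns the hook handle."""
        def zero_output(mod, inputs, output):
            if isinstance(output, tuple):
                return (torch.zeros_like(output[0]),) + output[1:]
            return torch.zeros_like(output)
        return module.register_forward_hook(zero_output)

    # usage (hypothetical module path):
    #   handle = ablate(model.layers[3].ssm)
    #   ... evaluate with the SSM branch silenced ...
    #   handle.remove()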
Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics
arXiv:2603.22709v1 Announce Type: new Abstract: Conversational automatic speech recognition remains challenging due to overlapping speech, far-field noise, and varying speaker counts. While recent LLM-based systems perform well on single-speaker benchmarks, their robustness in multi-speaker settings is unclear. We systematically compare...
CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context
arXiv:2603.22576v1 Announce Type: new Abstract: We introduce CAPITU, a benchmark for evaluating instruction-following capabilities of Large Language Models (LLMs) in Brazilian Portuguese. Unlike existing benchmarks that focus on English or use generic prompts, CAPITU contextualizes all tasks within eight canonical...
PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal
arXiv:2603.22844v1 Announce Type: new Abstract: Surgical smoke severely degrades intraoperative video quality, obscuring anatomical structures and limiting surgical perception. Existing learning-based desmoking approaches rely on scarce paired supervision and deterministic restoration pipelines, making it difficult to perform exploration or reinforcement-driven...
Less is More: Adapting Text Embeddings for Low-Resource Languages with Small Scale Noisy Synthetic Data
arXiv:2603.22290v1 Announce Type: new Abstract: Low-resource languages (LRLs) often lack high-quality, large-scale datasets for training effective text embedding models, hindering their application in tasks like retrieval-augmented generation (RAG) and semantic search. In this work, we challenge the prevailing assumption that...
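A minimal version of such adaptation, assuming the sentence-transformers training API, with the pair below standing in for LLM-generated synthetic data:

    # Hedged sketch: adapt a multilingual embedding model on a small set
    # of synthetic pairs with a contrastive loss (sentence-transformers
    # API; the pair below is a stand-in for LLM-generated data).
    from torch.utils.data import DataLoader
    from sentence_transformers import SentenceTransformer, InputExample, losses

    model = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")
    pairs = [InputExample(texts=["query in the target language",
                                 "relevant passage in the target language"])]
    loader = DataLoader(pairs, shuffle=True, batch_size=16)
    loss = losses.MultipleNegativesRankingLoss(model)
    model.fit(train_objectives=[(loader, loss)], epochs=1, warmup_steps=10)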
Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning
arXiv:2603.22497v1 Announce Type: new Abstract: While there is growing interest in in-context language learning (ICLL) with large language models for unseen languages, such languages usually suffer from a lack of NLP tools, data resources, and researcher expertise. This means that...
Memory Bear AI Memory Science Engine for Multimodal Affective Intelligence: A Technical Report
arXiv:2603.22306v1 Announce Type: new Abstract: Affective judgment in real interaction is rarely a purely local prediction problem. Emotional meaning often depends on prior trajectory, accumulated context, and multimodal evidence that may be weak, noisy, or incomplete at the current moment....
Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic
arXiv:2603.22877v1 Announce Type: new Abstract: Efficient solutions for satisfiability modulo theories (SMT) are integral in industrial applications such as hardware verification and design automation. Existing approaches are predominantly based on conflict-driven clause learning, which is structurally difficult to parallelize and...
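The core idea of a continuous formulation for linear real arithmetic can be illustrated with hinge penalties: a conjunction of linear constraints is satisfiable exactly when the total penalty can be driven to zero. An illustrative sketch, not the paper's algorithm:

    # Continuous view of LRA satisfiability: each constraint a.x <= b
    # becomes a hinge penalty max(0, a.x - b); a point with zero total
    # penalty satisfies the conjunction. Illustrative only.
    import numpy as np

    def total_penalty(x: np.ndarray, A: np.ndarray, b: np.ndarray) -> float:
        return float(np.maximum(A @ x - b, 0.0).sum())

    # Gradient-style local search can then minimize total_penalty;
    # hitting zero yields a model of the conjunction of constraints.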
Improving LLM Predictions via Inter-Layer Structural Encoders
arXiv:2603.22665v1 Announce Type: new Abstract: The standard practice in Large Language Models (LLMs) is to base predictions on the final-layer token representations. Recent studies, however, show that intermediate layers encode substantial information, which may contain more task-relevant features than the...
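The premise here is easy to inspect with any transformers checkpoint, since every layer's hidden states can be returned alongside the final one; the model name below is just an example:

    # Inspecting intermediate-layer representations with the HuggingFace
    # transformers API; "gpt2" is only an example checkpoint.
    from transformers import AutoModel, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModel.from_pretrained("gpt2", output_hidden_states=True)

    out = model(**tok("hello world", return_tensors="pt"))
    # out.hidden_states is a tuple of (num_layers + 1) tensors, with the
    # embedding layer first; a middle layer is one candidate input for
    # prediction heads instead of the final layer.
    mid = out.hidden_states[len(out.hidden_states) // 2]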
Improving Safety Alignment via Balanced Direct Preference Optimization
arXiv:2603.22829v1 Announce Type: new Abstract: With the rapid development and widespread application of Large Language Models (LLMs), their potential safety risks have attracted considerable attention. Reinforcement Learning from Human Feedback (RLHF) has been adopted to enhance the safety performance of...
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
arXiv:2603.22446v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly improved reasoning in large language models (LLMs), yet the token-level mechanisms underlying these improvements remain unclear. We present a systematic empirical study of RLVR's distributional effects organized...
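One concrete instrument for such a study is the per-token KL divergence between the base and RLVR-tuned policies on the same context; a sketch assuming two causal LMs with a shared vocabulary:

    # Per-token distributional shift between a base and an RLVR-tuned
    # model, measured as KL(tuned || base) at each position. Assumes
    # causal LMs with a shared tokenizer/vocabulary.
    import torch
    import torch.nn.functional as F

    @torch.no_grad()
    def per_token_kl(base_model, tuned_model, input_ids: torch.Tensor) -> torch.Tensor:
        logp_base = F.log_softmax(base_model(input_ids).logits, dim=-1)
        logp_tuned = F.log_softmax(tuned_model(input_ids).logits, dim=-1)
        return (logp_tuned.exp() * (logp_tuned - logp_base)).sum(dim=-1)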
KALAVAI: Predicting When Independent Specialist Fusion Works -- A Quantitative Model for Post-Hoc Cooperative LLM Training
arXiv:2603.22755v1 Announce Type: new Abstract: Independently trained domain specialists can be fused post-hoc into a single model that outperforms any individual specialist, and the gain is predictable: gain = 0.82 x divergence - 2.72 (R^2 = 0.856, n=6, 3-26% divergence)....
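The fitted model quoted in the abstract can be applied directly; per its own numbers, fusion breaks even near 3.3% divergence:

    # The abstract's fitted linear model, applied directly: predicted
    # fusion gain as a function of specialist divergence (in percent,
    # within the stated 3-26% fit range; R^2 = 0.856, n = 6).
    def predicted_gain(divergence_pct: float) -> float:
        return 0.82 * divergence_pct - 2.72

    predicted_gain(10.0)  # -> 5.48; break-even at 2.72 / 0.82 ~ 3.3%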