AI & Technology Law

LOW Academic International

Cross-Modal Coreference Alignment: Enabling Reliable Information Transfer in Omni-LLMs

arXiv:2604.05522v1 Announce Type: new Abstract: Omni Large Language Models (Omni-LLMs) have demonstrated impressive capabilities in holistic multi-modal perception, yet they consistently falter in complex scenarios requiring synergistic omni-modal reasoning. Beyond understanding global multimodal context, effective reasoning also hinges on fine-grained...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

arXiv:2604.05116v1 Announce Type: new Abstract: Clinical diagnosis requires sequential evidence acquisition under uncertainty. However, most Large Language Model (LLM) based diagnostic systems assume fully observed patient information and therefore do not explicitly model how clinical evidence should be sequentially acquired...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

arXiv:2604.05083v1 Announce Type: new Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text, their outputs are often costly, and highly sensitive to prompt design, language, and aggregation strategies, severely, which limits reproducibility. To...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

PCA-Driven Adaptive Sensor Triage for Edge AI Inference

arXiv:2604.05045v1 Announce Type: new Abstract: Multi-channel sensor networks in industrial IoT often exceed available bandwidth. We propose PCA-Triage, a streaming algorithm that converts incremental PCA loadings into proportional per-channel sampling rates under a bandwidth budget. PCA-Triage runs in O(wdk) time...

1 min 1 week, 3 days ago

ai algorithm

LOW Academic International

Do Domain-specific Experts exist in MoE-based LLMs?

arXiv:2604.05267v1 Announce Type: new Abstract: In the era of Large Language Models (LLMs), the Mixture of Experts (MoE) architecture has emerged as an effective approach for training extremely large models with improved computational efficiency. This success builds upon extensive prior...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

arXiv:2604.05051v1 Announce Type: new Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and difficult to articulate clearly. However, LLMs are sensitive to prompt phrasings and can be influenced by the way questions...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

arXiv:2604.05650v1 Announce Type: new Abstract: Video Large Language Models (Video-LLMs) excel in video understanding but suffer from high inference latency during autoregressive generation. Speculative Decoding (SD) mitigates this by applying a draft-and-verify paradigm, yet existing methods are constrained by rigid...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving

arXiv:2604.05378v1 Announce Type: new Abstract: Recent progress in vision-language-action (VLA) models has enabled language-conditioned driving agents to execute natural-language navigation commands in closed-loop simulation, yet standard evaluations largely assume instructions are precise and well-formed. In deployment, instructions vary in phrasing...

1 min 1 week, 3 days ago

ai autonomous

LOW Academic International

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

arXiv:2604.05018v1 Announce Type: new Abstract: Synthesizing unstructured research materials into manuscripts is an essential yet under-explored challenge in AI-driven scientific discovery. Existing autonomous writers are rigidly coupled to specific experimental pipelines, and produce superficial literature reviews. We introduce PaperOrchestra, a...

1 min 1 week, 3 days ago

ai autonomous

LOW Academic International

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

arXiv:2604.05643v1 Announce Type: new Abstract: Extending CoT through RL has been widely used to enhance the reasoning capabilities of LLMs. However, due to the sparsity of reward signals, it can also induce undesirable thinking patterns such as overthinking, i.e., generating...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

CODESTRUCT: Code Agents over Structured Action Spaces

arXiv:2604.05407v1 Announce Type: new Abstract: LLM-based code agents treat repositories as unstructured text, applying edits through brittle string matching that frequently fails due to formatting drift or ambiguous patterns. We propose reframing the codebase as a structured action space where...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

arXiv:2604.05185v1 Announce Type: new Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning through simulated rollouts. In offline settings with hidden confounding, however, models learned directly from observational...

1 min 1 week, 3 days ago

ai bias

LOW Academic International

Bypassing the CSI Bottleneck: MARL-Driven Spatial Control for Reflector Arrays

arXiv:2604.05162v1 Announce Type: new Abstract: Reconfigurable Intelligent Surfaces (RIS) are pivotal for next-generation smart radio environments, yet their practical deployment is severely bottlenecked by the intractable computational overhead of Channel State Information (CSI) estimation. To bypass this fundamental physical-layer barrier,...

1 min 1 week, 3 days ago

ai autonomous

LOW Academic International

Improving Sparse Memory Finetuning

arXiv:2604.05248v1 Announce Type: new Abstract: Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading existing capabilities. Standard approaches to updating models, like full finetuning or parameter-efficient methods (e.g., LoRA),...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Confidence Should Be Calibrated More Than One Turn Deep

arXiv:2604.05397v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly applied in high-stakes domains such as finance, healthcare, and education, where reliable multi-turn interactions with users are essential. However, existing work on confidence estimation and calibration, a major approach...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

arXiv:2604.05242v1 Announce Type: new Abstract: Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into Large Language Model (LLM)-generated text, enabling reliable attribution and tracing of malicious usage of LLMs. Despite recent progress, existing methods still...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Multi-Agent Pathfinding with Non-Unit Integer Edge Costs via Enhanced Conflict-Based Search and Graph Discretization

arXiv:2604.05416v1 Announce Type: new Abstract: Multi-Agent Pathfinding (MAPF) plays a critical role in various domains. Traditional MAPF methods typically assume unit edge costs and single-timestep actions, which limit their applicability to real-world scenarios. MAPFR extends MAPF to handle non-unit costs...

1 min 1 week, 3 days ago

ai algorithm

LOW Academic International

Gradient-Controlled Decoding: A Safety Guardrail for LLMs with Dual-Anchor Steering

arXiv:2604.05179v1 Announce Type: new Abstract: Large language models (LLMs) remain susceptible to jailbreak and direct prompt-injection attacks, yet the strongest defensive filters frequently over-refuse benign queries and degrade user experience. Previous work on jailbreak and prompt injection detection such as...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

LLMs Should Express Uncertainty Explicitly

arXiv:2604.05306v1 Announce Type: new Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstention, retrieval, and verification. Most existing methods treat uncertainty as a latent quantity to estimate after generation rather than a...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

arXiv:2604.05438v1 Announce Type: new Abstract: Long-context generation is increasingly limited by decode-time key-value (KV) cache traffic, particularly when KV is offloaded beyond GPU memory. Query-aware retrieval (e.g., Top-K selection) reduces this traffic by loading only a subset of KV pairs,...

1 min 1 week, 3 days ago

ai bias

LOW Academic International

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

arXiv:2604.05414v1 Announce Type: new Abstract: Recent work has shown that removing orthogonalization during training and applying it only at inference improves rotation estimation in deep learning, with empirical evidence favoring 9D representations with SVD projection. However, the theoretical understanding of...

1 min 1 week, 3 days ago

ai deep learning

LOW Academic International

$\pi^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models

arXiv:2604.05114v1 Announce Type: new Abstract: We study a pipeline that curates reasoning data from initial structured data for improving long-context reasoning in large language models (LLMs). Our approach, $\pi^2$, constructs high-quality reasoning data through rigorous QA curation: 1) extracting and...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

arXiv:2604.05168v1 Announce Type: new Abstract: Leadership-class HPC systems generate massive volumes of heterogeneous, largely unstructured system logs. Because these logs originate from diverse software, hardware, and runtime layers, they exhibit inconsistent formats, making structure extraction and pattern discovery extremely challenging....

1 min 1 week, 3 days ago

ai llm

LOW Academic International

ActivityEditor: Learning to Synthesize Physically Valid Human Mobility

arXiv:2604.05529v1 Announce Type: new Abstract: Human mobility modeling is indispensable for diverse urban applications. However, existing data-driven methods often suffer from data scarcity, limiting their applicability in regions where historical trajectories are unavailable or restricted. To bridge this gap, we...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

DualDiffusion: A Speculative Decoding Strategy for Masked Diffusion Models

arXiv:2604.05250v1 Announce Type: new Abstract: Masked Diffusion Models (MDMs) offer a promising alternative to autoregressive language models by enabling parallel token generation and bidirectional context modeling. However, their inference speed is significantly limited by the inability to cache key-value pairs...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Content Fuzzing for Escaping Information Cocoons on Digital Social Media

arXiv:2604.05461v1 Announce Type: new Abstract: Information cocoons on social media limit users' exposure to posts with diverse viewpoints. Modern platforms use stance detection as an important signal in recommendation and ranking pipelines, which can route posts primarily to like-minded audiences...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: new Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-context and long-generation regimes. Architectures such as multi-head latent attention (MLA) and hybrid sliding-window attention (SWA) can alleviate this bound, but...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

arXiv:2604.05540v1 Announce Type: new Abstract: Large language models (LLMs) can effectively handle outdated information through knowledge editing. However, current approaches face two key limitations: (I) Poor generalization: Most approaches rigidly inject new knowledge without ensuring that the model can use...

1 min 1 week, 3 days ago

ai llm

LOW Academic International

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

arXiv:2604.05546v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) enable sophisticated reasoning over images and videos, yet their inference is hindered by a systemic efficiency barrier known as visual token dominance. This overhead is driven by a multi-regime interplay between...

1 min 1 week, 3 days ago

ai algorithm

LOW Academic International

HYVE: Hybrid Views for LLM Context Engineering over Machine Data

arXiv:2604.05400v1 Announce Type: new Abstract: Machine data is central to observability and diagnosis in modern computing systems, appearing in logs, metrics, telemetry traces, and configuration snapshots. When provided to large language models (LLMs), this data typically arrives as a mixture...

1 min 1 week, 3 days ago

ai llm

Cross-Modal Coreference Alignment: Enabling Reliable Information Transfer in Omni-LLMs

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

PCA-Driven Adaptive Sensor Triage for Edge AI Inference

Do Domain-specific Experts exist in MoE-based LLMs?

This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

CODESTRUCT: Code Agents over Structured Action Spaces

Cross-fitted Proximal Learning for Model-Based Reinforcement Learning

Bypassing the CSI Bottleneck: MARL-Driven Spatial Control for Reflector Arrays

Improving Sparse Memory Finetuning

Confidence Should Be Calibrated More Than One Turn Deep

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

Multi-Agent Pathfinding with Non-Unit Integer Edge Costs via Enhanced Conflict-Based Search and Graph Discretization

Gradient-Controlled Decoding: A Safety Guardrail for LLMs with Dual-Anchor Steering

LLMs Should Express Uncertainty Explicitly

Top-K Retrieval with Fixed-Size Linear-Attention Completion: Backbone- and KV-Format-Preserving Attention for KV-Cache Read Reduction

Training Without Orthogonalization, Inference With SVD: A Gradient Analysis of Rotation Representations

$\pi^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

ActivityEditor: Learning to Synthesize Physically Valid Human Mobility

DualDiffusion: A Speculative Decoding Strategy for Masked Diffusion Models

Content Fuzzing for Escaping Information Cocoons on Digital Social Media

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

HYVE: Hybrid Views for LLM Context Engineering over Machine Data

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.