Immigration Law

LOW Academic International

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

arXiv:2604.06650v1 Announce Type: new Abstract: Existing prompt-based fine-tuning methods typically learn task-specific prompts independently, imposing significant computing and storage overhead at scale when deploying multiple clinical natural language processing (NLP) systems. We present a multitask prompt distillation and decomposition framework...

1 min 1 week, 2 days ago

ead

LOW News United States

State election dispute on political speech comes to Supreme Court on interim docket

Lawyers for Ohio Secretary of State Frank LaRose, as well as county election officials, urged the Supreme Court on Wednesday to let them go ahead with a ballot that does […]The postState election dispute on political speech comes to Supreme...

1 min 1 week, 2 days ago

ead

LOW Academic European Union

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

arXiv:2604.06207v1 Announce Type: new Abstract: This paper investigates demonstration selection strategies for predicting a user's next point-of-interest (POI) using large language models (LLMs), aiming to accurately forecast a user's subsequent location based on historical check-in data. While in-context learning (ICL)...

1 min 1 week, 2 days ago

tps

LOW Academic International

Distributed Interpretability and Control for Large Language Models

arXiv:2604.06483v1 Announce Type: new Abstract: Large language models that require multiple GPU cards to host are usually the most capable models. It is necessary to understand and steer these models, but the current technologies do not support the interpretability and...

1 min 1 week, 2 days ago

tps

LOW Academic International

In-Context Learning in Speech Language Models: Analyzing the Role of Acoustic Features, Linguistic Structure, and Induction Heads

arXiv:2604.06356v1 Announce Type: new Abstract: In-Context Learning (ICL) has been extensively studied in text-only Language Models, but remains largely unexplored in the speech domain. Here, we investigate how linguistic and acoustic features affect ICL in Speech Language Models. We focus...

1 min 1 week, 2 days ago

ead

LOW Academic European Union

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

arXiv:2604.06267v1 Announce Type: new Abstract: Multimodal variational autoencoders (VAEs) have emerged as a powerful framework for survival risk modeling in multiple myeloma by integrating heterogeneous omics and clinical data. However, when trained under survival supervision, standard latent regularization strategies often...

1 min 1 week, 2 days ago

ead

LOW Academic European Union

Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

arXiv:2604.06492v1 Announce Type: new Abstract: We study stochastic convex optimization (SCO) with heavy-tailed gradients under pure epsilon-differential privacy (DP). Instead of assuming a bound on the worst-case Lipschitz parameter of the loss, we assume only a bounded k-th moment. This...

1 min 1 week, 2 days ago

ead

LOW Academic United States

A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset

arXiv:2604.06227v1 Announce Type: new Abstract: Accurate short-term forecasting of agricultural commodity prices is critical for food security planning and smallholder income stabilisation in developing economies, yet machine-learning-ready datasets for this purpose remain scarce in South Asia. This paper makes two...

1 min 1 week, 2 days ago

ead

LOW Academic United States

Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models

arXiv:2604.06201v1 Announce Type: new Abstract: While most reading comprehension benchmarks for LLMs focus on factual information that can be answered by localizing specific textual evidence, many real-world tasks require understanding distributional information, such as population-level trends and preferences expressed across...

1 min 1 week, 2 days ago

ead

LOW Academic International

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolves in a language model -- from supervised fine-tuning (SFT) to reinforcement learning (RL) --...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

arXiv:2604.05195v1 Announce Type: new Abstract: Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Towards Scaling Law Analysis For Spatiotemporal Weather Data

arXiv:2604.05068v1 Announce Type: new Abstract: Compute-optimal scaling laws are relatively well studied for NLP and CV, where objectives are typically single-step and targets are comparatively homogeneous. Weather forecasting is harder to characterize in the same framework: autoregressive rollouts compound errors...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Towards Effective In-context Cross-domain Knowledge Transfer via Domain-invariant-neurons-based Retrieval

arXiv:2604.05383v1 Announce Type: new Abstract: Large language models (LLMs) have made notable progress in logical reasoning, yet still fall short of human-level performance. Current boosting strategies rely on expert-crafted in-domain demonstrations, limiting their applicability in expertise-scarce domains, such as specialized...

1 min 1 week, 3 days ago

tps

LOW Academic International

Do Domain-specific Experts exist in MoE-based LLMs?

arXiv:2604.05267v1 Announce Type: new Abstract: In the era of Large Language Models (LLMs), the Mixture of Experts (MoE) architecture has emerged as an effective approach for training extremely large models with improved computational efficiency. This success builds upon extensive prior...

1 min 1 week, 3 days ago

tps

LOW Academic International

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

arXiv:2604.05303v1 Announce Type: new Abstract: Sampling physical systems with rough energy landscapes is hindered by rare events and metastable trapping. While Boltzmann generators already offer a solution, their reliance on the reverse Kullback--Leibler divergence frequently induces catastrophic mode collapse, missing...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

arXiv:2604.04988v1 Announce Type: new Abstract: Modern deployment often requires trading accuracy for efficiency under tight CPU and memory constraints, yet common compression proxies such as parameter count or FLOPs do not reliably predict wall-clock inference time. In particular, unstructured sparsity...

1 min 1 week, 3 days ago

ead

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 3 days ago

ead

LOW Academic European Union

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

arXiv:2604.05923v1 Announce Type: new Abstract: State space models (SSMs) have been shown to possess the theoretical capacity to model both star-free sequential tasks and bounded hierarchical structures Sarrof et al. (2024). However, formal expressivity results do not guarantee that gradient-based...

1 min 1 week, 3 days ago

ead

LOW Academic International

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

arXiv:2604.05358v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) mitigates hallucination but does not eliminate it: a deployed system must still decide, at inference time, whether its answer is actually supported by the retrieved evidence. We introduce LatentAudit, a white-box auditor...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Attribution Bias in Large Language Models

arXiv:2604.05224v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly used to support search and information retrieval, it is critical that they accurately attribute content to its original authors. In this work, we introduce AttriBench, the first fame-...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models

arXiv:2604.05497v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) are emerging as promising alternatives to autoregressive (AR) LLMs. Recently, this paradigm has been extended to multimodal tasks, leading to the development of diffusion multimodal large language models (dMLLMs). These...

1 min 1 week, 3 days ago

ead

LOW Academic United States

DIA-HARM: Dialectal Disparities in Harmful Content Detection Across 50 English Dialects

arXiv:2604.05318v1 Announce Type: new Abstract: Harmful content detectors-particularly disinformation classifiers-are predominantly developed and evaluated on Standard American English (SAE), leaving their robustness to dialectal variation unexplored. We present DIA-HARM, the first benchmark for evaluating disinformation detection robustness across 50 English...

1 min 1 week, 3 days ago

tps

LOW Academic International

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

arXiv:2604.05540v1 Announce Type: new Abstract: Large language models (LLMs) can effectively handle outdated information through knowledge editing. However, current approaches face two key limitations: (I) Poor generalization: Most approaches rigidly inject new knowledge without ensuring that the model can use...

1 min 1 week, 3 days ago

tps

LOW Academic International

CODESTRUCT: Code Agents over Structured Action Spaces

arXiv:2604.05407v1 Announce Type: new Abstract: LLM-based code agents treat repositories as unstructured text, applying edits through brittle string matching that frequently fails due to formatting drift or ambiguous patterns. We propose reframing the codebase as a structured action space where...

1 min 1 week, 3 days ago

ead

LOW Academic United States

Expectation Maximization (EM) Converges for General Agnostic Mixtures

arXiv:2604.05842v1 Announce Type: new Abstract: Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using $k$ linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground...

1 min 1 week, 3 days ago

ead

LOW Academic International

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

arXiv:2604.05168v1 Announce Type: new Abstract: Leadership-class HPC systems generate massive volumes of heterogeneous, largely unstructured system logs. Because these logs originate from diverse software, hardware, and runtime layers, they exhibit inconsistent formats, making structure extraction and pattern discovery extremely challenging....

1 min 1 week, 3 days ago

ead

LOW Academic International

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: new Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-context and long-generation regimes. Architectures such as multi-head latent attention (MLA) and hybrid sliding-window attention (SWA) can alleviate this bound, but...

1 min 1 week, 3 days ago

ead

LOW Academic International

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

arXiv:2604.05557v1 Announce Type: new Abstract: Scientific research follows multi-turn, multi-step workflows that require proactively searching the literature, consulting figures and tables, and integrating evidence across papers to align experimental settings and support reproducible conclusions. This joint capability is not systematically...

1 min 1 week, 3 days ago

ead

LOW Academic International

MMORF: A Multi-agent Framework for Designing Multi-objective Retrosynthesis Planning Systems

arXiv:2604.05075v1 Announce Type: new Abstract: Multi-objective retrosynthesis planning is a critical chemistry task requiring dynamic balancing of quality, safety, and cost objectives. Language model-based multi-agent systems (MAS) offer a promising approach for this task: leveraging interactions of specialized agents to...

1 min 1 week, 3 days ago

tps

LOW Academic European Union

Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors

arXiv:2604.05165v1 Announce Type: new Abstract: Reconfigurable Intelligent Surfaces (RIS) has a potential to engineer smart radio environments for next-generation millimeter-wave (mmWave) networks. However, the prohibitive computational overhead of Channel State Information (CSI) estimation and the dimensionality explosion inherent in centralized...

1 min 1 week, 3 days ago

ead

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

State election dispute on political speech comes to Supreme Court on interim docket

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

Distributed Interpretability and Control for Large Language Models

In-Context Learning in Speech Language Models: Analyzing the Role of Acoustic Features, Linguistic Structure, and Induction Heads

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset

Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Towards Scaling Law Analysis For Spatiotemporal Weather Data

Towards Effective In-context Cross-domain Knowledge Transfer via Domain-invariant-neurons-based Retrieval

Do Domain-specific Experts exist in MoE-based LLMs?

Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

The UNDO Flip-Flop: A Controlled Probe for Reversible Semantic State Management in State Space Model

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

Attribution Bias in Large Language Models

Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models

DIA-HARM: Dialectal Disparities in Harmful Content Detection Across 50 English Dialects

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting

CODESTRUCT: Code Agents over Structured Action Spaces

Expectation Maximization (EM) Converges for General Agnostic Mixtures

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents

MMORF: A Multi-agent Framework for Designing Multi-objective Retrosynthesis Planning Systems

Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.