Labor & Employment

LOW Academic International

Learning to Interrupt in Language-based Multi-agent Communication

arXiv:2604.06452v1 Announce Type: new Abstract: Multi-agent systems using large language models (LLMs) have demonstrated impressive capabilities across various domains. However, current agent communication suffers from verbose output that overload context and increase computational costs. Although existing approaches focus on compressing...

1 min 1 week, 1 day ago

ada

LOW Academic International

The Detection--Extraction Gap: Models Know the Answer Before They Can Say It

arXiv:2604.06613v1 Announce Type: new Abstract: Modern reasoning models continue generating long after the answer is already determined. Across five model configurations, two families, and three benchmarks, we find that \textbf{52--88\% of chain-of-thought tokens are produced after the answer is recoverable}...

1 min 1 week, 1 day ago

ada

LOW Academic International

Feedback Adaptation for Retrieval-Augmented Generation

arXiv:2604.06647v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems are typically evaluated under static assumptions, despite being frequently corrected through user or expert feedback in deployment. Existing evaluation protocols focus on overall accuracy and fail to capture how systems adapt...

1 min 1 week, 1 day ago

ada

LOW Academic International

Cross-Lingual Transfer and Parameter-Efficient Adaptation in the Turkic Language Family: A Theoretical Framework for Low-Resource Language Models

arXiv:2604.06202v1 Announce Type: new Abstract: Large language models (LLMs) have transformed natural language processing, yet their capabilities remain uneven across languages. Most multilingual models are trained primarily on high-resource languages, leaving many languages with large speaker populations underrepresented in both...

1 min 1 week, 1 day ago

ada

LOW Academic International

GraphWalker: Graph-Guided In-Context Learning for Clinical Reasoning on Electronic Health Records

arXiv:2604.06684v1 Announce Type: new Abstract: Clinical Reasoning on Electronic Health Records (EHRs) is a fundamental yet challenging task in modern healthcare. While in-context learning (ICL) offers a promising inference-time adaptation paradigm for large language models (LLMs) in EHR reasoning, existing...

1 min 1 week, 1 day ago

ada

LOW Academic International

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

arXiv:2604.06421v1 Announce Type: new Abstract: This paper introduces Arabic-DeepSeek-R1, an application-driven open-source Arabic LLM that leverages a sparse MoE backbone to address the digital equity gap for under-represented languages, and establishes a new SOTA across the entire Open Arabic LLM...

1 min 1 week, 1 day ago

ada

LOW Academic International

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

arXiv:2604.06260v1 Announce Type: new Abstract: Test-time scaling investigates whether a fixed diffusion language model (DLM) can generate better outputs when given more inference compute, without additional training. However, naive best-of-$K$ sampling is fundamentally limited because it repeatedly draws from the...

1 min 1 week, 1 day ago

ada

LOW Academic International

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

arXiv:2604.06650v1 Announce Type: new Abstract: Existing prompt-based fine-tuning methods typically learn task-specific prompts independently, imposing significant computing and storage overhead at scale when deploying multiple clinical natural language processing (NLP) systems. We present a multitask prompt distillation and decomposition framework...

1 min 1 week, 1 day ago

ada

LOW Academic International

The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models

arXiv:2604.06374v1 Announce Type: new Abstract: Latent reasoning via continuous chain-of-thoughts (Latent CoT) has emerged as a promising alternative to discrete CoT reasoning. Operating in continuous space increases expressivity and has been hypothesized to enable superposition: the ability to maintain multiple...

1 min 1 week, 1 day ago

ada

LOW Academic International

SensorPersona: An LLM-Empowered System for Continual Persona Extraction from Longitudinal Mobile Sensor Streams

arXiv:2604.06204v1 Announce Type: new Abstract: Personalization is essential for Large Language Model (LLM)-based agents to adapt to users' preferences and improve response quality and task performance. However, most existing approaches infer personas from chat histories, which capture only self-disclosed information...

1 min 1 week, 1 day ago

ada

LOW Academic International

TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning

arXiv:2604.06610v1 Announce Type: new Abstract: Decentralised online learning enables runtime adaptation in cyber-physical multi-agent systems, but when operating conditions change, learned policies often require substantial trial-and-error interaction before recovering performance. To address this, we propose TwinLoop, a simulation-in-the-loop digital twin...

1 min 1 week, 1 day ago

ada

LOW Academic International

AE-ViT: Stable Long-Horizon Parametric Partial Differential Equations Modeling

arXiv:2604.06475v1 Announce Type: new Abstract: Deep Learning Reduced Order Models (ROMs) are becoming increasingly popular as surrogate models for parametric partial differential equations (PDEs) due to their ability to handle high-dimensional data, approximate highly nonlinear mappings, and utilize GPUs. Existing...

1 min 1 week, 1 day ago

ada

LOW Academic International

Extracting Breast Cancer Phenotypes from Clinical Notes: Comparing LLMs with Classical Ontology Methods

arXiv:2604.06208v1 Announce Type: new Abstract: A significant amount of data held in Oncology Electronic Medical Records (EMRs) is contained in unstructured provider notes -- including but not limited to the chemotherapy (or cancer treatment) outcome, different biomarkers, the tumor's location,...

1 min 1 week, 1 day ago

ada

LOW Academic International

TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models

arXiv:2604.06291v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) enables parameter-efficient fine-tuning of Large Language Models (LLMs), and recent Mixture-of-Experts (MoE) extensions further enhance flexibility by dynamically combining multiple LoRA experts. However, existing MoE-augmented LoRA methods assume that experts operate independently,...

1 min 1 week, 1 day ago

ada

LOW Academic International

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

arXiv:2604.06473v1 Announce Type: new Abstract: Multivariate forecasting with Transformers faces a core scalability challenge: modeling cross-channel dependencies via attention compounds attention's quadratic sequence complexity with quadratic channel scaling, making full cross-channel attention impractical for high-dimensional time series. We propose Multivariate...

1 min 1 week, 1 day ago

ada

LOW Academic International

Quality-preserving Model for Electronics Production Quality Tests Reduction

arXiv:2604.06451v1 Announce Type: new Abstract: Manufacturing test flows in high-volume electronics production are typically fixed during product development and executed unchanged on every unit, even as failure patterns and process conditions evolve. This protects quality, but it also imposes unnecessary...

1 min 1 week, 1 day ago

ada

LOW Academic International

Spectral Edge Dynamics Reveal Functional Modes of Learning

arXiv:2604.06256v1 Announce Type: new Abstract: Training dynamics during grokking concentrate along a small number of dominant update directions -- the spectral edge -- which reliably distinguishes grokking from non-grokking regimes. We show that standard mechanistic interpretability tools (head attribution, activation...

1 min 1 week, 1 day ago

ada

LOW Academic International

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

arXiv:2604.06253v1 Announce Type: new Abstract: Cross-lingual code generation is critical in enterprise environments where multiple programming languages coexist. However, fine-tuning large language models (LLMs) individually for each language is computationally prohibitive. This paper investigates whether parameter-efficient fine-tuning methods and optimizer...

1 min 1 week, 1 day ago

ada

LOW Academic International

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

arXiv:2604.05445v1 Announce Type: new Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discriminative ones are efficient but act as opaque "black boxes." To bridge this gap, we propose VL-MDR (Vision-Language Multi-Dimensional Reward), a framework...

1 min 1 week, 2 days ago

ada

LOW Academic International

ActivityEditor: Learning to Synthesize Physically Valid Human Mobility

arXiv:2604.05529v1 Announce Type: new Abstract: Human mobility modeling is indispensable for diverse urban applications. However, existing data-driven methods often suffer from data scarcity, limiting their applicability in regions where historical trajectories are unavailable or restricted. To bridge this gap, we...

1 min 1 week, 2 days ago

labor

LOW Academic International

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

arXiv:2604.04999v1 Announce Type: new Abstract: Multimodal self-supervised pretraining offers a promising route to cancer prognosis by integrating histopathology whole-slide images, gene expression, and pathology reports, yet most existing approaches require fully paired and complete inputs. In practice, clinical cohorts are...

1 min 1 week, 2 days ago

ada

LOW Academic International

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language models, but achieving a high-quality adapter often requires systematic hyperparameter tuning because LoRA performance is highly sensitive to configuration choices. In...

1 min 1 week, 2 days ago

ada

LOW Academic International

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

arXiv:2604.05168v1 Announce Type: new Abstract: Leadership-class HPC systems generate massive volumes of heterogeneous, largely unstructured system logs. Because these logs originate from diverse software, hardware, and runtime layers, they exhibit inconsistent formats, making structure extraction and pattern discovery extremely challenging....

1 min 1 week, 2 days ago

ada

LOW Academic International

Improving Sparse Memory Finetuning

arXiv:2604.05248v1 Announce Type: new Abstract: Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading existing capabilities. Standard approaches to updating models, like full finetuning or parameter-efficient methods (e.g., LoRA),...

1 min 1 week, 2 days ago

ada

LOW Academic International

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

arXiv:2604.05257v1 Announce Type: new Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-preserving augmentation. Tabular Denoising Diffusion Probabilistic Models (TabDDPM) generate high-quality synthetic data from heterogeneous tabular datasets but assume independence between...

1 min 1 week, 2 days ago

ada

LOW Academic International

PCA-Driven Adaptive Sensor Triage for Edge AI Inference

arXiv:2604.05045v1 Announce Type: new Abstract: Multi-channel sensor networks in industrial IoT often exceed available bandwidth. We propose PCA-Triage, a streaming algorithm that converts incremental PCA loadings into proportional per-channel sampling rates under a bandwidth budget. PCA-Triage runs in O(wdk) time...

1 min 1 week, 2 days ago

ada

LOW Academic International

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

arXiv:2604.04983v1 Announce Type: new Abstract: We present Territory Paint Wars, a minimal competitive multi-agent reinforcement learning environment implemented in Unity, and use it to systematically investigate failure modes of Proximal Policy Optimisation (PPO) under self-play. A first agent trained for...

1 min 1 week, 2 days ago

ada

LOW Academic International

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

arXiv:2604.05091v1 Announce Type: new Abstract: We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large language models at full precision on a single GPU. Unlike traditional GPU-centric systems, MegaTrain stores parameters and optimizer states in host memory (CPU...

1 min 1 week, 2 days ago

ada

LOW Academic International

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

arXiv:2604.05164v1 Announce Type: new Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overthinking and long thinking traces even for simple queries. Prior approaches including length regularization, adaptive routing, and difficulty-based budget allocation primarily focus...

1 min 1 week, 2 days ago

ada

LOW Academic International

IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents

arXiv:2604.05157v1 Announce Type: new Abstract: Computer-Use Agents (CUAs) leverage large language models to execute GUI operations on desktop environments, yet they generate actions without evaluating action quality, leading to irreversible errors that cascade through subsequent steps. We propose IntentScore, a...

1 min 1 week, 2 days ago

discrimination

Learning to Interrupt in Language-based Multi-agent Communication

The Detection--Extraction Gap: Models Know the Answer Before They Can Say It

Feedback Adaptation for Retrieval-Augmented Generation

Cross-Lingual Transfer and Parameter-Efficient Adaptation in the Turkic Language Family: A Theoretical Framework for Low-Resource Language Models

GraphWalker: Graph-Guided In-Context Learning for Clinical Reasoning on Electronic Health Records

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models

SensorPersona: An LLM-Empowered System for Continual Persona Extraction from Longitudinal Mobile Sensor Streams

TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning

AE-ViT: Stable Long-Horizon Parametric Partial Differential Equations Modeling

Extracting Breast Cancer Phenotypes from Clinical Notes: Comparing LLMs with Classical Ontology Methods

TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

Quality-preserving Model for Electronics Production Quality Tests Reduction

Spectral Edge Dynamics Reveal Functional Modes of Learning

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

ActivityEditor: Learning to Synthesize Physically Valid Human Mobility

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems

Improving Sparse Memory Finetuning

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

PCA-Driven Adaptive Sensor Triage for Edge AI Inference

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.