Labor & Employment

LOW Academic International

Coding Agents are Effective Long-Context Processors

arXiv:2603.20432v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable progress in scaling to access massive contexts. However, the access is via the latent and uninterpretable attention mechanisms, and LLMs fail to effective process long context, exhibiting significant...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

arXiv:2603.20450v1 Announce Type: new Abstract: A number of scientific conferences and journals have recently enacted policies that prohibit LLM usage by peer reviewers, except for polishing, paraphrasing, and grammar correction of otherwise human-written reviews. But, are these policies enforceable? To...

1 min 3 weeks, 4 days ago

labor

LOW Academic International

Diffutron: A Masked Diffusion Language Model for Turkish Language

arXiv:2603.20466v1 Announce Type: new Abstract: Masked Diffusion Language Models (MDLMs) have emerged as a compelling non-autoregressive alternative to standard large language models; however, their application to morphologically rich languages remains limited. In this paper, we introduce $\textit{Diffutron}$, a masked diffusion...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

JUBAKU: An Adversarial Benchmark for Exposing Culturally Grounded Stereotypes in Japanese LLMs

arXiv:2603.20581v1 Announce Type: new Abstract: Social biases reflected in language are inherently shaped by cultural norms, which vary significantly across regions and lead to diverse manifestations of stereotypes. Existing evaluations of social bias in large language models (LLMs) for non-English...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

BenchBench: Benchmarking Automated Benchmark Generation

arXiv:2603.20807v1 Announce Type: new Abstract: Benchmarks are the de facto standard for tracking progress in large language models (LLMs), yet static test sets can rapidly saturate, become vulnerable to contamination, and are costly to refresh. Scalable evaluation of open-ended items...

1 min 3 weeks, 4 days ago

discrimination

LOW Academic International

HiCI: Hierarchical Construction-Integration for Long-Context Attention

arXiv:2603.20843v1 Announce Type: new Abstract: Long-context language modeling is commonly framed as a scalability challenge of token-level attention, yet local-to-global information structuring remains largely implicit in existing approaches. Drawing on cognitive theories of discourse comprehension, we propose HiCI (Hierarchical Construction--Integration),...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

Can ChatGPT Really Understand Modern Chinese Poetry?

arXiv:2603.20851v1 Announce Type: new Abstract: ChatGPT has demonstrated remarkable capabilities on both poetry generation and translation, yet its ability to truly understand poetry remains unexplored. Previous poetry-related work merely analyzed experimental outcomes without addressing fundamental issues of comprehension. This paper...

1 min 3 weeks, 4 days ago

labor

LOW Academic International

Transformer-Based Predictive Maintenance for Risk-Aware Instrument Calibration

arXiv:2603.20297v1 Announce Type: new Abstract: Accurate calibration is essential for instruments whose measurements must remain traceable, reliable, and compliant over long operating periods. Fixed-interval programs are easy to administer, but they ignore that instruments drift at different rates under different...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms

arXiv:2603.20333v1 Announce Type: new Abstract: Modern autonomous multi-agent systems combine heterogeneous learning mechanisms operating at different timescales. An open question remains: can one formally guarantee that coupled dynamics of such mechanisms stay within the admissible operational regime? This paper studies...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

arXiv:2603.20397v1 Announce Type: new Abstract: The key-value (KV) cache is a foundational optimization in Transformer-based large language models (LLMs), eliminating redundant recomputation of past token representations during autoregressive generation. However, its memory footprint scales linearly with context length, imposing critical...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv:2603.20492v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse applications, yet their deployment remains challenging due to substantial computational costs, memory requirements, and energy consumption. Recent empirical studies have demonstrated that no single efficiency...

1 min 3 weeks, 4 days ago

ada

LOW Academic International

Pitfalls in Evaluating Interpretability Agents

arXiv:2603.20101v1 Announce Type: new Abstract: Automated interpretability systems aim to reduce the need for human labor and scale analysis to increasingly large models and diverse tasks. Recent efforts toward this goal leverage large language models (LLMs) at increasing levels of...

1 min 3 weeks, 5 days ago

labor

LOW Academic International

When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

arXiv:2603.19247v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly integrated into high-stakes applications, making robust safety guarantees a central practical and commercial concern. Existing safety evaluations predominantly rely on fixed collections of harmful prompts, implicitly assuming non-adaptive adversaries...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

DuCCAE: A Hybrid Engine for Immersive Conversation via Collaboration, Augmentation, and Evolution

arXiv:2603.19248v1 Announce Type: cross Abstract: Immersive conversational systems in production face a persistent trade-off between responsiveness and long-horizon task capability. Real-time interaction is achievable for lightweight turns, but requests involving planning and tool invocation (e.g., search and media generation) produce...

1 min 3 weeks, 5 days ago

labor

LOW Academic International

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

arXiv:2603.19579v1 Announce Type: new Abstract: Multi-objective reinforcement learning (MORL) provides an effective solution for decision-making problems involving conflicting objectives. However, achieving high-quality approximations to the Pareto policy set remains challenging, especially in complex tasks with continuous or high-dimensional state-action space....

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

arXiv:2603.19268v1 Announce Type: cross Abstract: Large language models (LLMs) in the direction of task adaptation and capability enhancement for professional fields demonstrate significant application potential. Nevertheless, for complex physical systems such as combustion science, general-purpose LLMs often generate severe hallucinations...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning

arXiv:2603.19278v1 Announce Type: cross Abstract: Modern Transformer-based models frequently suffer from miscalibration, producing overconfident predictions that do not reflect true empirical frequencies. This work investigates the calibration dynamics of LoRA: Low-Rank Adaptation and a novel hyper-network-based adaptation framework as parameter-efficient...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Speculating Experts Accelerates Inference for Mixture-of-Experts

arXiv:2603.19289v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) models have gained popularity as a means of scaling the capacity of large language models (LLMs) while maintaining sparse activations and reduced per-token compute. However, in memory-constrained inference settings, expert weights must be...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Enhancing Legal LLMs through Metadata-Enriched RAG Pipelines and Direct Preference Optimization

arXiv:2603.19251v1 Announce Type: new Abstract: Large Language Models (LLMs) perform well in short contexts but degrade on long legal documents, often producing hallucinations such as incorrect clauses or precedents. In the legal domain, where precision is critical, such errors undermine...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

ShobdoSetu: A Data-Centric Framework for Bengali Long-Form Speech Recognition and Speaker Diarization

arXiv:2603.19256v1 Announce Type: new Abstract: Bengali is spoken by over 230 million people yet remains severely under-served in automatic speech recognition (ASR) and speaker diarization research. In this paper, we present our system for the DL Sprint 4.0 Bengali Long-Form...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review

arXiv:2603.19292v1 Announce Type: new Abstract: Collaboration is a task-oriented, high-level human behavior. In most cases, conversation serves as the primary medium for information exchange and coordination, making conversational data a valuable resource for the automatic analysis of collaborative processes. In...

1 min 3 weeks, 5 days ago

labor

LOW Academic International

BrainSCL: Subtype-Guided Contrastive Learning for Brain Disorder Diagnosis

arXiv:2603.19295v1 Announce Type: new Abstract: Mental disorder populations exhibit pronounced heterogeneity -- that is, the significant differences between samples -- poses a significant challenge to the definition of positive pairs in contrastive learning. To address this, we propose a subtype-guided...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly

arXiv:2603.19296v1 Announce Type: new Abstract: To tackle the huge computational demand of large foundation models, activation-aware compression techniques without retraining have been introduced. However, since these methods highly rely on calibration data, domain shift issues may arise for unseen downstream...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

DPxFin: Adaptive Differential Privacy for Anti-Money Laundering Detection via Reputation-Weighted Federated Learning

arXiv:2603.19314v1 Announce Type: new Abstract: In the modern financial system, combating money laundering is a critical challenge complicated by data privacy concerns and increasingly complex fraud transaction patterns. Although federated learning (FL) is a promising problem-solving approach as it allows...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

MSNet and LS-Net: Scalable Multi-Scale Multi-Representation Networks for Time Series Classification

arXiv:2603.19315v1 Announce Type: new Abstract: Time series classification (TSC) performance depends not only on architectural design but also on the diversity of input representations. In this work, we propose a scalable multi-scale convolutional framework that systematically integrates structured multi-representation inputs...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Target Concept Tuning Improves Extreme Weather Forecasting

arXiv:2603.19325v1 Announce Type: new Abstract: Deep learning models for meteorological forecasting often fail in rare but high-impact events such as typhoons, where relevant data is scarce. Existing fine-tuning methods typically face a trade-off between overlooking these extreme events and overfitting...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Anatomical Heterogeneity in Transformer Language Models

arXiv:2603.19348v1 Announce Type: new Abstract: Current transformer language models are trained with uniform computational budgets across all layers, implicitly assuming layer homogeneity. We challenge this assumption through empirical analysis of SmolLM2-135M, a 30-layer, 135M-parameter causal language model, using five diagnostic...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

arXiv:2603.19470v1 Announce Type: new Abstract: Off-policy problems such as policy staleness and training-inference mismatch, has become a major bottleneck for training stability and further exploration for LLM RL. To enhance inference efficiency, the distribution gap between the inference and updated...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

ICLAD: In-Context Learning for Unified Tabular Anomaly Detection Across Supervision Regimes

arXiv:2603.19497v1 Announce Type: new Abstract: Anomaly detection on tabular data is commonly studied under three supervision regimes, including one-class settings that assume access to anomaly-free training samples, fully unsupervised settings with unlabeled and potentially contaminated training data, and semi-supervised settings...

1 min 3 weeks, 5 days ago

ada

LOW Academic International

ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization

arXiv:2603.19594v1 Announce Type: new Abstract: Indoor localization has become increasingly essential for applications ranging from asset tracking to delivering personalized services. Federated learning (FL) offers a privacy-preserving approach by training a centralized global model (GM) using distributed data from mobile...

1 min 3 weeks, 5 days ago

ada

Coding Agents are Effective Long-Context Processors

Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Diffutron: A Masked Diffusion Language Model for Turkish Language

JUBAKU: An Adversarial Benchmark for Exposing Culturally Grounded Stereotypes in Japanese LLMs

BenchBench: Benchmarking Automated Benchmark Generation

HiCI: Hierarchical Construction-Integration for Long-Context Attention

Can ChatGPT Really Understand Modern Chinese Poetry?

Transformer-Based Predictive Maintenance for Risk-Aware Instrument Calibration

Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

Pitfalls in Evaluating Interpretability Agents

When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models

DuCCAE: A Hybrid Engine for Immersive Conversation via Collaboration, Augmentation, and Evolution

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning

Speculating Experts Accelerates Inference for Mixture-of-Experts

Enhancing Legal LLMs through Metadata-Enriched RAG Pipelines and Direct Preference Optimization

ShobdoSetu: A Data-Centric Framework for Bengali Long-Form Speech Recognition and Speaker Diarization

Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review

BrainSCL: Subtype-Guided Contrastive Learning for Brain Disorder Diagnosis

TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly

DPxFin: Adaptive Differential Privacy for Anti-Money Laundering Detection via Reputation-Weighted Federated Learning

MSNet and LS-Net: Scalable Multi-Scale Multi-Representation Networks for Time Series Classification

Target Concept Tuning Improves Extreme Weather Forecasting

Anatomical Heterogeneity in Transformer Language Models

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

ICLAD: In-Context Learning for Unified Tabular Anomaly Detection Across Supervision Regimes

ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.