Labor & Employment

LOW Academic European Union

ADAB: Arabic Dataset for Automated Politeness Benchmarking -- A Large-Scale Resource for Computational Sociopragmatics

arXiv:2602.13870v1 Announce Type: new Abstract: The growing importance of culturally-aware natural language processing systems has led to an increasing demand for resources that capture sociopragmatic phenomena across diverse languages. Nevertheless, Arabic-language resources for politeness detection remain under-explored, despite the rich...

1 min 1 month, 1 week ago

ada

LOW Academic International

Pre-Editorial Normalization for Automatically Transcribed Medieval Manuscripts in Old French and Latin

arXiv:2602.13905v1 Announce Type: new Abstract: Recent advances in Automatic Text Recognition (ATR) have improved access to historical archives, yet a methodological divide persists between palaeographic transcriptions and normalized digital editions. While ATR models trained on more palaeographically-oriented datasets such as...

1 min 1 month, 1 week ago

ada

LOW Academic International

The Sufficiency-Conciseness Trade-off in LLM Self-Explanation from an Information Bottleneck Perspective

arXiv:2602.14002v1 Announce Type: new Abstract: Large Language Models increasingly rely on self-explanations, such as chain-of-thought reasoning, to improve performance on multi-step question answering. While these explanations enhance accuracy, they are often verbose and costly to generate, raising...

1 min 1 month, 1 week ago

ada

LOW Academic International

GRRM: Group Relative Reward Modeling for Machine Translation

arXiv:2602.14028v1 Announce Type: new Abstract: While Group Relative Policy Optimization (GRPO) offers a powerful framework for LLM post-training, its effectiveness in open-ended domains like Machine Translation hinges on accurate intra-group ranking. We identify that standard Scalar Quality Metrics (SQM) fall...

1 min 1 month, 1 week ago

ada

LOW Academic International

Context Shapes LLMs Retrieval-Augmented Fact-Checking Effectiveness

arXiv:2602.14044v1 Announce Type: new Abstract: Large language models (LLMs) show strong reasoning abilities across diverse tasks, yet their performance on extended contexts remains inconsistent. While prior research has emphasized mid-context degradation in question answering, this study examines the impact of...

1 min 1 month, 1 week ago

ada

LOW Academic International

Epistemic Traps: Rational Misalignment Driven by Model Misspecification

arXiv:2602.17676v1 Announce Type: new Abstract: The rapid deployment of Large Language Models and AI agents across critical societal and technical domains is hindered by persistent behavioral pathologies including sycophancy, hallucination, and strategic deception that resist mitigation via reinforcement learning. Current...

1 min 1 month, 1 week ago

ada

LOW Academic International

WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics

arXiv:2602.17990v1 Announce Type: new Abstract: LLM-based systems increasingly generate structured workflows for complex tasks. In practice, automatic evaluation of these workflows is difficult, because metric scores are often not calibrated, and score changes do not directly communicate the severity of...

1 min 1 month, 1 week ago

ada

LOW Academic International

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

arXiv:2602.17687v1 Announce Type: cross Abstract: AI systems have achieved remarkable success in processing text and relational data, yet visual document processing remains relatively underexplored. Whereas traditional systems require OCR transcriptions to convert these visual documents into text and metadata, recent...

1 min 1 month, 1 week ago

ada

LOW Academic International

Robust Pre-Training of Medical Vision-and-Language Models with Domain-Invariant Multi-Modal Masked Reconstruction

arXiv:2602.17689v1 Announce Type: cross Abstract: Medical vision-language models show strong potential for joint reasoning over medical images and clinical text, but their performance often degrades under domain shift caused by variations in imaging devices, acquisition protocols, and reporting styles. Existing...

1 min 1 month, 1 week ago

ada

LOW Academic International

Agentic Unlearning: When LLM Agent Meets Machine Unlearning

arXiv:2602.17692v1 Announce Type: cross Abstract: In this paper, we introduce agentic unlearning, which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory...

1 min 1 month, 1 week ago

ada

LOW Academic International

EXACT: Explicit Attribute-Guided Decoding-Time Personalization

arXiv:2602.17695v1 Announce Type: cross Abstract: Achieving personalized alignment requires adapting large language models to each user's evolving context. While decoding-time personalization offers a scalable alternative to training-time methods, existing methods largely rely on implicit, less interpretable preference representations and impose...

1 min 1 month, 1 week ago

ada

LOW Academic United States

MIDAS: Mosaic Input-Specific Differentiable Architecture Search

arXiv:2602.17700v1 Announce Type: cross Abstract: Differentiable Neural Architecture Search (NAS) provides efficient, gradient-based methods for automatically designing neural networks, yet its adoption remains limited in practice. We present MIDAS, a novel approach that modernizes DARTS by replacing static architecture parameters...

1 min 1 month, 1 week ago

discrimination

LOW Academic International

"Everyone's using it, but no one is allowed to talk about it": College Students' Experiences Navigating the Higher Education Environment in a Generative AI World

arXiv:2602.17720v1 Announce Type: cross Abstract: Higher education students are increasingly using generative AI in their academic work. However, existing institutional practices have not yet adapted to this shift. Through semi-structured interviews with 23 college students, our study examines the environmental...

1 min 1 month, 1 week ago

ada

LOW Academic International

GeneZip: Region-Aware Compression for Long Context DNA Modeling

arXiv:2602.17739v1 Announce Type: cross Abstract: Genomic sequences span billions of base pairs (bp), posing a fundamental challenge for genome-scale foundation models. Existing approaches largely sidestep this barrier by either scaling relatively small models to long contexts or relying on heavy...

1 min 1 month, 1 week ago

ada

LOW Academic International

Impact of Artificial Intelligence on Dental Education: A Review and Guide for Curriculum Update

In this intellectual work, the clinical and educational aspects of dentistry were confronted with practical applications of artificial intelligence (AI). The aim was to provide an up-to-date overview of the upcoming changes and a brief analysis of the influential advancements...

1 min 1 month, 1 week ago

ada

LOW Academic International

Feedback-based Automated Verification in Vibe Coding of CAS Adaptation Built on Constraint Logic

arXiv:2602.18607v1 Announce Type: new Abstract: In CAS adaptation, a challenge is to define the dynamic architecture of the system and changes in its behavior. Implementation-wise, this is projected into an adaptation mechanism, typically realized as an Adaptation Manager (AM). With...

1 min 1 month, 1 week ago

ada

LOW Academic International

LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology

arXiv:2602.18773v1 Announce Type: new Abstract: The emergence of tool-calling-based agent systems introduces a more evidence-driven paradigm for pathology image analysis in contrast to the coarse-grained text-image diagnostic approaches. With the recent large-scale experimental adoption of spatial transcriptomics technologies, molecularly validated...

1 min 1 month, 1 week ago

ada

LOW Academic International

DREAM: Deep Research Evaluation with Agentic Metrics

arXiv:2602.18940v1 Announce Type: new Abstract: Deep Research Agents generate analyst-grade reports, yet evaluating them remains challenging due to the absence of a single ground truth and the multidimensional nature of research quality. Recent benchmarks propose distinct methodologies, yet they suffer...

1 min 1 month, 1 week ago

ada

LOW Academic International

(Perlin) Noise as AI coordinator

arXiv:2602.18947v1 Announce Type: new Abstract: Large scale control of nonplayer agents is central to modern games, while production systems still struggle to balance several competing goals: locally smooth, natural behavior, and globally coordinated variety across space and time. Prior approaches...

1 min 1 month, 1 week ago

ada

LOW Academic European Union

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

arXiv:2602.18985v1 Announce Type: new Abstract: Infrared radiation computing underpins advances in climate science, remote sensing and spectroscopy but remains constrained by manual workflows. We introduce InfEngine, an autonomous intelligent computational engine designed to drive a paradigm shift from human-led orchestration...

1 min 1 month, 1 week ago

labor

LOW Academic International

Benchmark Test-Time Scaling of General LLM Agents

arXiv:2602.18998v1 Announce Type: new Abstract: LLM agents are increasingly expected to function as general-purpose systems capable of resolving open-ended user requests. While existing benchmarks focus on domain-aware environments for developing specialized agents, evaluating general-purpose agents requires more realistic settings that...

1 min 1 month, 1 week ago

ada

LOW Academic International

Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

arXiv:2602.19006v1 Announce Type: new Abstract: We present a systematic evaluation of large language models on quantum mechanics problem-solving. Our study evaluates 15 models from five providers (OpenAI, Anthropic, Google, Alibaba, DeepSeek) spanning three capability tiers on 20 tasks covering derivations,...

1 min 1 month, 1 week ago

ada

LOW Academic International

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

arXiv:2602.19160v1 Announce Type: new Abstract: This paper examines the reasoning capabilities of Large Language Models (LLMs) from a novel perspective, focusing on their ability to operate within formally specified, rule-governed environments. We evaluate four LLMs (Gemini 2.5 Pro and Flash...

1 min 1 month, 1 week ago

ada

LOW Academic International

Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

arXiv:2602.19225v1 Announce Type: new Abstract: Multi-turn LLM agents are becoming pivotal to production systems, spanning customer service automation, e-commerce assistance, and interactive task management, where accurately distinguishing high-value informative signals from stochastic noise is critical for sample-efficient training. In real-world...

1 min 1 month, 1 week ago

ada

LOW Academic United States

Automated Generation of Microfluidic Netlists using Large Language Models

arXiv:2602.19297v1 Announce Type: new Abstract: Microfluidic devices have emerged as powerful tools in various laboratory applications, but the complexity of their design limits accessibility for many practitioners. While progress has been made in microfluidic design automation (MFDA), a practical and...

1 min 1 month, 1 week ago

labor

LOW Academic European Union

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

arXiv:2602.19298v1 Announce Type: new Abstract: Evaluating personalized, sequential treatment strategies for Alzheimer's disease (AD) using clinical trials is often impractical due to long disease horizons and substantial inter-patient heterogeneity. To address these constraints, we present the Alzheimer's Learning Platform for...

1 min 1 month, 1 week ago

ada

LOW Academic International

Think$^{2}$: Grounded Metacognitive Reasoning in Large Language Models

arXiv:2602.18806v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate strong reasoning performance, yet their ability to reliably monitor, diagnose, and correct their own errors remains limited. We introduce a psychologically grounded metacognitive framework that operationalizes Ann Brown's regulatory cycle...

1 min 1 month, 1 week ago

ada

LOW Academic International

Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation

arXiv:2602.18966v1 Announce Type: new Abstract: Domain-specific speech remains a persistent challenge for automatic speech recognition (ASR), even for state-of-the-art systems like OpenAI's Whisper. We introduce Whisper: Courtside Edition, a novel multi-agent large language model (LLM) pipeline that enhances Whisper transcriptions...

1 min 1 month, 1 week ago

ada

LOW Academic International

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

arXiv:2602.23579v1 Announce Type: new Abstract: The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to m tours that start and end at a common depot and jointly visit all customers exactly once. In the min-max variant, the objective...

1 min 1 month, 1 week ago

ada

LOW Academic International

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

arXiv:2602.23643v1 Announce Type: new Abstract: Everyone from AI executives and researchers to doomsayers, politicians, and activists is talking about Artificial General Intelligence (AGI). Yet, they often don't seem to agree on its exact definition. One common definition of AGI is...

1 min 1 month, 1 week ago

ada

