Labor & Employment

LOW Academic International

Epistemic Traps: Rational Misalignment Driven by Model Misspecification

arXiv:2602.17676v1 Announce Type: new Abstract: The rapid deployment of Large Language Models and AI agents across critical societal and technical domains is hindered by persistent behavioral pathologies including sycophancy, hallucination, and strategic deception that resist mitigation via reinforcement learning. Current...

1 min 1 month, 1 week ago

ada

LOW Academic International

WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics

arXiv:2602.17990v1 Announce Type: new Abstract: LLM-based systems increasingly generate structured workflows for complex tasks. In practice, automatic evaluation of these workflows is difficult, because metric scores are often not calibrated, and score changes do not directly communicate the severity of...

1 min 1 month, 1 week ago

ada

LOW Academic International

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

arXiv:2602.17687v1 Announce Type: cross Abstract: AI systems have achieved remarkable success in processing text and relational data, yet visual document processing remains relatively underexplored. Whereas traditional systems require OCR transcriptions to convert these visual documents into text and metadata, recent...

1 min 1 month, 1 week ago

ada

LOW Academic International

Robust Pre-Training of Medical Vision-and-Language Models with Domain-Invariant Multi-Modal Masked Reconstruction

arXiv:2602.17689v1 Announce Type: cross Abstract: Medical vision-language models show strong potential for joint reasoning over medical images and clinical text, but their performance often degrades under domain shift caused by variations in imaging devices, acquisition protocols, and reporting styles. Existing...

1 min 1 month, 1 week ago

ada

LOW Academic International

Agentic Unlearning: When LLM Agent Meets Machine Unlearning

arXiv:2602.17692v1 Announce Type: cross Abstract: In this paper, we introduce \textbf{agentic unlearning} which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory...

1 min 1 month, 1 week ago

ada

LOW Academic International

EXACT: Explicit Attribute-Guided Decoding-Time Personalization

arXiv:2602.17695v1 Announce Type: cross Abstract: Achieving personalized alignment requires adapting large language models to each user's evolving context. While decoding-time personalization offers a scalable alternative to training-time methods, existing methods largely rely on implicit, less interpretable preference representations and impose...

1 min 1 month, 1 week ago

ada

LOW Academic United States

MIDAS: Mosaic Input-Specific Differentiable Architecture Search

arXiv:2602.17700v1 Announce Type: cross Abstract: Differentiable Neural Architecture Search (NAS) provides efficient, gradient-based methods for automatically designing neural networks, yet its adoption remains limited in practice. We present MIDAS, a novel approach that modernizes DARTS by replacing static architecture parameters...

1 min 1 month, 1 week ago

discrimination

LOW Academic International

"Everyone's using it, but no one is allowed to talk about it": College Students' Experiences Navigating the Higher Education Environment in a Generative AI World

arXiv:2602.17720v1 Announce Type: cross Abstract: Higher education students are increasingly using generative AI in their academic work. However, existing institutional practices have not yet adapted to this shift. Through semi-structured interviews with 23 college students, our study examines the environmental...

1 min 1 month, 1 week ago

ada

LOW Academic International

GeneZip: Region-Aware Compression for Long Context DNA Modeling

arXiv:2602.17739v1 Announce Type: cross Abstract: Genomic sequences span billions of base pairs (bp), posing a fundamental challenge for genome-scale foundation models. Existing approaches largely sidestep this barrier by either scaling relatively small models to long contexts or relying on heavy...

1 min 1 month, 1 week ago

ada

LOW Academic International

Impact of Artificial Intelligence on Dental Education: A Review and Guide for Curriculum Update

In this intellectual work, the clinical and educational aspects of dentistry were confronted with practical applications of artificial intelligence (AI). The aim was to provide an up-to-date overview of the upcoming changes and a brief analysis of the influential advancements...

1 min 1 month, 1 week ago

ada

LOW Academic International

Feedback-based Automated Verification in Vibe Coding of CAS Adaptation Built on Constraint Logic

arXiv:2602.18607v1 Announce Type: new Abstract: In CAS adaptation, a challenge is to define the dynamic architecture of the system and changes in its behavior. Implementation-wise, this is projected into an adaptation mechanism, typically realized as an Adaptation Manager (AM). With...

1 min 1 month, 1 week ago

ada

LOW Academic International

LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology

arXiv:2602.18773v1 Announce Type: new Abstract: The emergence of tool-calling-based agent systems introduces a more evidence-driven paradigm for pathology image analysis in contrast to the coarse-grained text-image diagnostic approaches. With the recent large-scale experimental adoption of spatial transcriptomics technologies, molecularly validated...

1 min 1 month, 1 week ago

ada

LOW Academic International

DREAM: Deep Research Evaluation with Agentic Metrics

arXiv:2602.18940v1 Announce Type: new Abstract: Deep Research Agents generate analyst-grade reports, yet evaluating them remains challenging due to the absence of a single ground truth and the multidimensional nature of research quality. Recent benchmarks propose distinct methodologies, yet they suffer...

1 min 1 month, 1 week ago

ada

LOW Academic International

(Perlin) Noise as AI coordinator

arXiv:2602.18947v1 Announce Type: new Abstract: Large scale control of nonplayer agents is central to modern games, while production systems still struggle to balance several competing goals: locally smooth, natural behavior, and globally coordinated variety across space and time. Prior approaches...

1 min 1 month, 1 week ago

ada

LOW Academic European Union

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

arXiv:2602.18985v1 Announce Type: new Abstract: Infrared radiation computing underpins advances in climate science, remote sensing and spectroscopy but remains constrained by manual workflows. We introduce InfEngine, an autonomous intelligent computational engine designed to drive a paradigm shift from human-led orchestration...

1 min 1 month, 1 week ago

labor

LOW Academic International

Benchmark Test-Time Scaling of General LLM Agents

arXiv:2602.18998v1 Announce Type: new Abstract: LLM agents are increasingly expected to function as general-purpose systems capable of resolving open-ended user requests. While existing benchmarks focus on domain-aware environments for developing specialized agents, evaluating general-purpose agents requires more realistic settings that...

1 min 1 month, 1 week ago

ada

LOW Academic International

Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

arXiv:2602.19006v1 Announce Type: new Abstract: We present a systematic evaluation of large language models on quantum mechanics problem-solving. Our study evaluates 15 models from five providers (OpenAI, Anthropic, Google, Alibaba, DeepSeek) spanning three capability tiers on 20 tasks covering derivations,...

1 min 1 month, 1 week ago

ada

LOW Academic International

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

arXiv:2602.19160v1 Announce Type: new Abstract: This paper examines the reasoning capabilities of Large Language Models (LLMs) from a novel perspective, focusing on their ability to operate within formally specified, rule-governed environments. We evaluate four LLMs (Gemini 2.5 Pro and Flash...

1 min 1 month, 1 week ago

ada

LOW Academic International

Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

arXiv:2602.19225v1 Announce Type: new Abstract: Multi-turn LLM agents are becoming pivotal to production systems, spanning customer service automation, e-commerce assistance, and interactive task management, where accurately distinguishing high-value informative signals from stochastic noise is critical for sample-efficient training. In real-world...

1 min 1 month, 1 week ago

ada

LOW Academic United States

Automated Generation of Microfluidic Netlists using Large Language Models

arXiv:2602.19297v1 Announce Type: new Abstract: Microfluidic devices have emerged as powerful tools in various laboratory applications, but the complexity of their design limits accessibility for many practitioners. While progress has been made in microfluidic design automation (MFDA), a practical and...

1 min 1 month, 1 week ago

labor

LOW Academic European Union

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

arXiv:2602.19298v1 Announce Type: new Abstract: Evaluating personalized, sequential treatment strategies for Alzheimer's disease (AD) using clinical trials is often impractical due to long disease horizons and substantial inter-patient heterogeneity. To address these constraints, we present the Alzheimer's Learning Platform for...

1 min 1 month, 1 week ago

ada

LOW Academic International

Think$^{2}$: Grounded Metacognitive Reasoning in Large Language Models

arXiv:2602.18806v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate strong reasoning performance, yet their ability to reliably monitor, diagnose, and correct their own errors remains limited. We introduce a psychologically grounded metacognitive framework that operationalizes Ann Brown's regulatory cycle...

1 min 1 month, 1 week ago

ada

LOW Academic International

Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation

arXiv:2602.18966v1 Announce Type: new Abstract: Domain-specific speech remains a persistent challenge for automatic speech recognition (ASR), even for state-of-the-art systems like OpenAI's Whisper. We introduce Whisper: Courtside Edition, a novel multi-agent large language model (LLM) pipeline that enhances Whisper transcriptions...

1 min 1 month, 1 week ago

ada

LOW Academic International

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

arXiv:2602.23579v1 Announce Type: new Abstract: The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to m tours that start and end at a common depot and jointly visit all customers exactly once. In the min-max variant, the objective...

1 min 1 month, 1 week ago

ada

LOW Academic International

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

arXiv:2602.23643v1 Announce Type: new Abstract: Everyone from AI executives and researchers to doomsayers, politicians, and activists is talking about Artificial General Intelligence (AGI). Yet, they often don't seem to agree on its exact definition. One common definition of AGI is...

1 min 1 month, 1 week ago

ada

LOW Academic European Union

ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

arXiv:2602.23681v1 Announce Type: new Abstract: The paradigm of large language model (LLM) reasoning is shifting from parameter scaling to test-time compute scaling, yet many existing approaches still rely on uniform brute-force sampling (for example, fixed best-of-N or self-consistency) that is...

1 min 1 month, 1 week ago

ada

LOW Academic International

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

arXiv:2602.23716v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents show promise for e-commerce conversational shopping, yet existing implementations lack the interaction depth and contextual breadth required for complex product research. Meanwhile, the Deep Research paradigm, despite advancing information synthesis...

1 min 1 month, 1 week ago

labor

LOW Academic United States

The Auton Agentic AI Framework

arXiv:2602.23720v1 Announce Type: new Abstract: The field of Artificial Intelligence is undergoing a transition from Generative AI -- probabilistic generation of text and images -- to Agentic AI, in which autonomous systems execute actions within external environments on behalf of...

1 min 1 month, 1 week ago

ada

LOW Academic European Union

RUMAD: Reinforcement-Unifying Multi-Agent Debate

arXiv:2602.23864v1 Announce Type: new Abstract: Multi-agent debate (MAD) systems leverage collective intelligence to enhance reasoning capabilities, yet existing approaches struggle to simultaneously optimize accuracy, consensus formation, and computational efficiency. Static topology methods lack adaptability to task complexity variations, while external...

1 min 1 month, 1 week ago

ada

LOW Academic International

Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction

arXiv:2602.24080v1 Announce Type: new Abstract: The pursuit of human-like conversational agents has long been guided by the Turing test. For modern speech-to-speech (S2S) systems, a critical yet unanswered question is whether they can converse like humans. To tackle this, we...

1 min 1 month, 1 week ago

discrimination

Epistemic Traps: Rational Misalignment Driven by Model Misspecification

WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

Robust Pre-Training of Medical Vision-and-Language Models with Domain-Invariant Multi-Modal Masked Reconstruction

Agentic Unlearning: When LLM Agent Meets Machine Unlearning

EXACT: Explicit Attribute-Guided Decoding-Time Personalization

MIDAS: Mosaic Input-Specific Differentiable Architecture Search

"Everyone's using it, but no one is allowed to talk about it": College Students' Experiences Navigating the Higher Education Environment in a Generative AI World

GeneZip: Region-Aware Compression for Long Context DNA Modeling

Impact of Artificial Intelligence on Dental Education: A Review and Guide for Curriculum Update

Feedback-based Automated Verification in Vibe Coding of CAS Adaptation Built on Constraint Logic

LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology

DREAM: Deep Research Evaluation with Agentic Metrics

(Perlin) Noise as AI coordinator

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

Benchmark Test-Time Scaling of General LLM Agents

Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

Automated Generation of Microfluidic Netlists using Large Language Models

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

Think$^{2}$: Grounded Metacognitive Reasoning in Large Language Models

Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

The Auton Agentic AI Framework

RUMAD: Reinforcement-Unifying Multi-Agent Debate

Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.