Litigation

LOW Academic International

Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets

arXiv:2604.02460v1 Announce Type: new Abstract: Recent work reports strong performance from multi-agent LLM systems (MAS), but these gains are often confounded by increased test-time computation. When computation is normalized, single-agent systems (SAS) can match or outperform MAS, yet the theoretical...

1 min 1 week, 4 days ago

standing

LOW Academic International

CharTool: Tool-Integrated Visual Reasoning for Chart Understanding

arXiv:2604.02794v1 Announce Type: new Abstract: Charts are ubiquitous in scientific and financial literature for presenting structured data. However, chart reasoning remains challenging for multimodal large language models (MLLMs) due to the lack of high-quality training data, as well as the...

1 min 1 week, 4 days ago

standing

LOW Academic International

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

arXiv:2604.02580v1 Announce Type: new Abstract: Evaluating code generation models for 3D spatial reasoning requires executing generated code in realistic environments and assessing outputs beyond surface-level correctness. We introduce a platform, VoxelCode, for analyzing code generation capabilities for 3D understanding and...

1 min 1 week, 4 days ago

standing

LOW Academic International

AutoVerifier: An Agentic Automated Verification Framework Using Large Language Models

arXiv:2604.02617v1 Announce Type: new Abstract: Scientific and Technical Intelligence (S&TI) analysis requires verifying complex technical claims across rapidly growing literature, where existing approaches fail to bridge the verification gap between surface-level accuracy and deeper methodological validity. We present AutoVerifier, an...

1 min 1 week, 4 days ago

evidence

LOW Academic European Union

Competency Questions as Executable Plans: a Controlled RAG Architecture for Cultural Heritage Storytelling

arXiv:2604.02545v1 Announce Type: new Abstract: The preservation of intangible cultural heritage is a critical challenge as collective memory fades over time. While Large Language Models (LLMs) offer a promising avenue for generating engaging narratives, their propensity for factual inaccuracies or...

1 min 1 week, 4 days ago

evidence

LOW Academic International

ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents

arXiv:2604.02834v1 Announce Type: new Abstract: Longitudinal health agents must reason across multi-source trajectories that combine continuous device streams, sparse clinical exams, and episodic life events - yet evaluating them is hard: real-world data cannot be released at scale, and temporally...

1 min 1 week, 4 days ago

evidence

LOW Academic International

Let's Have a Conversation: Designing and Evaluating LLM Agents for Interactive Optimization

arXiv:2604.02666v1 Announce Type: new Abstract: Optimization is as much about modeling the right problem as solving it. Identifying the right objectives, constraints, and trade-offs demands extensive interaction between researchers and stakeholders. Large language models can empower decision-makers with optimization capabilities...

1 min 1 week, 4 days ago

evidence

LOW Academic International

I must delete the evidence: AI Agents Explicitly Cover up Fraud and Violent Crime

arXiv:2604.02500v1 Announce Type: new Abstract: As ongoing research explores the ability of AI agents to be insider threats and act against company interests, we showcase the abilities of such agents to act against human well being in service of corporate...

1 min 1 week, 4 days ago

evidence

LOW Academic United States

Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space

arXiv:2604.02476v1 Announce Type: new Abstract: This paper examines the role of threshold logic in understanding generative artificial intelligence. Threshold functions, originally studied in the 1960s in digital circuit synthesis, provide a structurally transparent model of neural computation: a weighted sum...

1 min 1 week, 4 days ago

standing

LOW Academic United States

CIPHER: Conformer-based Inference of Phonemes from High-density EEG

arXiv:2604.02362v1 Announce Type: cross Abstract: Decoding speech information from scalp EEG remains difficult due to low SNR and spatial blurring. We present CIPHER (Conformer-based Inference of Phonemes from High-density EEG Representations), a dual-pathway model using (i) ERP features and (ii)...

1 min 1 week, 4 days ago

evidence

LOW Academic United States

A Spectral Framework for Multi-Scale Nonlinear Dimensionality Reduction

arXiv:2604.02535v1 Announce Type: new Abstract: Dimensionality reduction (DR) is characterized by two longstanding trade-offs. First, there is a global-local preservation tension: methods such as t-SNE and UMAP prioritize local neighborhood preservation, yet may distort global manifold structure, while methods such...

1 min 1 week, 4 days ago

standing

LOW Academic International

Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems

arXiv:2604.02668v1 Announce Type: new Abstract: Large language models (LLMs) often exhibit sycophancy: agreement with user stance even when it conflicts with the model's opinion. While prior work has mostly studied this in single-agent settings, it remains underexplored in collaborative multi-agent...

1 min 1 week, 4 days ago

standing

LOW Academic International

SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models

arXiv:2604.02660v1 Announce Type: new Abstract: As Large Language Models (LLMs) increasingly power decision-making systems across critical domains, understanding and mitigating their biases becomes essential for responsible AI deployment. Although bias assessment frameworks have proliferated for attributes such as race and...

1 min 1 week, 4 days ago

standing

LOW Academic European Union

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

arXiv:2604.02734v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated strong potential in long-horizon decision-making tasks, such as embodied manipulation and web interaction. However, agents frequently struggle with endless trial-and-error loops or deviate from the main objective in complex...

1 min 1 week, 4 days ago

trial

LOW Academic European Union

Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy

arXiv:2604.02709v1 Announce Type: new Abstract: The formal reasoning capabilities of LLMs are crucial for advancing automated software engineering. However, existing benchmarks for LLMs lack systematic evaluation based on computation and complexity, leaving a critical gap in understanding their formal reasoning...

1 min 1 week, 4 days ago

standing

LOW Academic European Union

Cognitive Energy Modeling for Neuroadaptive Human-Machine Systems using EEG and WGAN-GP

arXiv:2604.01653v1 Announce Type: new Abstract: Electroencephalography (EEG) provides a non-invasive insight into the brain's cognitive and emotional dynamics. However, modeling how these states evolve in real time and quantifying the energy required for such transitions remains a major challenge. The...

1 min 2 weeks, 1 day ago

motion

LOW Academic United States

More Human, More Efficient: Aligning Annotations with Quantized SLMs

arXiv:2604.00586v1 Announce Type: new Abstract: As Large Language Model (LLM) capabilities advance, the demand for high-quality annotation of exponentially increasing text corpora has outpaced human capacity, leading to the widespread adoption of LLMs in automatic evaluation and annotation. However, proprietary...

1 min 2 weeks, 1 day ago

motion

LOW Academic International

Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study

arXiv:2604.00261v2 Announce Type: new Abstract: Large language models (LLMs) have achieved strong performance on medical question answering (medical QA), and chain-of-thought (CoT) prompting has further improved results by eliciting explicit intermediate reasoning; meanwhile, self-reflective (self-corrective) prompting has been widely claimed...

1 min 2 weeks, 1 day ago

standing

LOW Academic European Union

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

arXiv:2604.01576v1 Announce Type: new Abstract: Large language models deployed in supportive or advisory roles must balance helpfulness with preservation of user autonomy, yet standard alignment methods primarily optimize for helpfulness and harmlessness without explicitly modeling relational risks such as dependency...

1 min 2 weeks, 1 day ago

motion

LOW Academic International

Therefore I am. I Think

arXiv:2604.01202v2 Announce Type: new Abstract: We consider the question: when a large language reasoning model makes a choice, did it think first and then decide to, or decide first and then think? In this paper, we present evidence that detectable,...

1 min 2 weeks, 1 day ago

evidence

LOW Academic International

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

arXiv:2604.01481v1 Announce Type: new Abstract: The development of robust clinical decision support systems is frequently impeded by the scarcity of high-fidelity, privacy-preserving biomedical data. While Generative Large Language Models (LLMs) offer a promising avenue for synthetic data generation, they often...

1 min 2 weeks, 1 day ago

discovery

LOW News United States

Authors' lucky break in court may help class action over Meta torrenting

Judge gave authors an easier attack on Meta’s torrenting. Meta hopes SCOTUS ruling will block it.

1 min 2 weeks, 1 day ago

class action

LOW Academic United States

Do LLMs Know What Is Private Internally? Probing and Steering Contextual Privacy Norms in Large Language Model Representations

arXiv:2604.00209v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed in high-stakes settings, yet they frequently violate contextual privacy by disclosing private information in situations where humans would exercise discretion. This raises a fundamental question: do LLMs internally...

1 min 2 weeks, 1 day ago

standing

LOW Academic United Kingdom

Phonological Fossils: Machine Learning Detection of Non-Mainstream Vocabulary in Sulawesi Basic Lexicon

arXiv:2604.00023v1 Announce Type: new Abstract: Basic vocabulary in many Sulawesi Austronesian languages includes forms resisting reconstruction to any proto-form with phonological patterns inconsistent with inherited roots, but whether this non-conforming vocabulary represents pre-Austronesian substrate or independent innovation has not been...

1 min 2 weeks, 1 day ago

evidence

LOW Academic International

Towards Intrinsically Calibrated Uncertainty Quantification in Industrial Data-Driven Models via Diffusion Sampler

arXiv:2604.01870v1 Announce Type: new Abstract: In modern process industries, data-driven models are important tools for real-time monitoring when key performance indicators are difficult to measure directly. While accurate predictions are essential, reliable uncertainty quantification (UQ) is equally critical for safety,...

1 min 2 weeks, 1 day ago

trial

LOW Academic International

An Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance Analysis

arXiv:2604.01308v1 Announce Type: new Abstract: Designing reliable integrated energy systems for industrial processes requires optimization and verification models across multiple fidelities, from architecture-level sizing to high-fidelity dynamic operation. However, model mismatch across fidelities obscures the sources of performance loss and...

1 min 2 weeks, 1 day ago

trial

LOW Academic International

Improvisational Games as a Benchmark for Social Intelligence of AI Agents: The Case of Connections

arXiv:2604.00284v1 Announce Type: new Abstract: We formally introduce an improvisational wordplay game called Connections to explore reasoning capabilities of AI agents. Playing Connections combines skills in knowledge retrieval, summarization and awareness of cognitive states of other agents. We show how...

1 min 2 weeks, 1 day ago

standing

LOW News United States

Costco sued for seeking refunds on tariffs customers paid

Proposed class action accuses Costco of unjust enrichment.

1 min 2 weeks, 1 day ago

class action

LOW Academic European Union

One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction

arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consistent outputs, while complex cases produce divergent predictions under minor prompt changes. Existing single-agent strategies sample from one role-conditioned distribution, and multi-agent...

1 min 2 weeks, 1 day ago

evidence

LOW Academic European Union

Semantic Shifts of Psychological Concepts in Scientific and Popular Media Discourse: A Distributional Semantics Analysis of Russian-Language Corpora

arXiv:2604.00017v1 Announce Type: new Abstract: This article examines semantic shifts in psychological concepts across scientific and popular media discourse using methods of distributional semantics applied to Russian-language corpora. Two corpora were compiled: a scientific corpus of approximately 300 research articles...

1 min 2 weeks, 1 day ago

motion

