Litigation

LOW Academic United States

Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization

arXiv:2603.12933v1 Announce Type: new Abstract: Large Language Model (LLM)-driven Multi-Agent Systems (MAS) have demonstrated strong capability in complex reasoning and tool use, and heterogeneous agent pools further broaden the quality--cost trade-off space. Despite these advances, real-world deployment is often constrained...

1 min 1 month ago

evidence

LOW Academic European Union

Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning

arXiv:2603.12290v1 Announce Type: cross Abstract: Scholarly web is a vast network of knowledge connected by citations. However, this system is increasingly compromised by miscitation, where references do not support or even contradict the claims they are cited for. Current miscitation...

1 min 1 month ago

evidence

LOW Academic International

An ethical framework for conversational AI in higher education: toward an evidence-based ethical governance

1 min 1 month ago

evidence

LOW Academic United States

Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel

arXiv:2603.12483v1 Announce Type: new Abstract: Across many domains (e.g., IoT, observability, telecommunications, cybersecurity), there is an emerging adoption of conversational data analysis agents that enable users to "talk to your data" to extract insights. Such data analysis agents operate on...

1 min 1 month ago

evidence

LOW Academic International

Developing and evaluating a chatbot to support maternal health care

arXiv:2603.13168v1 Announce Type: new Abstract: The ability to provide trustworthy maternal health information using phone-based chatbots can have a significant impact, particularly in low-resource settings where users have low health literacy and limited access to care. However, deploying such systems...

1 min 1 month ago

evidence

LOW Academic European Union

The DIME Architecture: A Unified Operational Algorithm for Neural Representation, Dynamics, Control and Integration

arXiv:2603.12286v1 Announce Type: cross Abstract: Modern neuroscience has accumulated extensive evidence on perception, memory, prediction, valuation, and consciousness, yet still lacks an explicit operational architecture capable of integrating these phenomena within a unified computational framework. Existing theories address specific aspects...

1 min 1 month ago

evidence

LOW Academic United States

Budget-Sensitive Discovery Scoring: A Formally Verified Framework for Evaluating AI-Guided Scientific Selection

arXiv:2603.12349v1 Announce Type: cross Abstract: Scientific discovery increasingly relies on AI systems to select candidates for expensive experimental validation, yet no principled, budget-aware evaluation framework exists for comparing selection strategies -- a gap intensified by large language models (LLMs), which...

1 min 1 month ago

discovery

LOW Academic International

SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs

arXiv:2603.12382v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have advanced from image-level reasoning to pixel-level grounding, but extending these capabilities to videos remains challenging as models must achieve spatial precision and temporally consistent reference tracking. Existing video MLLMs...

1 min 1 month ago

standing

LOW Academic United States

Operationalising Cyber Risk Management Using AI: Connecting Cyber Incidents to MITRE ATT&CK Techniques, Security Controls, and Metrics

arXiv:2603.12455v1 Announce Type: cross Abstract: The escalating frequency of cyber-attacks poses significant challenges for organisations, particularly small enterprises constrained by limited in-house expertise, insufficient knowledge, and financial resources. This research presents a novel framework that leverages Natural Language Processing to...

1 min 1 month ago

evidence

LOW Academic International

Shattering the Shortcut: A Topology-Regularized Benchmark for Multi-hop Medical Reasoning in LLMs

arXiv:2603.12458v1 Announce Type: cross Abstract: While Large Language Models (LLMs) achieve expert-level performance on standard medical benchmarks through single-hop factual recall, they severely struggle with the complex, multi-hop diagnostic reasoning required in real-world clinical settings. A primary obstacle is "shortcut...

1 min 1 month ago

evidence

LOW Academic International

The Perfection Paradox: From Architect to Curator in AI-Assisted API Design

arXiv:2603.12475v1 Announce Type: cross Abstract: Enterprise API design is often bottlenecked by the tension between rapid feature delivery and the rigorous maintenance of usability standards. We present an industrial case study evaluating an AI-assisted design workflow trained on API Improvement...

1 min 1 month ago

trial

LOW Academic International

TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction

arXiv:2603.12500v1 Announce Type: cross Abstract: We present a Temporal Rule-Anchored Chain-of-Evidence (TRACE) on knowledge graphs for interpretable stock movement prediction that unifies symbolic relational priors, dynamic graph exploration, and LLM-guided decision making in a single end-to-end pipeline. The approach performs...

1 min 1 month ago

evidence

LOW Academic International

LLM BiasScope: A Real-Time Bias Analysis Platform for Comparative LLM Evaluation

arXiv:2603.12522v1 Announce Type: cross Abstract: As large language models (LLMs) are deployed widely, detecting and understanding bias in their outputs is critical. We present LLM BiasScope, a web application for side-by-side comparison of LLM outputs with real-time bias analysis. The...

1 min 1 month ago

standing

LOW Academic International

Continual Learning in Large Language Models: Methods, Challenges, and Opportunities

arXiv:2603.12658v1 Announce Type: new Abstract: Continual learning (CL) has emerged as a pivotal paradigm to enable large language models (LLMs) to dynamically adapt to evolving knowledge and sequential tasks while mitigating catastrophic forgetting, a critical limitation of the static pre-training paradigm...

1 min 1 month ago

standing

LOW Academic International

Experimental evidence of progressive ChatGPT models self-convergence

arXiv:2603.12683v1 Announce Type: new Abstract: Large Language Models (LLMs) that undergo recursive training on synthetically generated data are susceptible to model collapse, a phenomenon marked by the generation of meaningless output. Existing research has examined this issue from either theoretical...

1 min 1 month ago

evidence

LOW Academic International

ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation

arXiv:2603.13154v1 Announce Type: new Abstract: As corporate responsibility increasingly incorporates environmental, social, and governance (ESG) criteria, ESG reporting is becoming a legal requirement in many regions and a key channel for documenting sustainability practices and assessing firms' long-term and ethical...

1 min 1 month ago

standing

LOW Academic International

Generalist Large Language Models for Molecular Property Prediction: Distilling Knowledge from Specialist Models

arXiv:2603.12344v1 Announce Type: new Abstract: Molecular Property Prediction (MPP) is a central task in drug discovery. While Large Language Models (LLMs) show promise as generalist models for MPP, their current performance remains below the threshold for practical adoption. We propose...

1 min 1 month ago

discovery

LOW Academic International

Overcoming the Modality Gap in Context-Aided Forecasting

arXiv:2603.12451v1 Announce Type: new Abstract: Context-aided forecasting (CAF) holds promise for integrating domain knowledge and forward-looking information, enabling AI systems to surpass traditional statistical methods. However, recent empirical studies reveal a puzzling gap: multimodal models often fail to outperform their...

1 min 1 month ago

evidence

LOW Academic United States

Learning Pore-scale Multiphase Flow from 4D Velocimetry

arXiv:2603.12516v1 Announce Type: new Abstract: Multiphase flow in porous media underpins subsurface energy and environmental technologies, including geological CO$_2$ storage and underground hydrogen storage, yet pore-scale dynamics in realistic three-dimensional materials remain difficult to characterize and predict. Here we introduce...

1 min 1 month ago

motion

LOW Academic European Union

Deep Distance Measurement Method for Unsupervised Multivariate Time Series Similarity Retrieval

arXiv:2603.12544v1 Announce Type: new Abstract: We propose the Deep Distance Measurement Method (DDMM) to improve retrieval accuracy in unsupervised multivariate time series similarity retrieval. DDMM enables learning of minute differences within states in the entire time series and thereby recognition...

1 min 1 month ago

trial

LOW Academic International

Asymptotic and Finite-Time Guarantees for Langevin-Based Temperature Annealing in InfoNCE

arXiv:2603.12552v1 Announce Type: new Abstract: The InfoNCE loss in contrastive learning depends critically on a temperature parameter, yet its dynamics under fixed versus annealed schedules remain poorly understood. We provide a theoretical analysis by modeling embedding evolution under Langevin dynamics...

1 min 1 month ago

standing

LOW Academic United States

Scaling Laws and Pathologies of Single-Layer PINNs: Network Width and PDE Nonlinearity

arXiv:2603.12556v1 Announce Type: new Abstract: We establish empirical scaling laws for Single-Layer Physics-Informed Neural Networks on canonical nonlinear PDEs. We identify a dual optimization failure: (i) a baseline pathology, where the solution error fails to decrease with network width, even...

1 min 1 month ago

evidence

LOW Journal International

Nailing Down a Conversion Therapy Offence

1 min 1 month ago

standing

LOW Academic United States

AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions

arXiv:2603.11559v1 Announce Type: new Abstract: Large language models perform reliably when their outputs can be checked: solving equations, writing code, retrieving facts. They perform differently when checking is impossible, as when a clinician chooses an irreversible treatment on incomplete data,...

1 min 1 month ago

standing

LOW Academic International

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

arXiv:2603.11414v1 Announce Type: new Abstract: We present MaterialFigBench, a benchmark dataset designed to evaluate the ability of multimodal large language models (LLMs) to solve university-level materials science problems that require accurate interpretation of figures. Unlike existing benchmarks that primarily rely...

1 min 1 month ago

standing

LOW Academic International

Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment

arXiv:2603.11388v1 Announce Type: new Abstract: Safety alignment aims to ensure that large language models (LLMs) refuse harmful requests by post-training on harmful queries paired with refusal answers. Although safety alignment is widely adopted in industry, the overrefusal problem where aligned...

1 min 1 month ago

standing

LOW Academic United States

VisDoT: Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought

arXiv:2603.11631v1 Announce Type: new Abstract: Large vision-language models (LVLMs) struggle to reliably detect visual primitives in charts and align them with semantic representations, which severely limits their performance on complex visual reasoning. This lack of perceptual grounding constitutes a major...

1 min 1 month ago

standing

LOW Academic United States

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

arXiv:2603.11214v1 Announce Type: new Abstract: We evaluate the autonomous cyber-attack capabilities of frontier AI models on two purpose-built cyber ranges (a 32-step corporate network attack and a 7-step industrial control system attack) that require chaining heterogeneous capabilities across extended action sequences. By...

1 min 1 month ago

trial

LOW Academic International

Gender Bias in Generative AI-assisted Recruitment Processes

arXiv:2603.11736v1 Announce Type: new Abstract: In recent years, generative artificial intelligence (GenAI) systems have assumed increasingly crucial roles in selection processes, personnel recruitment and analysis of candidates' profiles. However, the employment of large language models (LLMs) risks reproducing, and in...

1 min 1 month ago

motion

LOW Academic International

LLMs can construct powerful representations and streamline sample-efficient supervised learning

arXiv:2603.11679v1 Announce Type: new Abstract: As real-world datasets become increasingly complex and heterogeneous, supervised learning is often bottlenecked by input representation design. Modeling multimodal data for downstream tasks, such as time-series, free text, and structured records, often requires non-trivial domain-specific...

1 min 1 month ago

evidence

