Litigation

LOW Academic International

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

arXiv:2603.00686v1 Announce Type: new Abstract: Large Language Models have evolved from single-round generators into long-horizon agents, capable of complex text synthesis scenarios. However, current evaluation frameworks lack the ability to assess the actual synthesis operations, such as outlining, drafting, and...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Thoth: Mid-Training Bridges LLMs to Time Series Understanding

arXiv:2603.01042v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable success in general-purpose reasoning. However, they still struggle to understand and reason about time series data, which limits their effectiveness in decision-making scenarios that depend on temporal dynamics....

1 min 1 month, 2 weeks ago

standing

LOW Academic International

M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

arXiv:2603.00055v1 Announce Type: new Abstract: Although multimodal large language models (MLLMs) have advanced industrial anomaly detection toward a zero-shot paradigm, they still tend to produce high-confidence yet unreliable decisions in fine-grained and structurally complex industrial scenarios, and lack effective self-corrective...

1 min 1 month, 2 weeks ago

trial

LOW Academic International

Certainty-Validity: A Diagnostic Framework for Discrete Commitment Systems

arXiv:2603.00070v1 Announce Type: new Abstract: Standard evaluation metrics for machine learning -- accuracy, precision, recall, and AUROC -- assume that all errors are equivalent: a confident incorrect prediction is penalized identically to an uncertain one. For discrete commitment systems (architectures...

1 min 1 month, 2 weeks ago

evidence

LOW Academic European Union

SEval-NAS: A Search-Agnostic Evaluation for Neural Architecture Search

arXiv:2603.00099v1 Announce Type: new Abstract: Neural architecture search (NAS) automates the discovery of neural networks that meet specified criteria, yet its evaluation procedures are often hardcoded, limiting the ability to introduce new metrics. This issue is especially pronounced in hardware-aware...

1 min 1 month, 2 weeks ago

discovery

LOW Academic International

LIDS: LLM Summary Inference Under the Layered Lens

arXiv:2603.00105v1 Announce Type: new Abstract: Large language models (LLMs) have gained significant attention by many researchers and practitioners in natural language processing (NLP) since the introduction of ChatGPT in 2022. One notable feature of ChatGPT is its ability to generate...

1 min 1 month, 2 weeks ago

discovery

LOW Academic International

OSF: On Pre-training and Scaling of Sleep Foundation Models

arXiv:2603.00190v1 Announce Type: new Abstract: Polysomnography (PSG) provides the gold standard for sleep assessment but suffers from substantial heterogeneity across recording devices and cohorts. There have been growing efforts to build general-purpose foundation models (FMs) for sleep physiology, but lack...

1 min 1 month, 2 weeks ago

standing

LOW Academic United States

A medical coding language model trained on clinical narratives from a population-wide cohort of 1.8 million patients

arXiv:2603.00221v1 Announce Type: new Abstract: Medical coding translates clinical documentation into standardized codes for billing, research, and public health, but manual coding is time-consuming and error-prone. Existing automation efforts rely on small datasets that poorly represent real-world patient heterogeneity. We...

1 min 1 month, 2 weeks ago

standing

LOW Academic European Union

Hereditary Geometric Meta-RL: Nonlocal Generalization via Task Symmetries

arXiv:2603.00396v1 Announce Type: new Abstract: Meta-Reinforcement Learning (Meta-RL) commonly generalizes via smoothness in the task encoding. While this enables local generalization around each training task, it requires dense coverage of the task space and leaves richer task space structure untapped....

1 min 1 month, 2 weeks ago

discovery

LOW News United States

FCC chair calls Paramount/WBD merger "a lot cleaner" than defunct Netflix deal

FCC to review foreign debt, but Carr indicates it will be a formality.

1 min 1 month, 2 weeks ago

discovery

LOW Academic International

Humans and LLMs Diverge on Probabilistic Inferences

arXiv:2602.23546v1 Announce Type: new Abstract: Human reasoning often involves working over limited information to arrive at probabilistic conclusions. In its simplest form, this involves making an inference that is not strictly entailed by a premise, but rather only likely given...

1 min 1 month, 2 weeks ago

evidence

LOW Academic European Union

France or Spain or Germany or France: A Neural Account of Non-Redundant Redundant Disjunctions

arXiv:2602.23547v1 Announce Type: new Abstract: Sentences like "She will go to France or Spain, or perhaps to Germany or France." appear formally redundant, yet become acceptable in contexts such as "Mary will go to a philosophy program in France or...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

Multi-Agent Causal Reasoning for Suicide Ideation Detection Through Online Conversations

arXiv:2602.23577v1 Announce Type: new Abstract: Suicide remains a pressing global public health concern. While social media platforms offer opportunities for early risk detection through online conversation trees, existing approaches face two major limitations: (1) They rely on predefined rules (e.g.,...

1 min 1 month, 2 weeks ago

motion

LOW Academic United States

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation

arXiv:2602.23580v1 Announce Type: new Abstract: In the field of educational assessment, automated scoring systems increasingly rely on deep learning and large language models (LLMs). However, these systems face significant risks of bias amplification, where model prediction gaps between student groups...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

TRIZ-RAGNER: A Retrieval-Augmented Large Language Model for TRIZ-Aware Named Entity Recognition in Patent-Based Contradiction Mining

arXiv:2602.23656v1 Announce Type: new Abstract: TRIZ-based contradiction mining is a fundamental task in patent analysis and systematic innovation, as it enables the identification of improving and worsening technical parameters that drive inventive problem solving. However, existing approaches largely rely on...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Structured Prompt Optimization for Few-Shot Text Classification via Semantic Alignment in Latent Space

arXiv:2602.23753v1 Announce Type: new Abstract: This study addresses the issues of semantic entanglement, unclear label structure, and insufficient feature representation in few-shot text classification, and proposes an optimization framework based on structured prompts to enhance semantic understanding and task adaptation...

1 min 1 month, 2 weeks ago

standing

LOW Academic European Union

GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models

arXiv:2602.23826v1 Announce Type: new Abstract: We present GLUScope, an open-source tool for analyzing neurons in Transformer-based language models, intended for interpretability researchers. We focus on more recent models than previous tools do; specifically we consider gated activation functions such as...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

The Astonishing Ability of Large Language Models to Parse Jabberwockified Language

arXiv:2602.23928v1 Announce Type: new Abstract: We show that large language models (LLMs) have an astonishing ability to recover meaning from severely degraded English texts. Texts in which content words have been randomly substituted by nonsense strings, e.g., "At the ghybe...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

MemEmo: Evaluating Emotion in Memory Systems of Agents

arXiv:2602.23944v1 Announce Type: new Abstract: Memory systems address the challenge of context loss in Large Language Model during prolonged interactions. However, compared to human cognition, the efficacy of these systems in processing emotion-related information remains inconclusive. To address this gap,...

1 min 1 month, 2 weeks ago

motion

LOW Academic International

Dialect and Gender Bias in YouTube's Spanish Captioning System

arXiv:2602.24002v1 Announce Type: new Abstract: Spanish is the official language of twenty-one countries and is spoken by over 441 million people. Naturally, there are many variations in how Spanish is spoken across these countries. Media platforms such as YouTube rely...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

arXiv:2602.24060v1 Announce Type: new Abstract: Large language models (LLMs) with reasoning capabilities have fueled a compelling narrative that reasoning universally improves performance across language tasks. We test this claim through a comprehensive evaluation of 504 configurations across seven model families--including...

1 min 1 month, 2 weeks ago

motion

LOW Academic European Union

Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek

arXiv:2602.24119v1 Announce Type: new Abstract: This study presents the first systematic, reference-free human evaluation of large language model (LLM) machine translation (MT) for Ancient Greek (AG) technical prose. We evaluate translations by three commercial LLMs (Claude, Gemini, ChatGPT) of twenty...

1 min 1 month, 2 weeks ago

evidence

LOW Academic European Union

Serendipity with Generative AI: Repurposing knowledge components during polycrisis with a Viable Systems Model approach

arXiv:2602.23365v1 Announce Type: cross Abstract: Organisations face polycrisis uncertainty yet overlook embedded knowledge. We show how generative AI can operate as a serendipity engine and knowledge transducer to discover, classify and mobilise reusable components (models, frameworks, patterns) from existing documents....

1 min 1 month, 2 weeks ago

discovery

LOW Academic International

Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks

arXiv:2602.23898v1 Announce Type: cross Abstract: Referring Expression Comprehension (REC) links language to region level visual perception. Standard benchmarks (RefCOCO, RefCOCO+, RefCOCOg) have progressed rapidly with multimodal LLMs but remain weak tests of visual reasoning and grounding: (i) many expressions are...

1 min 1 month, 2 weeks ago

standing

LOW Academic European Union

Neural Operators Can Discover Functional Clusters

arXiv:2602.23528v1 Announce Type: new Abstract: Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering....

1 min 1 month, 2 weeks ago

evidence

LOW Academic European Union

BTTackler: A Diagnosis-based Framework for Efficient Deep Learning Hyperparameter Optimization

arXiv:2602.23630v1 Announce Type: new Abstract: Hyperparameter optimization (HPO) is known to be costly in deep learning, especially when leveraging automated approaches. Most of the existing automated HPO methods are accuracy-based, i.e., accuracy metrics are used to guide the trials of...

1 min 1 month, 2 weeks ago

trial

LOW Academic European Union

On the Convergence of Single-Loop Stochastic Bilevel Optimization with Approximate Implicit Differentiation

arXiv:2602.23633v1 Announce Type: new Abstract: Stochastic Bilevel Optimization has emerged as a fundamental framework for meta-learning and hyperparameter optimization. Despite the practical prevalence of single-loop algorithms--which update lower and upper variables concurrently--their theoretical understanding, particularly in the stochastic regime, remains...

1 min 1 month, 2 weeks ago

standing

LOW Academic United States

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

arXiv:2602.23638v1 Announce Type: new Abstract: Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Actor-Critic Pretraining for Proximal Policy Optimization

arXiv:2602.23804v1 Announce Type: new Abstract: Reinforcement learning (RL) actor-critic algorithms enable autonomous learning but often require a large number of environment interactions, which limits their applicability in robotics. Leveraging expert data can reduce the number of required environment interactions. A...

1 min 1 month, 2 weeks ago

motion

LOW Academic European Union

Hierarchical Concept-based Interpretable Models

arXiv:2602.23947v1 Announce Type: new Abstract: Modern deep neural networks remain challenging to interpret due to the opacity of their latent representations, impeding model understanding, debugging, and debiasing. Concept Embedding Models (CEMs) address this by mapping inputs to human-interpretable concept representations...

1 min 1 month, 2 weeks ago

standing

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

Thoth: Mid-Training Bridges LLMs to Time Series Understanding

M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

Certainty-Validity: A Diagnostic Framework for Discrete Commitment Systems

SEval-NAS: A Search-Agnostic Evaluation for Neural Architecture Search

LIDS: LLM Summary Inference Under the Layered Lens

OSF: On Pre-training and Scaling of Sleep Foundation Models

A medical coding language model trained on clinical narratives from a population-wide cohort of 1.8 million patients

Hereditary Geometric Meta-RL: Nonlocal Generalization via Task Symmetries

FCC chair calls Paramount/WBD merger "a lot cleaner" than defunct Netflix deal

Humans and LLMs Diverge on Probabilistic Inferences

France or Spain or Germany or France: A Neural Account of Non-Redundant Redundant Disjunctions

Multi-Agent Causal Reasoning for Suicide Ideation Detection Through Online Conversations

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation

TRIZ-RAGNER: A Retrieval-Augmented Large Language Model for TRIZ-Aware Named Entity Recognition in Patent-Based Contradiction Mining

Structured Prompt Optimization for Few-Shot Text Classification via Semantic Alignment in Latent Space

GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models

The Astonishing Ability of Large Language Models to Parse Jabberwockified Language

MemEmo: Evaluating Emotion in Memory Systems of Agents

Dialect and Gender Bias in YouTube's Spanish Captioning System

Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek

Serendipity with Generative AI: Repurposing knowledge components during polycrisis with a Viable Systems Model approach

Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks

Neural Operators Can Discover Functional Clusters

BTTackler: A Diagnosis-based Framework for Efficient Deep Learning Hyperparameter Optimization

On the Convergence of Single-Loop Stochastic Bilevel Optimization with Approximate Implicit Differentiation

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Actor-Critic Pretraining for Proximal Policy Optimization

Hierarchical Concept-based Interpretable Models

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.