Litigation

LOW Academic International

Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment

arXiv:2603.11388v1 Announce Type: new Abstract: Safety alignment aims to ensure that large language models (LLMs) refuse harmful requests by post-training on harmful queries paired with refusal answers. Although safety alignment is widely adopted in industry, the overrefusal problem where aligned...

1 min 1 month ago

standing

LOW Academic International

BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion

arXiv:2603.11415v1 Announce Type: new Abstract: Abstractive summarization requires models to generate summaries that convey information in the source document. While large language models can generate summaries without fine-tuning, they often miss key details and include extraneous information. We propose BLooP...

1 min 1 month ago

motion

LOW Academic International

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

arXiv:2603.11414v1 Announce Type: new Abstract: We present MaterialFigBench, a benchmark dataset designed to evaluate the ability of multimodal large language models (LLMs) to solve university-level materials science problems that require accurate interpretation of figures. Unlike existing benchmarks that primarily rely...

1 min 1 month ago

standing

LOW Academic United States

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

arXiv:2603.11214v1 Announce Type: new Abstract: We evaluate the autonomous cyber-attack capabilities of frontier AI models on two purpose-built cyber ranges-a 32-step corporate network attack and a 7-step industrial control system attack-that require chaining heterogeneous capabilities across extended action sequences. By...

1 min 1 month ago

trial

LOW Academic International

Artificial Intelligence for Sentiment Analysis of Persian Poetry

arXiv:2603.11254v1 Announce Type: new Abstract: Recent advancements of the Artificial Intelligence (AI) have led to the development of large language models (LLMs) that are capable of understanding, analysing, and creating textual data. These language models open a significant opportunity in...

1 min 1 month ago

standing

LOW Academic European Union

Automated Detection of Malignant Lesions in the Ovary Using Deep Learning Models and XAI

arXiv:2603.11818v1 Announce Type: new Abstract: The unrestrained proliferation of cells that are malignant in nature is cancer. In recent times, medical professionals are constantly acquiring enhanced diagnostic and treatment abilities by implementing deep learning models to analyze medical data for...

1 min 1 month ago

discovery

LOW Academic International

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

arXiv:2603.11076v1 Announce Type: new Abstract: Recent work synthesizes agentic tasks for post-training tool-using LLMs, yet robust generalization under shifts in tasks and toolsets remains an open challenge. We trace this brittleness to insufficient diversity in synthesized tasks. Scaling diversity is...

1 min 1 month ago

evidence

LOW Academic International

Understanding Wikidata Qualifiers: An Analysis and Taxonomy

arXiv:2603.11767v1 Announce Type: new Abstract: This paper presents an in-depth analysis of Wikidata qualifiers, focusing on their semantics and actual usage, with the aim of developing a taxonomy that addresses the challenges of selecting appropriate qualifiers, querying the graph, and...

1 min 1 month ago

standing

LOW Academic International

QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate

arXiv:2603.11650v1 Announce Type: new Abstract: The effectiveness upper bound of retrieval-augmented generation (RAG) is fundamentally constrained by the semantic integrity and information granularity of text chunks in its knowledge base. To address these challenges, this paper proposes QChunker, which restructures...

1 min 1 month ago

standing

LOW Academic International

SemBench: A Universal Semantic Framework for LLM Evaluation

arXiv:2603.11687v1 Announce Type: new Abstract: Recent progress in Natural Language Processing (NLP) has been driven by the emergence of Large Language Models (LLMs), which exhibit remarkable generative and reasoning capabilities. However, despite their success, evaluating the true semantic understanding of...

1 min 1 month ago

standing

LOW Academic International

CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?

arXiv:2603.11915v1 Announce Type: new Abstract: Theory of Mind (ToM)-the ability to reason about the mental states of oneself and others-is a cornerstone of human social intelligence. As Large Language Models (LLMs) become ubiquitous in real-world applications, validating their capacity for...

1 min 1 month ago

standing

LOW Academic International

To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times

arXiv:2603.12105v1 Announce Type: new Abstract: Large Language Models (LLMs) have recently been shown to produce estimates of psycholinguistic norms, such as valence, arousal, or concreteness, for words and multiword expressions, that correlate with human judgments. These estimates are obtained by...

1 min 1 month ago

evidence

LOW Academic International

Long-Context Encoder Models for Polish Language Understanding

arXiv:2603.12191v1 Announce Type: new Abstract: While decoder-only Large Language Models (LLMs) have recently dominated the NLP landscape, encoder-only architectures remain a cost-effective and parameter-efficient standard for discriminative tasks. However, classic encoders like BERT are limited by a short context window,...

1 min 1 month ago

standing

LOW Academic International

Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration

arXiv:2603.12226v1 Announce Type: new Abstract: Despite interdisciplinary research leading to larger and longer-term impact, most work remains confined to single-domain academic silos. Recent AI-based approaches to scientific discovery show promise for interdisciplinary research, but many prioritize rapidly designing experiments and...

1 min 1 month ago

discovery

LOW Academic International

Group Resonance Network: Learnable Prototypes and Multi-Subject Resonance for EEG Emotion Recognition

arXiv:2603.11119v1 Announce Type: new Abstract: Electroencephalography(EEG)-basedemotionrecognitionre- mains challenging in cross-subject settings due to severe inter-subject variability. Existing methods mainly learn subject-invariant features, but often under-exploit stimulus-locked group regularities shared across sub- jects. To address this issue, we propose the Group...

1 min 1 month ago

motion

LOW Academic European Union

Beyond Barren Plateaus: A Scalable Quantum Convolutional Architecture for High-Fidelity Image Classification

arXiv:2603.11131v1 Announce Type: new Abstract: While Quantum Convolutional Neural Networks (QCNNs) offer a theoretical paradigm for quantum machine learning, their practical implementation is severely bottlenecked by barren plateaus -- the exponential vanishing of gradients -- and poor empirical accuracy compared...

1 min 1 month ago

evidence

LOW Academic International

Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT

arXiv:2603.11142v1 Announce Type: new Abstract: The paper explores how video models trained for classification tasks represent nuanced, hidden semantic information that may not affect the final outcome, a key challenge for Trustworthy AI models. Through Explainable and Interpretable AI methods,...

1 min 1 month ago

evidence

LOW Academic International

Systematic Scaling Analysis of Jailbreak Attacks in Large Language Models

arXiv:2603.11149v1 Announce Type: new Abstract: Large language models remain vulnerable to jailbreak attacks, yet we still lack a systematic understanding of how jailbreak success scales with attacker effort across methods, model families, and harm types. We initiate a scaling-law framework...

1 min 1 month ago

standing

LOW Academic United Kingdom

Bayesian Optimization of Partially Known Systems using Hybrid Models

arXiv:2603.11199v1 Announce Type: new Abstract: Bayesian optimization (BO) has gained attention as an efficient algorithm for black-box optimization of expensive-to-evaluate systems, where the BO algorithm iteratively queries the system and suggests new trials based on a probabilistic model fitted to...

1 min 1 month ago

trial

LOW Academic United Kingdom

Client-Conditional Federated Learning via Local Training Data Statistics

arXiv:2603.11307v1 Announce Type: new Abstract: Federated learning (FL) under data heterogeneity remains challenging: existing methods either ignore client differences (FedAvg), require costly cluster discovery (IFCA), or maintain per-client models (Ditto). All degrade when data is sparse or heterogeneity is multi-dimensional....

1 min 1 month ago

discovery

LOW Academic European Union

ZTab: Domain-based Zero-shot Annotation for Table Columns

arXiv:2603.11436v1 Announce Type: new Abstract: This study addresses the challenge of automatically detecting semantic column types in relational tables, a key task in many real-world applications. Zero-shot modeling eliminates the need for user-provided labeled training data, making it ideal for...

1 min 1 month ago

standing

LOW Academic European Union

UniHetCO: A Unified Heterogeneous Representation for Multi-Problem Learning in Unsupervised Neural Combinatorial Optimization

arXiv:2603.11456v1 Announce Type: new Abstract: Unsupervised neural combinatorial optimization (NCO) offers an appealing alternative to supervised approaches by training learning-based solvers without ground-truth solutions, directly minimizing instance objectives and constraint violations. Yet for graph node subset-selection problems (e.g., Maximum Clique...

1 min 1 month ago

appeal

LOW Academic European Union

Slack More, Predict Better: Proximal Relaxation for Probabilistic Latent Variable Model-based Soft Sensors

arXiv:2603.11473v1 Announce Type: new Abstract: Nonlinear Probabilistic Latent Variable Models (NPLVMs) are a cornerstone of soft sensor modeling due to their capacity for uncertainty delineation. However, conventional NPLVMs are trained using amortized variational inference, where neural networks parameterize the variational...

1 min 1 month ago

trial

LOW Academic International

Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks

arXiv:2603.11487v1 Announce Type: new Abstract: Transformers often display an attention sink: probability mass concentrates on a fixed, content-agnostic position. We prove that computing a simple trigger-conditional behavior necessarily induces a sink in softmax self-attention models. Our results formalize a familiar...

1 min 1 month ago

evidence

LOW News United States

An interview with Jerry Goldman, founder of the Oyez Project

Welcome to our SCOTUS Innovators series, a new recurring column on people who have shaped our understanding of the Supreme Court. A few weeks ago, I had the opportunity to […]The postAn interview with Jerry Goldman, founder of the Oyez...

1 min 1 month ago

standing

LOW News International

Live Nation director boasted of gouging ticket buyers, "robbing them blind"

Unsealed messages add wrinkle to trial after US agreed to settle with Live Nation.

1 min 1 month ago

trial

LOW News United States

Trump's DOJ is not falling for Sam Bankman-Fried's MAGA makeover on X

SBF is still twisting facts to hide FTX crypto losses, DOJ says to block new trial.

1 min 1 month ago

trial

LOW Academic International

Evaluating Adjective-Noun Compositionality in LLMs: Functional vs Representational Perspectives

arXiv:2603.09994v1 Announce Type: cross Abstract: Compositionality is considered central to language abilities. As performant language systems, how do large language models (LLMs) do on compositional tasks? We evaluate adjective-noun compositionality in LLMs using two complementary setups: prompt-based functional assessment and...

1 min 1 month ago

standing

LOW Academic United States

Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities

arXiv:2603.10396v1 Announce Type: new Abstract: Despite the growing demand for eliciting uncertainty from large language models (LLMs), empirical evidence suggests that LLM behavior is not always adequately captured by the elicitation techniques developed under the classical probabilistic uncertainty framework. This...

1 min 1 month ago

evidence

LOW Academic United States

DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users' Views

arXiv:2603.10018v1 Announce Type: cross Abstract: As large language models (LLMs) become pervasive as assistants and thought partners, it is important to characterize their persuasive influence on users' beliefs. However, a central challenge is to distinguish "beneficial" from "harmful" forms of...

1 min 1 month ago

evidence

Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment

BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

Artificial Intelligence for Sentiment Analysis of Persian Poetry

Automated Detection of Malignant Lesions in the Ovary Using Deep Learning Models and XAI

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Understanding Wikidata Qualifiers: An Analysis and Taxonomy

QChunker: Learning Question-Aware Text Chunking for Domain RAG via Multi-Agent Debate

SemBench: A Universal Semantic Framework for LLM Evaluation

CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?

To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times

Long-Context Encoder Models for Polish Language Understanding

Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration

Group Resonance Network: Learnable Prototypes and Multi-Subject Resonance for EEG Emotion Recognition

Beyond Barren Plateaus: A Scalable Quantum Convolutional Architecture for High-Fidelity Image Classification

Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT

Systematic Scaling Analysis of Jailbreak Attacks in Large Language Models

Bayesian Optimization of Partially Known Systems using Hybrid Models

Client-Conditional Federated Learning via Local Training Data Statistics

ZTab: Domain-based Zero-shot Annotation for Table Columns

UniHetCO: A Unified Heterogeneous Representation for Multi-Problem Learning in Unsupervised Neural Combinatorial Optimization

Slack More, Predict Better: Proximal Relaxation for Probabilistic Latent Variable Model-based Soft Sensors

Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks

An interview with Jerry Goldman, founder of the Oyez Project

Live Nation director boasted of gouging ticket buyers, "robbing them blind"

Trump's DOJ is not falling for Sam Bankman-Fried's MAGA makeover on X

Evaluating Adjective-Noun Compositionality in LLMs: Functional vs Representational Perspectives

Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities

DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users' Views

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.