Deep Learning-Based Meat Freshness Detection with Segmentation and OOD-Aware Classification
arXiv:2603.00368v1 Announce Type: new Abstract: In this study, we present a meat freshness classification framework from Red-Green-Blue (RGB) images that supports both packaged and unpackaged meat datasets. The system classifies four in-distribution (ID) meat classes and uses an out-of-distribution (OOD)-aware...
Improving Full Waveform Inversion in Large Model Era
arXiv:2603.00377v1 Announce Type: new Abstract: Full Waveform Inversion (FWI) is a highly nonlinear and ill-posed problem that aims to recover subsurface velocity maps from surface-recorded seismic waveforms data. Existing data-driven FWI typically uses small models, as available datasets have limited...
Hereditary Geometric Meta-RL: Nonlocal Generalization via Task Symmetries
arXiv:2603.00396v1 Announce Type: new Abstract: Meta-Reinforcement Learning (Meta-RL) commonly generalizes via smoothness in the task encoding. While this enables local generalization around each training task, it requires dense coverage of the task space and leaves richer task space structure untapped....
Physics-Aware Learnability: From Set-Theoretic Independence to Operational Constraints
arXiv:2603.00417v1 Announce Type: new Abstract: Beyond binary classification, learnability can become a logically fragile notion: in EMX, even the class of all finite subsets of $[0,1]$ is learnable in some models of ZFC and not in others. We argue the...
Weight Updates as Activation Shifts: A Principled Framework for Steering
arXiv:2603.00425v1 Announce Type: new Abstract: Activation steering promises to be an extremely parameter-efficient form of adaptation, but its effectiveness depends on critical design choices -- such as intervention location and parameterization -- that currently rely on empirical heuristics rather than...
Efficient Decoder Scaling Strategy for Neural Routing Solvers
arXiv:2603.00430v1 Announce Type: new Abstract: Construction-based neural routing solvers, typically composed of an encoder and a decoder, have emerged as a promising approach for solving vehicle routing problems. While recent studies suggest that shifting parameters from the encoder to the...
Déjà vu all over again
The Relist Watch column examines cert petitions that the Supreme Court has “relisted” for its upcoming conference. A short explanation of relists is available here. The Supreme Court is continuing to […]The postDéjà vu all over againappeared first onSCOTUSblog.
Episode 41: Thinking through Rupture in International Economic Law: Views from Latin America - EJIL: The Podcast!
FCC chair calls Paramount/WBD merger "a lot cleaner" than defunct Netflix deal
FCC to review foreign debt, but Carr indicates it will be a formality.
Why AI startups are selling the same equity at two different prices
Some AI founders are using a novel valuation mechanism to manufacture unicorn status.
Alibaba’s Qwen tech lead steps down after major AI push
Reactions rippled through Alibaba's Qwen team after tech lead Junyang Lin stepped down following a major model launch.
AI companies are spending millions to thwart this former tech exec’s congressional bid
A tech billionaire-backed super PAC is spending $125 million to undercut candidates pushing for AI regulation. New York's Alex Bores, a former tech executive himself, is one of them.
Claude Code rolls out a voice mode capability
Anthropic is stepping up its game in the AI coding space with the rollout of Voice Mode in Claude Code.
X says it will suspend creators from revenue-sharing program for unlabeled AI posts of ‘armed conflict’
Creators who break the rules will get a three-month suspension, and if they continue to violate the policy, they'll be permanently banned.
France or Spain or Germany or France: A Neural Account of Non-Redundant Redundant Disjunctions
arXiv:2602.23547v1 Announce Type: new Abstract: Sentences like "She will go to France or Spain, or perhaps to Germany or France." appear formally redundant, yet become acceptable in contexts such as "Mary will go to a philosophy program in France or...
Structured Prompt Optimization for Few-Shot Text Classification via Semantic Alignment in Latent Space
arXiv:2602.23753v1 Announce Type: new Abstract: This study addresses the issues of semantic entanglement, unclear label structure, and insufficient feature representation in few-shot text classification, and proposes an optimization framework based on structured prompts to enhance semantic understanding and task adaptation...
GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models
arXiv:2602.23826v1 Announce Type: new Abstract: We present GLUScope, an open-source tool for analyzing neurons in Transformer-based language models, intended for interpretability researchers. We focus on more recent models than previous tools do; specifically we consider gated activation functions such as...
Benchmarking BERT-based Models for Sentence-level Topic Classification in Nepali Language
arXiv:2602.23940v1 Announce Type: new Abstract: Transformer-based models such as BERT have significantly advanced Natural Language Processing (NLP) across many languages. However, Nepali, a low-resource language written in Devanagari script, remains relatively underexplored. This study benchmarks multilingual, Indic, Hindi, and Nepali...
EDDA-Coordinata: An Annotated Dataset of Historical Geographic Coordinates
arXiv:2602.23941v1 Announce Type: new Abstract: This paper introduces a dataset of enriched geographic coordinates retrieved from Diderot and d'Alembert's eighteenth-century Encyclopedie. Automatically recovering geographic coordinates from historical texts is a complex task, as they are expressed in a variety of...
MemEmo: Evaluating Emotion in Memory Systems of Agents
arXiv:2602.23944v1 Announce Type: new Abstract: Memory systems address the challenge of context loss in Large Language Model during prolonged interactions. However, compared to human cognition, the efficacy of these systems in processing emotion-related information remains inconclusive. To address this gap,...
The GRADIEND Python Package: An End-to-End System for Gradient-Based Feature Learning
arXiv:2602.23993v1 Announce Type: new Abstract: We present gradiend, an open-source Python package that operationalizes the GRADIEND method for learning feature directions from factual-counterfactual MLM and CLM gradients in language models. The package provides a unified workflow for feature-related data creation,...
Task-Centric Acceleration of Small-Language Models
arXiv:2602.24174v1 Announce Type: new Abstract: Small language models (SLMs) have emerged as efficient alternatives to large language models for task-specific applications. However, they are often employed in high-volume, low-latency settings, where efficiency is crucial. We propose TASC, Task-Adaptive Sequence Compression,...
Controllable Reasoning Models Are Private Thinkers
arXiv:2602.24210v1 Announce Type: new Abstract: AI agents powered by reasoning models require access to sensitive user data. However, their reasoning traces are difficult to control, which can result in the unintended leakage of private information to external parties. We propose...
NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection
arXiv:2602.23863v1 Announce Type: cross Abstract: With the aim of detecting AI-generated images and identifying the specific models responsible for their generation, we propose a multi-modal multi-task model. The model leverages pre-trained BERT and CLIP Vision encoders for text and image...
Global Interpretability via Automated Preprocessing: A Framework Inspired by Psychiatric Questionnaires
arXiv:2602.23459v1 Announce Type: new Abstract: Psychiatric questionnaires are highly context sensitive and often only weakly predict subsequent symptom severity, which makes the prognostic relationship difficult to learn. Although flexible nonlinear models can improve predictive accuracy, their limited interpretability can erode...
FedDAG: Clustered Federated Learning via Global Data and Gradient Integration for Heterogeneous Environments
arXiv:2602.23504v1 Announce Type: new Abstract: Federated Learning (FL) enables a group of clients to collaboratively train a model without sharing individual data, but its performance drops when client data are heterogeneous. Clustered FL tackles this by grouping similar clients. However,...
Neural Operators Can Discover Functional Clusters
arXiv:2602.23528v1 Announce Type: new Abstract: Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering....
SDMixer: Sparse Dual-Mixer for Time Series Forecasting
arXiv:2602.23581v1 Announce Type: new Abstract: Multivariate time series forecasting is widely applied in fields such as transportation, energy, and finance. However, the data commonly suffers from issues of multi-scale characteristics, weak correlations, and noise interference, which limit the predictive performance...
FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA
arXiv:2602.23638v1 Announce Type: new Abstract: Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local...
Selective Denoising Diffusion Model for Time Series Anomaly Detection
arXiv:2602.23662v1 Announce Type: new Abstract: Time series anomaly detection (TSAD) has been an important area of research for decades, with reconstruction-based methods, mostly based on generative models, gaining popularity and demonstrating success. Diffusion models have recently attracted attention due to...