International Law

LOW Academic United States

Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models

arXiv:2602.18171v1 Announce Type: new Abstract: Clickbait headlines degrade the quality of online information and undermine user trust. We present a hybrid approach to clickbait detection that combines transformer-based text embeddings with linguistically motivated informativeness features. Using natural language processing techniques,...

1 min 2 months ago

ear

LOW Academic International

Simplifying Outcomes of Language Model Component Analyses with ELIA

arXiv:2602.18262v1 Announce Type: new Abstract: While mechanistic interpretability has developed powerful tools to analyze the internal workings of Large Language Models (LLMs), their complexity has created an accessibility gap, limiting their use to specialists. We address this challenge by designing,...

1 min 2 months ago

ear

LOW Academic International

PsihoRo: Depression and Anxiety Romanian Text Corpus

arXiv:2602.18324v1 Announce Type: new Abstract: Psychological corpora in NLP are collections of texts used to analyze human psychology, emotions, and mental health. These texts allow researchers to study psychological constructs, detect mental health issues and analyze emotional language. However, mental...

1 min 2 months ago

ear

LOW Academic European Union

Predicting Contextual Informativeness for Vocabulary Learning using Deep Learning

arXiv:2602.18326v1 Announce Type: new Abstract: We describe a modern deep learning system that automatically identifies informative contextual examples (\qu{contexts}) for first language vocabulary instruction for high school student. Our paper compares three modeling approaches: (i) an unsupervised similarity-based strategy using...

1 min 2 months ago

ear

LOW Academic United Kingdom

Validating Political Position Predictions of Arguments

arXiv:2602.18351v1 Announce Type: new Abstract: Real-world knowledge representation often requires capturing subjective, continuous attributes -- such as political positions -- that conflict with pairwise validation, the widely accepted gold standard for human evaluation. We address this challenge through a dual-scale...

1 min 2 months ago

ear

LOW Academic European Union

SPQ: An Ensemble Technique for Large Language Model Compression

arXiv:2602.18420v1 Announce Type: new Abstract: This study presents an ensemble technique, SPQ (SVD-Pruning-Quantization), for large language model (LLM) compression that combines variance-retained singular value decomposition (SVD), activation-based pruning, and post-training linear quantization. Each component targets a different source of inefficiency:...

1 min 2 months ago

ear

LOW Academic International

RVR: Retrieve-Verify-Retrieve for Comprehensive Question Answering

arXiv:2602.18425v1 Announce Type: new Abstract: Comprehensively retrieving diverse documents is crucial to address queries that admit a wide range of valid answers. We introduce retrieve-verify-retrieve (RVR), a multi-round retrieval framework designed to maximize answer coverage. Initially, a retriever takes the...

1 min 2 months ago

ear

LOW Academic United States

Lost Before Translation: Social Information Transmission and Survival in AI-AI Communication

arXiv:2602.17674v1 Announce Type: cross Abstract: When AI systems summarize and relay information, they inevitably transform it. But how? We introduce an experimental paradigm based on the telephone game to study what happens when AI talks to AI. Across five studies...

1 min 2 months ago

ear

LOW Academic International

Reducing Text Bias in Synthetically Generated MCQAs for VLMs in Autonomous Driving

arXiv:2602.17677v1 Announce Type: cross Abstract: Multiple Choice Question Answering (MCQA) benchmarks are an established standard for measuring Vision Language Model (VLM) performance in driving tasks. However, we observe the known phenomenon that synthetically generated MCQAs are highly susceptible to hidden...

1 min 2 months ago

ear

LOW Academic International

LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

arXiv:2602.17681v1 Announce Type: cross Abstract: Post-training quantization (PTQ) is a widely used approach for reducing the memory and compute costs of large language models (LLMs). Recent studies have shown that applying invertible transformations to activations can significantly improve quantization robustness...

1 min 2 months ago

ear

LOW Academic United States

Bayesian Optimality of In-Context Learning with Selective State Spaces

arXiv:2602.17744v1 Announce Type: cross Abstract: We propose Bayesian optimal sequential prediction as a new principle for understanding in-context learning (ICL). Unlike interpretations framing Transformers as performing implicit gradient descent, we formalize ICL as meta-learning over latent sequence tasks. For tasks...

1 min 2 months ago

ear

LOW Academic International

ADAPT: Hybrid Prompt Optimization for LLM Feature Visualization

arXiv:2602.17867v1 Announce Type: cross Abstract: Understanding what features are encoded by learned directions in LLM activation space requires identifying inputs that strongly activate them. Feature visualization, which optimizes inputs to maximally activate a target direction, offers an alternative to costly...

1 min 2 months ago

ear

LOW Academic European Union

NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs

arXiv:2602.18008v1 Announce Type: cross Abstract: Mechanistic models encode scientific knowledge about dynamical systems and are widely used in downstream scientific and policy applications. Recent work has explored LLM-based agentic frameworks to automatically construct mechanistic models from data; however, existing problem...

1 min 2 months ago

ear

LOW Academic International

Gradient Regularization Prevents Reward Hacking in Reinforcement Learning from Human Feedback and Verifiable Rewards

arXiv:2602.18037v1 Announce Type: cross Abstract: Reinforcement Learning from Human Feedback (RLHF) or Verifiable Rewards (RLVR) are two key steps in the post-training of modern Language Models (LMs). A common problem is reward hacking, where the policy may exploit inaccuracies of...

1 min 2 months ago

ear

LOW Academic International

On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction

arXiv:2602.18301v1 Announce Type: cross Abstract: Autoregressive large language models (LLMs) generate text token-by-token, requiring n forward passes to produce a sequence of length n. Recent work, Exploring the Latent Capacity of LLMs for One-Step Text Reconstruction (Mezentsev and Oseledets), shows...

1 min 2 months ago

ear

LOW Academic European Union

On the "Induction Bias" in Sequence Models

arXiv:2602.18333v1 Announce Type: cross Abstract: Despite the remarkable practical success of transformer-based language models, recent work has raised concerns about their ability to perform state tracking. In particular, a growing body of literature has shown this limitation primarily through failures...

1 min 2 months ago

ear

LOW Academic International

Subgroups of $U(d)$ Induce Natural RNN and Transformer Architectures

arXiv:2602.18417v1 Announce Type: cross Abstract: This paper presents a direct framework for sequence models with hidden states on closed subgroups of U(d). We use a minimal axiomatic setup and derive recurrent and transformer templates from a shared skeleton in which...

1 min 2 months ago

ear

LOW Academic United States

Probabilistic NDVI Forecasting from Sparse Satellite Time Series and Weather Covariates

arXiv:2602.17683v1 Announce Type: new Abstract: Accurate short-term forecasting of vegetation dynamics is a key enabler for data-driven decision support in precision agriculture. Normalized Difference Vegetation Index (NDVI) forecasting from satellite observations, however, remains challenging due to sparse and irregular sampling...

1 min 2 months ago

ear

LOW Academic European Union

Optimal Multi-Debris Mission Planning in LEO: A Deep Reinforcement Learning Approach with Co-Elliptic Transfers and Refueling

arXiv:2602.17685v1 Announce Type: new Abstract: This paper addresses the challenge of multi target active debris removal (ADR) in Low Earth Orbit (LEO) by introducing a unified coelliptic maneuver framework that combines Hohmann transfers, safety ellipse proximity operations, and explicit refueling...

1 min 2 months ago

ear

LOW Academic International

Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

arXiv:2602.17697v1 Announce Type: new Abstract: Large Language Models (LLMs) are being increasingly used across a wide range of tasks. However, their substantial computational demands raise concerns about the energy efficiency and sustainability of both training and inference. Inference, in particular,...

1 min 2 months ago

ear

LOW Academic International

Provable Adversarial Robustness in In-Context Learning

arXiv:2602.17743v1 Announce Type: new Abstract: Large language models adapt to new tasks through in-context learning (ICL) without parameter updates. Current theoretical explanations for this capability assume test tasks are drawn from a distribution similar to that seen during pretraining. This...

1 min 2 months ago

ear

LOW Academic European Union

Multi-material Multi-physics Topology Optimization with Physics-informed Gaussian Process Priors

arXiv:2602.17783v1 Announce Type: new Abstract: Machine learning (ML) has been increasingly used for topology optimization (TO). However, most existing ML-based approaches focus on simplified benchmark problems due to their high computational cost, spectral bias, and difficulty in handling complex physics....

1 min 2 months ago

ear

LOW Academic International

Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds

arXiv:2602.17798v1 Announce Type: new Abstract: Mixture-of-Experts models rely on learned routers to assign tokens to experts, yet standard softmax gating provides no principled mechanism to control the tradeoff between sparsity and utilization. We propose Grassmannian MoE (GrMoE), a routing framework...

1 min 2 months ago

ear

LOW Academic International

Avoid What You Know: Divergent Trajectory Balance for GFlowNets

arXiv:2602.17827v1 Announce Type: new Abstract: Generative Flow Networks (GFlowNets) are a flexible family of amortized samplers trained to generate discrete and compositional objects with probability proportional to a reward function. However, learning efficiency is constrained by the model's ability to...

1 min 2 months ago

ear

LOW Academic International

Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

arXiv:2602.17829v1 Announce Type: new Abstract: Inferring causal relations in timeseries data with delayed effects is a fundamental challenge, especially when the underlying system exhibits complex dynamics that cannot be captured by simple functional mappings. Traditional approaches often fail to produce...

1 min 2 months ago

ear

LOW Academic International

MePoly: Max Entropy Polynomial Policy Optimization

arXiv:2602.17832v1 Announce Type: new Abstract: Stochastic Optimal Control provides a unified mathematical framework for solving complex decision-making problems, encompassing paradigms such as maximum entropy reinforcement learning(RL) and imitation learning(IL). However, conventional parametric policies often struggle to represent the multi-modality of...

1 min 2 months ago

ear

LOW Academic International

Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

arXiv:2602.17835v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) relies critically on selecting training data that most benefits a model's downstream performance. Gradient-based data selection methods such as TracIn and Influence Functions leverage influence to identify useful samples, but their computational...

1 min 2 months ago

ear

LOW Academic International

Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models

arXiv:2602.17846v1 Announce Type: new Abstract: Diffusion models generate high-quality samples but can also memorize training data, raising serious privacy concerns. Understanding the mechanisms governing when memorization versus generalization occurs remains an active area of research. In particular, it is unclear...

1 min 2 months ago

ear

LOW Academic European Union

Neural Prior Estimation: Learning Class Priors from Latent Representations

arXiv:2602.17853v1 Announce Type: new Abstract: Class imbalance induces systematic bias in deep neural networks by imposing a skewed effective class prior. This work introduces the Neural Prior Estimator (NPE), a framework that learns feature-conditioned log-prior estimates from latent representations. NPE...

1 min 2 months ago

ear

LOW Academic International

JAX-Privacy: A library for differentially private machine learning

arXiv:2602.17861v1 Announce Type: new Abstract: JAX-Privacy is a library designed to simplify the deployment of robust and performant mechanisms for differentially private machine learning. Guided by design principles of usability, flexibility, and efficiency, JAX-Privacy serves both researchers requiring deep customization...

1 min 2 months ago

ear

Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models

Simplifying Outcomes of Language Model Component Analyses with ELIA

PsihoRo: Depression and Anxiety Romanian Text Corpus

Predicting Contextual Informativeness for Vocabulary Learning using Deep Learning

Validating Political Position Predictions of Arguments

SPQ: An Ensemble Technique for Large Language Model Compression

RVR: Retrieve-Verify-Retrieve for Comprehensive Question Answering

Lost Before Translation: Social Information Transmission and Survival in AI-AI Communication

Reducing Text Bias in Synthetically Generated MCQAs for VLMs in Autonomous Driving

LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

Bayesian Optimality of In-Context Learning with Selective State Spaces

ADAPT: Hybrid Prompt Optimization for LLM Feature Visualization

NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs

Gradient Regularization Prevents Reward Hacking in Reinforcement Learning from Human Feedback and Verifiable Rewards

On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction

On the "Induction Bias" in Sequence Models

Subgroups of $U(d)$ Induce Natural RNN and Transformer Architectures

Probabilistic NDVI Forecasting from Sparse Satellite Time Series and Weather Covariates

Optimal Multi-Debris Mission Planning in LEO: A Deep Reinforcement Learning Approach with Co-Elliptic Transfers and Refueling

Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

Provable Adversarial Robustness in In-Context Learning

Multi-material Multi-physics Topology Optimization with Physics-informed Gaussian Process Priors

Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds

Avoid What You Know: Divergent Trajectory Balance for GFlowNets

Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

MePoly: Max Entropy Polynomial Policy Optimization

Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models

Neural Prior Estimation: Learning Class Priors from Latent Representations

JAX-Privacy: A library for differentially private machine learning

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.