All Practice Areas

International Law

국제법

Jurisdiction: All US KR EU Intl
LOW Academic International

UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking

arXiv:2602.23734v1 Announce Type: cross Abstract: One-stream Transformer-based trackers achieve advanced performance in visual object tracking but suffer from significant computational overhead that hinders real-time deployment. While token pruning offers a path to efficiency, existing methods are fragmented. They typically prune...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

arXiv:2602.23866v1 Announce Type: cross Abstract: Software engineering agents (SWE) are improving rapidly, with recent gains largely driven by reinforcement learning (RL). However, RL training is constrained by the scarcity of large-scale task collections with reproducible execution environments and reliable test...

1 min 1 month, 3 weeks ago
ear
LOW Academic United States

Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

arXiv:2602.24009v1 Announce Type: cross Abstract: Jailbreak techniques for large language models (LLMs) evolve faster than benchmarks, making robustness estimates stale and difficult to compare across papers due to drift in datasets, harnesses, and judging protocols. We introduce JAILBREAK FOUNDRY (JBF),...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models

arXiv:2602.24040v1 Announce Type: cross Abstract: Reward models are central to aligning large language models (LLMs) with human preferences. Yet most approaches rely on pointwise reward estimates that overlook the epistemic uncertainty in reward models arising from limited human feedback. Recent...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Detoxifying LLMs via Representation Erasure-Based Preference Optimization

arXiv:2602.23391v1 Announce Type: new Abstract: Large language models (LLMs) trained on webscale data can produce toxic outputs, raising concerns for safe deployment. Prior defenses, based on applications of DPO, NPO, and similar algorithms, reduce the likelihood of harmful continuations, but...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

U-CAN: Utility-Aware Contrastive Attenuation for Efficient Unlearning in Generative Recommendation

arXiv:2602.23400v1 Announce Type: new Abstract: Generative Recommendation (GenRec) typically leverages Large Language Models (LLMs) to redefine personalization as an instruction-driven sequence generation task. However, fine-tuning on user logs inadvertently encodes sensitive attributes into model parameters, raising critical privacy concerns. Existing...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Global Interpretability via Automated Preprocessing: A Framework Inspired by Psychiatric Questionnaires

arXiv:2602.23459v1 Announce Type: new Abstract: Psychiatric questionnaires are highly context sensitive and often only weakly predict subsequent symptom severity, which makes the prognostic relationship difficult to learn. Although flexible nonlinear models can improve predictive accuracy, their limited interpretability can erode...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Uncertainty-aware Language Guidance for Concept Bottleneck Models

arXiv:2602.23495v1 Announce Type: new Abstract: Concept Bottleneck Models (CBMs) provide inherent interpretability by first mapping input samples to high-level semantic concepts, followed by a combination of these concepts for the final classification. However, the annotation of human-understandable concepts requires extensive...

1 min 1 month, 3 weeks ago
ear
LOW Academic United States

FedDAG: Clustered Federated Learning via Global Data and Gradient Integration for Heterogeneous Environments

arXiv:2602.23504v1 Announce Type: new Abstract: Federated Learning (FL) enables a group of clients to collaboratively train a model without sharing individual data, but its performance drops when client data are heterogeneous. Clustered FL tackles this by grouping similar clients. However,...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Sample Size Calculations for Developing Clinical Prediction Models: Overview and pmsims R package

arXiv:2602.23507v1 Announce Type: new Abstract: Background: Clinical prediction models are increasingly used to inform healthcare decisions, but determining the minimum sample size for their development remains a critical and unresolved challenge. Inadequate sample sizes can lead to overfitting, poor generalisability,...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Neural Operators Can Discover Functional Clusters

arXiv:2602.23528v1 Announce Type: new Abstract: Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering....

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning

arXiv:2602.23529v1 Announce Type: new Abstract: Subadditive set functions play a pivotal role in computational economics (especially in combinatorial auctions), combinatorial optimization or artificial intelligence applications such as interpretable machine learning. However, specifying a set function requires assigning values to an...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex's neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Dynamics of Learning under User Choice: Overspecialization and Peer-Model Probing

arXiv:2602.23565v1 Announce Type: new Abstract: In many economically relevant contexts where machine learning is deployed, multiple platforms obtain data from the same pool of users, each of whom selects the platform that best serves them. Prior work in this setting...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Flowette: Flow Matching with Graphette Priors for Graph Generation

arXiv:2602.23566v1 Announce Type: new Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework, that employs a graph neural network based transformer to learn a velocity field defined over graph representations...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Hybrid Quantum Temporal Convolutional Networks

arXiv:2602.23578v1 Announce Type: new Abstract: Quantum machine learning models for sequential data face scalability challenges with complex multivariate signals. We introduce the Hybrid Quantum Temporal Convolutional Network (HQTCN), which combines classical temporal windowing with a quantum convolutional neural network core....

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

Normalisation and Initialisation Strategies for Graph Neural Networks in Blockchain Anomaly Detection

arXiv:2602.23599v1 Announce Type: new Abstract: Graph neural networks (GNNs) offer a principled approach to financial fraud detection by jointly learning from node features and transaction graph topology. However, their effectiveness on real-world anti-money laundering (AML) benchmarks depends critically on training...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

When Does Multimodal Learning Help in Healthcare? A Benchmark on EHR and Chest X-Ray Fusion

arXiv:2602.23614v1 Announce Type: new Abstract: Machine learning holds promise for advancing clinical decision support, yet it remains unclear when multimodal learning truly helps in practice, particularly under modality missingness and fairness constraints. In this work, we conduct a systematic benchmark...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

BTTackler: A Diagnosis-based Framework for Efficient Deep Learning Hyperparameter Optimization

arXiv:2602.23630v1 Announce Type: new Abstract: Hyperparameter optimization (HPO) is known to be costly in deep learning, especially when leveraging automated approaches. Most of the existing automated HPO methods are accuracy-based, i.e., accuracy metrics are used to guide the trials of...

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

On the Convergence of Single-Loop Stochastic Bilevel Optimization with Approximate Implicit Differentiation

arXiv:2602.23633v1 Announce Type: new Abstract: Stochastic Bilevel Optimization has emerged as a fundamental framework for meta-learning and hyperparameter optimization. Despite the practical prevalence of single-loop algorithms--which update lower and upper variables concurrently--their theoretical understanding, particularly in the stochastic regime, remains...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Selective Denoising Diffusion Model for Time Series Anomaly Detection

arXiv:2602.23662v1 Announce Type: new Abstract: Time series anomaly detection (TSAD) has been an important area of research for decades, with reconstruction-based methods, mostly based on generative models, gaining popularity and demonstrating success. Diffusion models have recently attracted attention due to...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Disentangled Mode-Specific Representations for Tensor Time Series via Contrastive Learning

arXiv:2602.23663v1 Announce Type: new Abstract: Multi-mode tensor time series (TTS) can be found in many domains, such as search engines and environmental monitoring systems. Learning representations of a TTS benefits various applications, but it is also challenging since the complexities...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training

arXiv:2602.23696v1 Announce Type: new Abstract: We study the geometry of training trajectories in small transformer models and find that parameter updates organize into a dominant drift direction with transverse residual dynamics. Using uncentered, row-normalized trajectory PCA, we show that a...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

Bridging Dynamics Gaps via Diffusion Schr\"odinger Bridge for Cross-Domain Reinforcement Learning

arXiv:2602.23737v1 Announce Type: new Abstract: Cross-domain reinforcement learning (RL) aims to learn transferable policies under dynamics shifts between source and target domains. A key challenge lies in the lack of target-domain environment interaction and reward supervision, which prevents direct policy...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

MAGE: Multi-scale Autoregressive Generation for Offline Reinforcement Learning

arXiv:2602.23770v1 Announce Type: new Abstract: Generative models have gained significant traction in offline reinforcement learning (RL) due to their ability to model complex trajectory distributions. However, existing generation-based approaches still struggle with long-horizon tasks characterized by sparse rewards. Some hierarchical...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure

arXiv:2602.23784v1 Announce Type: new Abstract: Foundation models have transformed domains from language to genomics by learning general-purpose representations from large-scale, heterogeneous data. We introduce TradeFM, a 524M-parameter generative Transformer that brings this paradigm to market microstructure, learning directly from billions...

1 min 1 month, 3 weeks ago
ear
LOW Academic United States

Provable Subspace Identification of Nonlinear Multi-view CCA

arXiv:2602.23785v1 Announce Type: new Abstract: We investigate the identifiability of nonlinear Canonical Correlation Analysis (CCA) in a multi-view setup, where each view is generated by an unknown nonlinear map applied to a linear mixture of shared latents and view-private noise....

1 min 1 month, 3 weeks ago
ear
LOW Academic European Union

UPath: Universal Planner Across Topological Heterogeneity For Grid-Based Pathfinding

arXiv:2602.23789v1 Announce Type: new Abstract: The performance of search algorithms for grid-based pathfinding, e.g. A*, critically depends on the heuristic function that is used to focus the search. Recent studies have shown that informed heuristics that take the positions/shapes of...

1 min 1 month, 3 weeks ago
ear
LOW Academic International

GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks

arXiv:2602.23795v1 Announce Type: new Abstract: Structured deep model compression methods are hardware-friendly and substantially reduce memory and inference costs. However, under aggressive compression, the resulting accuracy degradation often necessitates post-compression finetuning, which can be impractical due to missing labeled data...

1 min 1 month, 3 weeks ago
ear
LOW Academic United States

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

arXiv:2602.23798v1 Announce Type: new Abstract: Machine unlearning for large language models often faces a privacy dilemma in which strict constraints prohibit sharing either the server's parameters or the client's forget set. To address this dual non-disclosure constraint, we propose MPU,...

1 min 1 month, 3 weeks ago
ear
Previous Page 107 of 135 Next