Immigration Law

LOW Academic International

FTimeXer: Frequency-aware Time-series Transformer with Exogenous variables for Robust Carbon Footprint Forecasting

arXiv:2604.02347v1 Announce Type: new Abstract: Accurate and up-to-date forecasting of the power grid's carbon footprint is crucial for effective product carbon footprint (PCF) accounting and informed decarbonization decisions. However, the carbon intensity of the grid exhibits high non-stationarity, and existing...

1 min 1 week, 5 days ago

ead

LOW Academic International

Causal-Audit: A Framework for Risk Assessment of Assumption Violations in Time-Series Causal Discovery

arXiv:2604.02488v1 Announce Type: new Abstract: Time-series causal discovery methods rely on assumptions such as stationarity, regular sampling, and bounded temporal dependence. When these assumptions are violated, structure learning can produce confident but misleading causal graphs without warning. We introduce Causal-Audit,...

1 min 1 week, 5 days ago

ead

LOW Academic International

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

arXiv:2604.02355v1 Announce Type: new Abstract: Combining Chain-of-Thought (CoT) with Reinforcement Learning (RL) improves text-to-image (T2I) generation, yet the underlying interaction between CoT's exploration and RL's optimization remains unclear. We present a systematic entropy-based analysis that yields three key insights: (1)...

1 min 1 week, 5 days ago

ead

LOW Academic European Union

Learning the Signature of Memorization in Autoregressive Language Models

arXiv:2604.03199v1 Announce Type: new Abstract: All prior membership inference attacks for fine-tuned language models use hand-crafted heuristics (e.g., loss thresholding, Min-K\%, reference calibration), each bounded by the designer's intuition. We introduce the first transferable learned attack, enabled by the observation...

1 min 1 week, 5 days ago

tps

LOW Academic International

EMS: Multi-Agent Voting via Efficient Majority-then-Stopping

arXiv:2604.02863v1 Announce Type: new Abstract: Majority voting is the standard for aggregating multi-agent responses into a final decision. However, traditional methods typically require all agents to complete their reasoning before aggregation begins, leading to significant computational overhead, as many responses...

1 min 1 week, 5 days ago

ead

LOW Academic United States

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

arXiv:2604.02669v1 Announce Type: new Abstract: How biased is a language model? The answer depends on how you ask. A model that refuses to choose between castes for a leadership role will, in a fill-in-the-blank task, reliably associate upper castes with...

1 min 1 week, 5 days ago

ead

LOW Academic United Kingdom

Dynamical structure of vanishing gradient and overfitting in multi-layer perceptrons

arXiv:2604.02393v1 Announce Type: new Abstract: Vanishing gradient and overfitting are two of the most extensively studied problems in the literature about machine learning. However, they are frequently considered in some asymptotic setting, which obscure the underlying dynamical mechanisms responsible for...

1 min 1 week, 5 days ago

ead

LOW Academic International

Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language Models

arXiv:2604.02485v1 Announce Type: new Abstract: Confirmation bias, the tendency to seek evidence that supports rather than challenges one's belief, hinders one's reasoning ability. We examine whether large language models (LLMs) exhibit confirmation bias by adapting the rule-discovery study from human...

1 min 1 week, 5 days ago

ead

LOW Academic United States

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking

arXiv:2604.02971v1 Announce Type: new Abstract: Recent agentic search systems have made substantial progress by emphasising deep, multi-step reasoning. However, this focus often overlooks the challenges of wide-scale information synthesis, where agents must aggregate large volumes of heterogeneous evidence across many...

1 min 1 week, 5 days ago

tps

LOW Academic International

Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization

arXiv:2604.02528v1 Announce Type: new Abstract: The new Specifications for the National Bridge Inventory (SNBI), in effect from 2022, emphasize the use of element-level condition states (CS) for risk-based bridge management. Instead of a general component rating, element-level condition data use...

1 min 1 week, 5 days ago

ead

LOW Academic United States

StoryScope: Investigating idiosyncrasies in AI fiction

arXiv:2604.03136v1 Announce Type: new Abstract: As AI-generated fiction becomes increasingly prevalent, questions of authorship and originality are becoming central to how written work is evaluated. While most existing work in this space focuses on identifying surface-level signatures of AI writing,...

1 min 1 week, 5 days ago

ead

LOW Academic European Union

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

arXiv:2604.02651v1 Announce Type: new Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world scenarios. Learning from extremely large graphs requires distributed training, and mini-batching with sampling is a popular approach for parallelizing...

1 min 1 week, 5 days ago

ead

LOW Academic International

Improving Role Consistency in Multi-Agent Collaboration via Quantitative Role Clarity

arXiv:2604.02770v1 Announce Type: new Abstract: In large language model (LLM)-driven multi-agent systems, disobey role specification (failure to adhere to the defined responsibilities and constraints of an assigned role, potentially leading to an agent behaving like another) is a major failure...

1 min 1 week, 5 days ago

ead

LOW Academic International

Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits

arXiv:2604.02527v1 Announce Type: new Abstract: The recent advancement of Large Language Models (LLMs) offers new opportunities to generate user preference data to warm-start bandits. Recent studies on contextual bandits with LLM initialization (CBLI) have shown that these synthetic priors can...

1 min 1 week, 5 days ago

ead

LOW Academic United States

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

arXiv:2604.02338v1 Announce Type: new Abstract: MoE-PEFT methods combine Mixture of Experts with parameter-efficient fine-tuning for multi-task adaptation, but require separate adapters per expert causing trainable parameters to scale linearly with expert count and limiting applicability to adapter-based architectures. We propose...

1 min 1 week, 5 days ago

ead

LOW Academic International

Let's Have a Conversation: Designing and Evaluating LLM Agents for Interactive Optimization

arXiv:2604.02666v1 Announce Type: new Abstract: Optimization is as much about modeling the right problem as solving it. Identifying the right objectives, constraints, and trade-offs demands extensive interaction between researchers and stakeholders. Large language models can empower decision-makers with optimization capabilities...

1 min 1 week, 5 days ago

ead

LOW Academic United States

ROMAN: A Multiscale Routing Operator for Convolutional Time Series Models

arXiv:2604.02577v1 Announce Type: new Abstract: We introduce ROMAN (ROuting Multiscale representAtioN), a deterministic operator for time series that maps temporal scale and coarse temporal position into an explicit channel structure while reducing sequence length. ROMAN builds an anti-aliased multiscale pyramid,...

1 min 1 week, 5 days ago

tps

LOW Academic United States

Compositional Neuro-Symbolic Reasoning

arXiv:2604.02434v1 Announce Type: new Abstract: We study structured abstraction-based reasoning for the Abstraction and Reasoning Corpus (ARC) and compare its generalization to test-time approaches. Purely neural architectures lack reliable combinatorial generalization, while strictly symbolic systems struggle with perceptual grounding. We...

1 min 1 week, 5 days ago

tps

LOW Academic European Union

Student-in-the-Loop Chain-of-Thought Distillation via Generation-Time Selection

arXiv:2604.02819v1 Announce Type: new Abstract: Large reasoning models achieve strong performance on complex tasks through long chain-of-thought (CoT) trajectories, but directly transferring such reasoning processes to smaller models remains challenging. A key difficulty is that not all teacher-generated reasoning trajectories...

1 min 1 week, 5 days ago

ead

LOW Academic International

Revealing the Learning Dynamics of Long-Context Continual Pre-training

arXiv:2604.02650v1 Announce Type: new Abstract: Existing studies on Long-Context Continual Pre-training (LCCP) mainly focus on small-scale models and limited data regimes (tens of billions of tokens). We argue that directly migrating these small-scale settings to industrial-grade models risks insufficient adaptation...

1 min 1 week, 5 days ago

ead

LOW Academic International

FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models

arXiv:2604.02967v1 Announce Type: new Abstract: Recent Large Reasoning Models (LRMs) like DeepSeek-R1 have demonstrated remarkable success in complex reasoning tasks, exhibiting human-like patterns in exploring multiple alternative solutions. Upon closer inspection, however, we uncover a surprising phenomenon: The First is...

1 min 1 week, 5 days ago

ead

LOW Academic International

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

arXiv:2604.02947v1 Announce Type: new Abstract: Computer-use agents extend language models from text generation to persistent action over tools, files, and execution environments. Unlike chat systems, they maintain state across interactions and translate intermediate outputs into concrete actions. This creates a...

1 min 1 week, 5 days ago

ead

LOW Academic International

SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models

arXiv:2604.02660v1 Announce Type: new Abstract: As Large Language Models (LLMs) increasingly power decision-making systems across critical domains, understanding and mitigating their biases becomes essential for responsible AI deployment. Although bias assessment frameworks have proliferated for attributes such as race and...

1 min 1 week, 5 days ago

ead

LOW Academic International

A Multi-head-based architecture for effective morphological tagging in Russian with open dictionary

arXiv:2604.02926v1 Announce Type: new Abstract: The article proposes a new architecture based on Multi-head attention to solve the problem of morphological tagging for the Russian language. The preprocessing of the word vectors includes splitting the words into subtokens, followed by...

1 min 1 week, 5 days ago

ead

LOW Academic International

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

arXiv:2604.02580v1 Announce Type: new Abstract: Evaluating code generation models for 3D spatial reasoning requires executing generated code in realistic environments and assessing outputs beyond surface-level correctness. We introduce a platform VoxelCode, for analyzing code generation capabilities for 3D understanding and...

1 min 1 week, 5 days ago

ead

LOW Academic International

An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code

arXiv:2604.02352v1 Announce Type: cross Abstract: Although LLMs are capable of generating functionally correct code, they also tend to produce less energy-efficient code in comparison to human-written solutions. As these inefficiencies lead to higher computational overhead, they are in direct conflict...

1 min 1 week, 5 days ago

ead

LOW Academic European Union

A Comprehensive Framework for Long-Term Resiliency Investment Planning under Extreme Weather Uncertainty for Electric Utilities

arXiv:2604.02504v1 Announce Type: new Abstract: Electric utilities must make massive capital investments in the coming years to respond to explosive growth in demand, aging assets and rising threats from extreme weather. Utilities today already have rigorous frameworks for capital planning,...

1 min 1 week, 5 days ago

ead

LOW Academic United States

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers

arXiv:2604.02344v1 Announce Type: new Abstract: WebGPU's security-focused design imposes per-operation validation that compounds across the many small dispatches in neural network inference, yet the true cost of this overhead is poorly characterized. We present a systematic characterization of WebGPU dispatch...

1 min 1 week, 5 days ago

ead

LOW Academic International

Fast NF4 Dequantization Kernels for Large Language Model Inference

arXiv:2604.02556v1 Announce Type: new Abstract: Large language models (LLMs) have grown beyond the memory capacity of single GPU devices, necessitating quantization techniques for practical deployment. While NF4 (4-bit NormalFloat) quantization enables 4$\times$ memory reduction, inference on current NVIDIA GPUs (e.g.,...

1 min 1 week, 5 days ago

ead

LOW Academic International

Conditional Sampling via Wasserstein Autoencoders and Triangular Transport

arXiv:2604.02644v1 Announce Type: new Abstract: We present Conditional Wasserstein Autoencoders (CWAEs), a framework for conditional simulation that exploits low-dimensional structure in both the conditioned and the conditioning variables. The key idea is to modify a Wasserstein autoencoder to use a...

1 min 1 week, 5 days ago

ead

FTimeXer: Frequency-aware Time-series Transformer with Exogenous variables for Robust Carbon Footprint Forecasting

Causal-Audit: A Framework for Risk Assessment of Assumption Violations in Time-Series Causal Discovery

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

Learning the Signature of Memorization in Autoregressive Language Models

EMS: Multi-Agent Voting via Efficient Majority-then-Stopping

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

Dynamical structure of vanishing gradient and overfitting in multi-layer perceptrons

Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language Models

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking

Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization

StoryScope: Investigating idiosyncrasies in AI fiction

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

Improving Role Consistency in Multi-Agent Collaboration via Quantitative Role Clarity

Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

Let's Have a Conversation: Designing and Evaluating LLM Agents for Interactive Optimization

ROMAN: A Multiscale Routing Operator for Convolutional Time Series Models

Compositional Neuro-Symbolic Reasoning

Student-in-the-Loop Chain-of-Thought Distillation via Generation-Time Selection

Revealing the Learning Dynamics of Long-Context Continual Pre-training

FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models

A Multi-head-based architecture for effective morphological tagging in Russian with open dictionary

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code

A Comprehensive Framework for Long-Term Resiliency Investment Planning under Extreme Weather Uncertainty for Electric Utilities

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers

Fast NF4 Dequantization Kernels for Large Language Model Inference

Conditional Sampling via Wasserstein Autoencoders and Triangular Transport

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.