Immigration Law

LOW Academic International

SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

arXiv:2603.20854v1 Announce Type: new Abstract: Kazakh, a Turkic language spoken by over 22 million people, remains underserved by existing multilingual language models, which allocate minimal capacity to low-resource languages and employ tokenizers ill-suited to agglutinative morphology. We present SozKZ, a...

1 min 4 weeks ago

ead

LOW Academic International

NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation

arXiv:2603.20884v1 Announce Type: new Abstract: The exponential growth of academic publications has led to a surge in papers of varying quality, increasing the cost of paper screening. Current approaches either use novelty assessment within general AI Reviewers or repurpose DeepResearch,...

1 min 4 weeks ago

tps

LOW Academic International

Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach

arXiv:2603.20899v1 Announce Type: new Abstract: Large language models exhibit strong reasoning capabilities, yet often rely on shortcuts such as surface pattern matching and answer memorization rather than genuine logical inference. We propose Shortcut-Aware Reasoning Training (SART), a gradient-aware framework that...

1 min 4 weeks ago

tps

LOW Academic International

User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

arXiv:2603.20939v1 Announce Type: new Abstract: Large language models are increasingly used as personal assistants, yet most lack a persistent user model, forcing users to repeatedly restate preferences across sessions. We propose Vector-Adapted Retrieval Scoring (VARS), a pipeline-agnostic, frozen-backbone framework that...

1 min 4 weeks ago

tps

LOW Academic International

Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO

arXiv:2603.21016v1 Announce Type: new Abstract: Large language models (LLMs) used for multiple-choice and pairwise evaluation tasks often exhibit selection bias due to non-semantic factors like option positions and label symbols. Existing inference-time debiasing is costly and may harm reasoning, while...

1 min 4 weeks ago

tps

LOW Academic International

JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction

arXiv:2603.20266v1 Announce Type: new Abstract: Despite the rapid advancements in Artificial Intelligence (AI), Stochastic Differential Equations (SDEs) remain the gold-standard formalism for modeling systems under uncertainty. However, applying SDEs in practice is fraught with challenges: modeling risk is high, calibration...

1 min 4 weeks ago

ead

LOW Academic European Union

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

arXiv:2603.20315v1 Announce Type: new Abstract: (a) Many air quality forecasting studies report gains from machine learning, but evaluations often use static chronological splits and omit persistence baselines, so the operational added value under routine updating is unclear. (b) Using 2,350...

1 min 4 weeks ago

ead

LOW Academic International

Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX

arXiv:2603.20335v1 Announce Type: new Abstract: The Interest Public Group ARRONAX's C70XP cyclotron, used for radioisotope production for medical and research applications, relies on complex and costly systems that are prone to failures, leading to operational disruptions. In this context, this...

1 min 4 weeks ago

ead

LOW Academic International

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

arXiv:2603.20397v1 Announce Type: new Abstract: The key-value (KV) cache is a foundational optimization in Transformer-based large language models (LLMs), eliminating redundant recomputation of past token representations during autoregressive generation. However, its memory footprint scales linearly with context length, imposing critical...

1 min 4 weeks ago

ead

LOW Academic European Union

SDE-Driven Spatio-Temporal Hypergraph Neural Networks for Irregular Longitudinal fMRI Connectome Modeling in Alzheimer's Disease

arXiv:2603.20452v1 Announce Type: new Abstract: Longitudinal neuroimaging is essential for modeling disease progression in Alzheimer's disease (AD), yet irregular sampling and missing visits pose substantial challenges for learning reliable temporal representations. To address this challenge, we propose SDE-HGNN, a stochastic...

1 min 4 weeks ago

tps

LOW Academic International

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

arXiv:2603.20492v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse applications, yet their deployment remains challenging due to substantial computational costs, memory requirements, and energy consumption. Recent empirical studies have demonstrated that no single efficiency...

1 min 4 weeks ago

ead

LOW Academic European Union

RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization

arXiv:2603.20527v1 Announce Type: new Abstract: Preconditioned adaptive methods have gained significant attention for training deep neural networks, as they capture rich curvature information of the loss landscape . The central challenge in this field lies in balancing preconditioning effectiveness with...

1 min 4 weeks ago

tps

LOW Academic International

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv:2603.20586v1 Announce Type: new Abstract: As long-context language modeling becomes increasingly important, the cost of maintaining and attending to large Key/Value (KV) caches grows rapidly, becoming a major bottleneck in both training and inference. While prior works such as Multi-Query...

1 min 4 weeks ago

ead

LOW Academic European Union

Neural collapse in the orthoplex regime

arXiv:2603.20587v1 Announce Type: new Abstract: When training a neural network for classification, the feature vectors of the training set are known to collapse to the vertices of a regular simplex, provided the dimension $d$ of the feature space and the...

1 min 4 weeks ago

ead

LOW Academic International

Beyond Token Eviction: Mixed-Dimension Budget Allocation for Efficient KV Cache Compression

arXiv:2603.20616v1 Announce Type: new Abstract: Key-value (KV) caching is widely used to accelerate transformer inference, but its memory cost grows linearly with input length, limiting long-context deployment. Existing token eviction methods reduce memory by discarding less important tokens, which can...

1 min 4 weeks ago

ead

LOW Academic European Union

Diffusion Model for Manifold Data: Score Decomposition, Curvature, and Statistical Complexity

arXiv:2603.20645v1 Announce Type: new Abstract: Diffusion models have become a leading framework in generative modeling, yet their theoretical understanding -- especially for high-dimensional data concentrated on low-dimensional structures -- remains incomplete. This paper investigates how diffusion models learn such structured...

1 min 4 weeks ago

ead

LOW Academic International

Breaking the $O(\sqrt{T})$ Cumulative Constraint Violation Barrier while Achieving $O(\sqrt{T})$ Static Regret in Constrained Online Convex Optimization

arXiv:2603.20671v1 Announce Type: new Abstract: The problem of constrained online convex optimization is considered, where at each round, once a learner commits to an action $x_t \in \mathcal{X} \subset \mathbb{R}^d$, a convex loss function $f_t$ and a convex constraint function...

1 min 4 weeks ago

ead

LOW Academic International

Centrality-Based Pruning for Efficient Echo State Networks

arXiv:2603.20684v1 Announce Type: new Abstract: Echo State Networks (ESNs) are a reservoir computing framework widely used for nonlinear time-series prediction. However, despite their effectiveness, the randomly initialized reservoir often contains redundant nodes, leading to unnecessary computational overhead and reduced efficiency....

1 min 4 weeks ago

ead

LOW Academic European Union

Neuronal Self-Adaptation Enhances Capacity and Robustness of Representation in Spiking Neural Networks

arXiv:2603.20687v1 Announce Type: new Abstract: Spiking Neural Networks (SNNs) are promising for energy-efficient, real-time edge computing, yet their performance is often constrained by the limited adaptability of conventional leaky integrate-and-fire (LIF) neurons. Existing LIF models struggle with restricted information capacity...

1 min 4 weeks ago

ead

LOW News United States

Court appears ready to overturn state law allowing for late-arriving mail-in ballots

The Supreme Court on Monday appeared ready to overturn a Mississippi law that allows mail-in ballots to be counted as long as they are postmarked by, and then received within […]The postCourt appears ready to overturn state law allowing for...

1 min 4 weeks ago

ead

LOW News United States

SCOTUStoday for Monday, March 23

Good morning, and welcome to the March argument session, which includes the argument on birthright citizenship on Wednesday, April 1. This Thursday, March 26, SCOTUSblog is teaming up with Briefly […]The postSCOTUStoday for Monday, March 23appeared first onSCOTUSblog.

1 min 4 weeks ago

citizenship

LOW News International

Littlebird raises $11M for its AI-assisted ‘recall’ tool that reads your computer screen

Littlebird is building an AI that reads your screen in real time to capture context, answer questions, and automate tasks, without relying on screenshots.

1 min 4 weeks ago

ead

LOW Academic International

ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

arXiv:2603.19515v1 Announce Type: new Abstract: Large language models (LLMs) with advanced cognitive capabilities are emerging as agents for various reasoning and planning tasks. Traditional evaluations often focus on specific reasoning or planning questions within controlled environments. Recent studies have explored...

1 min 4 weeks, 1 day ago

tps

LOW Academic European Union

HATL: Hierarchical Adaptive-Transfer Learning Framework for Sign Language Machine Translation

arXiv:2603.19260v1 Announce Type: cross Abstract: Sign Language Machine Translation (SLMT) aims to bridge communication between Deaf and hearing individuals. However, its progress is constrained by scarce datasets, limited signer diversity, and large domain gaps between sign motion patterns and pretrained...

1 min 4 weeks, 1 day ago

ead

LOW Academic United States

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

arXiv:2603.19685v1 Announce Type: new Abstract: Large language model (LLM)-based agents have emerged as powerful autonomous controllers for digital environments, including mobile interfaces, operating systems, and web browsers. Web navigation, for example, requires handling dynamic content and long sequences of actions,...

1 min 4 weeks, 1 day ago

ead

LOW Academic International

When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)

arXiv:2603.19429v1 Announce Type: new Abstract: Classical planning problems are typically defined using lifted first-order representations, which offer compactness and generality. While most planners ground these representations to simplify reasoning, this can cause an exponential blowup in size. Recent approaches instead...

1 min 4 weeks, 1 day ago

ead

LOW Academic International

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

arXiv:2603.19579v1 Announce Type: new Abstract: Multi-objective reinforcement learning (MORL) provides an effective solution for decision-making problems involving conflicting objectives. However, achieving high-quality approximations to the Pareto policy set remains challenging, especially in complex tasks with continuous or high-dimensional state-action space....

1 min 4 weeks, 1 day ago

ead

LOW Academic International

Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

arXiv:2603.20170v1 Announce Type: new Abstract: Theory of Mind (ToM) reasoning with Large Language Models (LLMs) requires inferring how people's implicit, evolving beliefs shape what they seek and how they act under uncertainty -- especially in high-stakes settings such as disaster...

1 min 4 weeks, 1 day ago

tps

LOW Academic International

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

arXiv:2603.20046v1 Announce Type: new Abstract: Reinforcement Learning (RL) with rubric-based rewards has recently shown remarkable progress in enhancing general reasoning capabilities of Large Language Models (LLMs), yet still suffers from ineffective exploration confined to curent policy distribution. In fact, RL...

1 min 4 weeks, 1 day ago

tps

LOW Academic European Union

Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

arXiv:2603.19264v1 Announce Type: cross Abstract: With the widespread adoption of pre-trained Large Language Models (LLM), there exists a high demand for task-specific test sets to benchmark their performance in domains such as healthcare and biomedicine. However, the cost of labeling...

1 min 4 weeks, 1 day ago

ead

SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation

Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach

User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO

JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX

KV Cache Optimization Strategies for Scalable and Efficient LLM Inference

SDE-Driven Spatio-Temporal Hypergraph Neural Networks for Irregular Longitudinal fMRI Connectome Modeling in Alzheimer's Disease

AE-LLM: Adaptive Efficiency Optimization for Large Language Models

RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

Neural collapse in the orthoplex regime

Beyond Token Eviction: Mixed-Dimension Budget Allocation for Efficient KV Cache Compression

Diffusion Model for Manifold Data: Score Decomposition, Curvature, and Statistical Complexity

Breaking the $O(\sqrt{T})$ Cumulative Constraint Violation Barrier while Achieving $O(\sqrt{T})$ Static Regret in Constrained Online Convex Optimization

Centrality-Based Pruning for Efficient Echo State Networks

Neuronal Self-Adaptation Enhances Capacity and Robustness of Representation in Spiking Neural Networks

Court appears ready to overturn state law allowing for late-arriving mail-in ballots

SCOTUStoday for Monday, March 23

Littlebird raises $11M for its AI-assisted ‘recall’ tool that reads your computer screen

ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models

HATL: Hierarchical Adaptive-Transfer Learning Framework for Sign Language Machine Translation

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

Learning Dynamic Belief Graphs for Theory-of-mind Reasoning

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.