NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics
arXiv:2603.16148v1 Announce Type: new Abstract: We ask whether a pure spiking backbone can learn large-scale language modeling from random initialization, without Transformer distillation. We introduce NeuronSpark, a 0.9B-parameter SNN language model trained with next-token prediction and surrogate gradients. The model...
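The truncated abstract mentions training with next-token prediction and surrogate gradients. As a point of reference only, below is a minimal, generic surrogate-gradient spike function of the kind commonly used to backpropagate through spiking neurons; the threshold value and the fast-sigmoid surrogate are assumptions, not NeuronSpark's actual implementation.

    import torch

    class SpikeFn(torch.autograd.Function):
        """Generic surrogate-gradient spike: hard threshold forward, smooth backward."""

        @staticmethod
        def forward(ctx, membrane_potential, threshold=1.0):
            ctx.save_for_backward(membrane_potential)
            ctx.threshold = threshold
            # Forward: non-differentiable Heaviside step on the membrane potential.
            return (membrane_potential >= threshold).float()

        @staticmethod
        def backward(ctx, grad_output):
            (membrane_potential,) = ctx.saved_tensors
            # Backward: replace the step's zero gradient with a fast-sigmoid
            # surrogate centered at the firing threshold (width is an assumption).
            diff = membrane_potential - ctx.threshold
            surrogate = 1.0 / (1.0 + 10.0 * diff.abs()) ** 2
            return grad_output * surrogate, None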
CraniMem: Cranial Inspired Gated and Bounded Memory for Agentic Systems
arXiv:2603.15642v1 Announce Type: new Abstract: Large language model (LLM) agents are increasingly deployed in long-running workflows, where they must preserve user and task state across many turns. Many existing agent memory systems behave like external databases with ad hoc...
VIGIL: Towards Edge-Extended Agentic AI for Enterprise IT Support
arXiv:2603.16110v1 Announce Type: new Abstract: Enterprise IT support is constrained by heterogeneous devices, evolving policies, and long-tail failure modes that are difficult to resolve centrally. We present VIGIL, an edge-extended agentic AI system that deploys desktop-resident agents to perform situated...
Persona-Conditioned Risk Behavior in Large Language Models: A Simulated Gambling Study with GPT-4.1
arXiv:2603.15831v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed as autonomous agents in uncertain, sequential decision-making contexts. Yet it remains poorly understood whether the behaviors they exhibit in such environments reflect principled cognitive patterns or simply surface-level...
Form Follows Function: Recursive Stem Model
arXiv:2603.15641v1 Announce Type: new Abstract: Recursive reasoning models such as the Hierarchical Reasoning Model (HRM) and the Tiny Recursive Model (TRM) show that small, weight-shared networks can solve compute-heavy, NP-hard puzzles by iteratively refining latent states, but their training typically relies...
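For context on the recursive-refinement pattern named here (HRM/TRM-style weight sharing), the following is a minimal generic sketch: a single small block whose weights are reused across refinement steps of a latent state. The block structure and step count are illustrative assumptions, not the Recursive Stem Model itself.

    import torch
    import torch.nn as nn

    class RecursiveRefiner(nn.Module):
        def __init__(self, dim: int, steps: int = 8):
            super().__init__()
            self.steps = steps
            # One small block; its weights are shared across all refinement steps.
            self.block = nn.Sequential(
                nn.Linear(2 * dim, dim), nn.GELU(), nn.Linear(dim, dim)
            )

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            z = torch.zeros_like(x)               # initial latent state
            for _ in range(self.steps):           # iteratively refine the latent
                z = z + self.block(torch.cat([z, x], dim=-1))
            return z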
Context-Length Robustness in Question Answering Models: A Comparative Empirical Study
arXiv:2603.15723v1 Announce Type: new Abstract: Large language models are increasingly deployed in settings where relevant information is embedded within long and noisy contexts. Despite this, robustness to growing context length remains poorly understood across different question answering tasks. In this...
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization
arXiv:2603.16105v1 Announce Type: new Abstract: Post-training model compression is essential for enhancing the portability of Large Language Models (LLMs) while preserving their performance. While several compression approaches have been proposed, less emphasis has been placed on selecting the most suitable...
Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning
arXiv:2603.16127v1 Announce Type: new Abstract: We investigate the role of learning rate scheduling in the large-scale pre-training of large language models, focusing on its influence on downstream performance after supervised fine-tuning (SFT). Decay-based learning rate schedulers are widely used to...
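For reference, the two scheduling regimes the abstract contrasts, decay-based versus constant learning rate after warmup, can be sketched as below; the peak rate, warmup length, and horizon are placeholder values, not the paper's training recipe.

    import math

    def cosine_decay_lr(step, total_steps, peak_lr=3e-4, min_lr=3e-5, warmup=2000):
        if step < warmup:                                    # linear warmup
            return peak_lr * step / warmup
        progress = (step - warmup) / max(1, total_steps - warmup)
        return min_lr + 0.5 * (peak_lr - min_lr) * (1 + math.cos(math.pi * progress))

    def constant_lr(step, peak_lr=3e-4, warmup=2000):
        return peak_lr * min(1.0, step / warmup)             # warmup, then no decay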
Social Simulacra in the Wild: AI Agent Communities on Moltbook
arXiv:2603.16128v1 Announce Type: new Abstract: As autonomous LLM-based agents increasingly populate social platforms, understanding the dynamics of AI-agent communities becomes essential for both communication research and platform governance. We present the first large-scale empirical comparison of AI-agent and human online...
PlotTwist: A Creative Plot Generation Framework with Small Language Models
arXiv:2603.16410v1 Announce Type: new Abstract: Creative plot generation presents a fundamental challenge for language models: transforming a concise premise into a coherent narrative that sustains global structure, character development, and emotional resonance. Although recent Large Language Models (LLMs) demonstrate strong...
Discovering the Hidden Role of Gini Index In Prompt-based Classification
arXiv:2603.15654v1 Announce Type: new Abstract: In classification tasks, the predictions that matter most often come from long-tailed minority classes. Yet these classes consistently exhibit low accuracy, while a few high-performing classes dominate overall performance. We pursue a foundational understanding...
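As background for the quantity in the title, the Gini index of a predicted class distribution can be computed as below; how the paper connects it to prompt-based classification is not visible in the truncated abstract.

    import numpy as np

    def gini_index(probs: np.ndarray) -> float:
        """Gini impurity of a probability vector: 1 - sum_k p_k^2."""
        probs = probs / probs.sum()
        return float(1.0 - np.sum(probs ** 2))

    print(gini_index(np.array([0.25, 0.25, 0.25, 0.25])))  # 0.75: mass spread evenly
    print(gini_index(np.array([0.97, 0.01, 0.01, 0.01])))  # ~0.059: one class dominates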
Spectral Edge Dynamics of Training Trajectories: Signal--Noise Geometry Across Scales
arXiv:2603.15678v1 Announce Type: new Abstract: Despite hundreds of millions of parameters, transformer training trajectories evolve within only a few coherent directions. We introduce \emph{Spectral Edge Dynamics} (SED) to measure this structure: rolling-window SVD of parameter updates reveals a sharp boundary...
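The measurement described, a rolling-window SVD over parameter updates, can be illustrated generically as below; the window length and the definition of an "update" (here, successive flattened parameter deltas) are assumptions rather than the paper's exact protocol.

    import numpy as np

    def rolling_update_spectra(update_history, window=16):
        """update_history: list of 1-D arrays of flattened parameter deltas."""
        spectra = []
        for t in range(window, len(update_history) + 1):
            W = np.stack(update_history[t - window:t], axis=0)  # (window, n_params)
            s = np.linalg.svd(W, compute_uv=False)              # singular values
            spectra.append(s / s.sum())                         # normalized spectrum
        return spectra  # a sharp drop in the spectrum marks the "spectral edge"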
Transition Flow Matching
arXiv:2603.15689v1 Announce Type: new Abstract: Mainstream flow matching methods typically focus on learning the local velocity field, which inherently requires multiple integration steps during generation. In contrast, Mean Velocity Flow models establish a relationship between the local velocity field and...
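For orientation, the standard local-velocity flow matching objective the abstract starts from looks roughly as follows, using a linear interpolation path and a Gaussian source; `v_theta` is a placeholder network, not the paper's model.

    import torch

    def flow_matching_loss(v_theta, x1):
        """x1: batch of data samples; x0 ~ N(0, I) is the source distribution."""
        x0 = torch.randn_like(x1)
        t = torch.rand(x1.shape[0], *([1] * (x1.dim() - 1)), device=x1.device)
        xt = (1 - t) * x0 + t * x1              # point on the straight-line path
        target_velocity = x1 - x0               # local velocity along that path
        return ((v_theta(xt, t) - target_velocity) ** 2).mean()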
Hypothesis Class Determines Explanation: Why Accurate Models Disagree on Feature Attribution
arXiv:2603.15821v1 Announce Type: new Abstract: The assumption that prediction-equivalent models produce equivalent explanations underlies many practices in explainable AI, including model selection, auditing, and regulatory evaluation. In this work, we show that this assumption does not hold. Through a large-scale...
When Stability Fails: Hidden Failure Modes of LLMs in Data-Constrained Scientific Decision-Making
arXiv:2603.15840v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as decision-support tools in data-constrained scientific workflows, where correctness and validity are critical. However, evaluation practices often emphasize stability or reproducibility across repeated runs. While these properties are...
Informationally Compressive Anonymization: Non-Degrading Sensitive Input Protection for Privacy-Preserving Supervised Machine Learning
arXiv:2603.15842v1 Announce Type: new Abstract: Modern machine learning systems increasingly rely on sensitive data, creating significant privacy, security, and regulatory risks that existing privacy-preserving machine learning (ppML) techniques, such as Differential Privacy (DP) and Homomorphic Encryption (HE), address only at...
Generative Inverse Design with Abstention via Diagonal Flow Matching
arXiv:2603.15925v1 Announce Type: new Abstract: Inverse design aims to find design parameters $x$ achieving target performance $y^*$. Generative approaches learn bidirectional mappings between designs and labels, enabling diverse solution sampling. However, standard conditional flow matching (CFM), when adapted to inverse...
Deriving Hyperparameter Scaling Laws via Modern Optimization Theory
arXiv:2603.15958v1 Announce Type: new Abstract: Hyperparameter transfer has become an important component of modern large-scale training recipes. Existing methods, such as muP, primarily focus on transfer between model sizes, with transfer across batch sizes and training horizons often relying on...
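As context for the prior work the abstract cites, muP-style width transfer is commonly summarized as tuning the learning rate at a small base width and rescaling it for wider models; the sketch below shows the frequently quoted 1/width rule for hidden layers under Adam, with the caveat that exact per-layer rules depend on the parameterization details.

    def transfer_hidden_lr(base_lr: float, base_width: int, target_width: int) -> float:
        # Commonly cited muP rule for hidden-layer learning rates with Adam:
        # scale the tuned base learning rate by base_width / target_width.
        return base_lr * base_width / target_width

    print(transfer_hidden_lr(1e-3, 256, 4096))  # 6.25e-05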
Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition
arXiv:2603.16043v1 Announce Type: new Abstract: Human Activity Recognition using wearable inertial sensors is foundational to healthcare monitoring, fitness analytics, and context-aware computing, yet its deployment is hindered by cross-user variability arising from heterogeneous physiological traits, motor habits, and sensor placements....
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal Scaling of Diffusion Language Models
arXiv:2603.16077v1 Announce Type: new Abstract: Masked diffusion models (MDM) exhibit superior generalization when learned using a Partial masking scheme (Prime). This approach converts tokens into sub-tokens and models the diffusion process at the sub-token level. We identify two limitations of...
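The title's binary encoding of tokens into sub-tokens can be illustrated generically as below (token id to bit vector); the bit width and ordering are assumptions, and the index-shuffling component is omitted since the truncated abstract does not show it.

    import numpy as np

    def to_binary_subtokens(token_ids: np.ndarray, vocab_size: int) -> np.ndarray:
        n_bits = int(np.ceil(np.log2(vocab_size)))           # sub-tokens per token
        bits = (token_ids[..., None] >> np.arange(n_bits)) & 1
        return bits                                          # shape (..., n_bits)

    print(to_binary_subtokens(np.array([0, 5, 50256]), vocab_size=50257))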
A Depth-Aware Comparative Study of Euclidean and Hyperbolic Graph Neural Networks on Bitcoin Transaction Systems
arXiv:2603.16080v1 Announce Type: new Abstract: Bitcoin transaction networks are large-scale socio-technical systems in which activities are represented through multi-hop interaction patterns. Graph Neural Networks (GNNs) have become a widely adopted tool for analyzing such systems, supporting tasks such as...
DyACE: Dynamic Algorithm Co-evolution for Online Automated Heuristic Design with Large Language Model
arXiv:2603.13344v1 Announce Type: new Abstract: The prevailing paradigm in Automated Heuristic Design (AHD) typically relies on the assumption that a single, fixed algorithm can effectively navigate the shifting dynamics of a combinatorial search. This static approach often proves inadequate for...
Distilling Deep Reinforcement Learning into Interpretable Fuzzy Rules: An Explainable AI Framework
arXiv:2603.13257v1 Announce Type: new Abstract: Deep Reinforcement Learning (DRL) agents achieve remarkable performance in continuous control but remain opaque, hindering deployment in safety-critical domains. Existing explainability methods either provide only local insights (SHAP, LIME) or employ over-simplified surrogates failing to...
Multi-Axis Trust Modeling for Interpretable Account Hijacking Detection
arXiv:2603.13246v1 Announce Type: new Abstract: This paper proposes a Hadith-inspired multi-axis trust modeling framework, motivated by a structurally analogous problem in classical Hadith scholarship: assessing the trustworthiness of information sources using interpretable, multidimensional criteria rather than a single anomaly score....
Knowledge Distillation for Large Language Models
arXiv:2603.13765v1 Announce Type: new Abstract: We propose a resource-efficient framework for compressing large language models through knowledge distillation, combined with guided chain-of-thought reinforcement learning. Using Qwen 3B as the teacher and Qwen 0.5B as the student, we apply knowledge distillation...
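The distillation setup described (Qwen 3B teacher, Qwen 0.5B student) typically uses a loss of the form sketched below: temperature-scaled KL between teacher and student distributions mixed with cross-entropy. The temperature and mixing weight here are placeholders, not the paper's settings, and the guided chain-of-thought RL component is not shown.

    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
        # Soft-target term: KL divergence on temperature-scaled logits,
        # with the standard T^2 factor to keep gradient magnitudes comparable.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=-1),
            F.softmax(teacher_logits / T, dim=-1),
            reduction="batchmean",
        ) * (T * T)
        # Hard-target term: usual cross-entropy against the ground-truth labels.
        hard = F.cross_entropy(student_logits, labels)
        return alpha * soft + (1 - alpha) * hard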
PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting
arXiv:2603.13818v1 Announce Type: new Abstract: Precipitation nowcasting is vital for flood warning, agricultural management, and emergency response, yet two bottlenecks persist: the prohibitive cost of modeling million-scale spatiotemporal tokens from multi-variate atmospheric fields, and the extreme long-tailed rainfall distribution where...
DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation
arXiv:2603.13327v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated remarkable capabilities in tool use, reasoning, and code generation, yet single-agent systems exhibit fundamental limitations when confronted with complex research tasks demanding multi-source synthesis, adversarial verification, and personalized...
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
arXiv:2603.13985v1 Announce Type: new Abstract: Pre-trained Large Language Models (LLMs) exhibit broad capabilities, yet for specific tasks or domains, achieving higher accuracy and more reliable reasoning generally depends on post-training through Supervised Fine-Tuning (SFT) or Reinforcement Learning (RL)....
LLM-MINE: Large Language Model based Alzheimer's Disease and Related Dementias Phenotypes Mining from Clinical Notes
arXiv:2603.13673v1 Announce Type: new Abstract: Accurate extraction of Alzheimer's Disease and Related Dementias (ADRD) phenotypes from electronic health records (EHR) is critical for early-stage detection and disease staging. However, this information is usually embedded in unstructured textual data rather than...
The ARC of Progress towards AGI: A Living Survey of Abstraction and Reasoning
arXiv:2603.13372v1 Announce Type: new Abstract: The Abstraction and Reasoning Corpus (ARC-AGI) has become a key benchmark for fluid intelligence in AI. This survey presents the first cross-generation analysis of 82 approaches across three benchmark versions and the ARC Prize 2024-2025...