International Law

LOW Academic International

Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

arXiv:2602.20728v1 Announce Type: new Abstract: Reward design has been one of the central challenges for real world reinforcement learning (RL) deployment, especially in settings with multiple objectives. Preference-based RL offers an appealing alternative by learning from human preferences over pairs...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

PyVision-RL: Forging Open Agentic Vision Models via RL

arXiv:2602.20739v1 Announce Type: new Abstract: Reinforcement learning for agentic multimodal models often suffers from interaction collapse, where models learn to reduce tool usage and multi-turn reasoning, limiting the benefits of agentic behavior. We introduce PyVision-RL, a reinforcement learning framework for...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

POMDPPlanners: Open-Source Package for POMDP Planning

arXiv:2602.20810v1 Announce Type: new Abstract: We present POMDPPlanners, an open-source Python package for empirical evaluation of Partially Observable Markov Decision Process (POMDP) planning algorithms. The package integrates state-of-the-art planning algorithms, a suite of benchmark environments with safety-critical variants, automated hyperparameter...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Pressure Reveals Character: Behavioural Alignment Evaluation at Depth

arXiv:2602.20813v1 Announce Type: new Abstract: Evaluating alignment in language models requires testing how they behave under realistic pressure, not just what they claim they would do. While alignment failures increasingly cause real-world harm, comprehensive evaluation frameworks with realistic multi-turn scenarios...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs

arXiv:2602.20878v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) achieve strong performance on visual question answering benchmarks, yet often rely on spurious correlations rather than genuine causal reasoning. Existing evaluations primarily assess the correctness of the answers, making it unclear...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

arXiv:2602.20934v1 Announce Type: new Abstract: The paradigm of Large Language Models is undergoing a fundamental transition from static inference engines to dynamic autonomous cognitive systems.While current research primarily focuses on scaling context windows or optimizing prompt engineering the theoretical bridge...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Tool Building as a Path to "Superintelligence"

arXiv:2602.21061v1 Announce Type: new Abstract: The Diligent Learner framework suggests LLMs can achieve superintelligence via test-time search, provided a sufficient step-success probability $\gamma$. In this work, we design a benchmark to measure $\gamma$ on logical out-of-distribution inference. We construct a...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

The Initial Exploration Problem in Knowledge Graph Exploration

arXiv:2602.21066v1 Announce Type: new Abstract: Knowledge Graphs (KGs) enable the integration and representation of complex information across domains, but their semantic richness and structural complexity create substantial barriers for lay users without expertise in semantic web technologies. When encountering an...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Interpretable Medical Image Classification using Prototype Learning and Privileged Information

arXiv:2310.15741v1 Announce Type: cross Abstract: Interpretability is often an essential requirement in medical imaging. Advanced deep learning methods are required to address this need for explainability and high performance. In this work, we investigate whether additional information available during the...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

ConceptRM: The Quest to Mitigate Alert Fatigue through Consensus-Based Purity-Driven Data Cleaning for Reflection Modelling

arXiv:2602.20166v1 Announce Type: cross Abstract: In many applications involving intelligent agents, the overwhelming volume of alerts (mostly false) generated by the agents may desensitize users and cause them to overlook critical issues, leading to the so-called ''alert fatigue''. A common...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Benchmarking Early Deterioration Prediction Across Hospital-Rich and MCI-Like Emergency Triage Under Constrained Sensing

arXiv:2602.20168v1 Announce Type: cross Abstract: Emergency triage decisions are made under severe information constraints, yet most data-driven deterioration models are evaluated using signals unavailable during initial assessment. We present a leakage-aware benchmarking framework for early deterioration prediction that evaluates model...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

arXiv:2602.20294v1 Announce Type: new Abstract: Simulating real personalities with large language models requires grounding generation in authentic personal data. Existing evaluation approaches rely on demographic surveys, personality questionnaires, or short AI-led interviews as proxies, but lack direct assessment against what...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

arXiv:2602.20300v1 Announce Type: new Abstract: Large Language Model (LLM) hallucinations are usually treated as defects of the model or its decoding strategy. Drawing on classical linguistics, we argue that a query's form can also shape a listener's (and model's) response....

1 min 1 month, 3 weeks ago

ear

LOW Academic International

No One Size Fits All: QueryBandits for Hallucination Mitigation

arXiv:2602.20332v1 Announce Type: new Abstract: Advanced reasoning capabilities in Large Language Models (LLMs) have led to more frequent hallucinations; yet most mitigation work focuses on open-source models for post-hoc detection and parameter editing. The dearth of studies focusing on hallucinations...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Disentangling Geometry, Performance, and Training in Language Models

arXiv:2602.20433v1 Announce Type: new Abstract: Geometric properties of Transformer weights, particularly the unembedding matrix, have been widely useful in language model interpretability research. Yet, their utility for estimating downstream performance remains unclear. In this work, we systematically investigate the relationship...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

arXiv:2602.21534v1 Announce Type: new Abstract: Agentic reinforcement learning (ARL) has rapidly gained attention as a promising paradigm for training agents to solve complex, multi-step interactive tasks. Despite encouraging early results, ARL remains highly unstable, often leading to training collapse. This...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Distill and Align Decomposition for Enhanced Claim Verification

arXiv:2602.21857v1 Announce Type: new Abstract: Complex claim verification requires decomposing sentences into verifiable subclaims, yet existing methods struggle to align decomposition quality with verification performance. We propose a reinforcement learning (RL) approach that jointly optimizes decomposition quality and verifier alignment...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices

arXiv:2602.21858v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) have made significant progress in mobile agent development, yet their capabilities are predominantly confined to a reactive paradigm, where they merely execute explicit user commands. The emerging paradigm of proactive...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Inference-time Alignment via Sparse Junction Steering

arXiv:2602.21215v1 Announce Type: cross Abstract: Token-level steering has emerged as a pivotal approach for inference-time alignment, enabling fine grained control over large language models by modulating their output distributions without parameter updates. While effective, existing methods rely on dense intervention...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

EPSVec: Efficient and Private Synthetic Data Generation via Dataset Vectors

arXiv:2602.21218v1 Announce Type: cross Abstract: High-quality data is essential for modern machine learning, yet many valuable corpora are sensitive and cannot be freely shared. Synthetic data offers a practical substitute for downstream development, and large language models (LLMs) have emerged...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Field-Theoretic Memory for AI Agents: Continuous Dynamics for Context Preservation

arXiv:2602.21220v1 Announce Type: cross Abstract: We present a memory system for AI agents that treats stored information as continuous fields governed by partial differential equations rather than discrete entries in a database. The approach draws from classical field theory: memories...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Architecture-Agnostic Curriculum Learning for Document Understanding: Empirical Evidence from Text-Only and Multimodal

arXiv:2602.21225v1 Announce Type: cross Abstract: We investigate whether progressive data scheduling -- a curriculum learning strategy that incrementally increases training data exposure (33\%$\rightarrow$67\%$\rightarrow$100\%) -- yields consistent efficiency gains across architecturally distinct document understanding models. By evaluating BERT (text-only, 110M parameters)...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

IslamicLegalBench: Evaluating LLMs Knowledge and Reasoning of Islamic Law Across 1,200 Years of Islamic Pluralist Legal Traditions

arXiv:2602.21226v1 Announce Type: cross Abstract: As millions of Muslims turn to LLMs like GPT, Claude, and DeepSeek for religious guidance, a critical question arises: Can these AI systems reliably reason about Islamic law? We introduce IslamicLegalBench, the first benchmark evaluating...

1 min 1 month, 3 weeks ago

ear

LOW Law Review International

The Fundamental Right to Education

ARTICLE The Fundamental Right to Education Derek W. Black* New litigation has revived one of the most important questions of constitutional law: Is education a fundamental right? The Court’s previous answers have been disappointing. While the Court has hinted that...

1 min 1 month, 3 weeks ago

ear

LOW Law Review International

Gains, Losses, and Judges: Framing and the Judiciary

ARTICLE Gains, Losses, and Judges: Framing and the Judiciary Jeffrey J. Rachlinski* & Andrew J. Wistrich** Losses hurt more than foregone gains—an asymmetry that psychologists call “loss aversion.” Losses cause more regret than foregone gains, and people struggle harder to...

1 min 1 month, 3 weeks ago

ear

LOW Business & Strategy International

Corporate Governance in the Age of AI: Board Responsibilities and Best Practices

As AI transforms business operations, corporate boards face new governance challenges requiring updated oversight frameworks and expertise.

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Budget-Aware Agentic Routing via Boundary-Guided Training

arXiv:2602.21227v1 Announce Type: cross Abstract: As large language models (LLMs) evolve into autonomous agents that execute long-horizon workflows, invoking a high-capability model at every step becomes economically unsustainable. While model routing is effective for single-turn queries, agentic routing is a...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following

arXiv:2602.21228v1 Announce Type: cross Abstract: As applications of large language models (LLMs) become increasingly complex, the demand for robust complex instruction following capabilities is growing accordingly. We argue that a thorough understanding of the instruction itself, especially the latent reasoning...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

arXiv:2602.21233v1 Announce Type: cross Abstract: This technical report introduces AngelSlim, a comprehensive and versatile toolkit for large model compression developed by the Tencent Hunyuan team. By consolidating cutting-edge algorithms, including quantization, speculative decoding, token pruning, and distillation. AngelSlim provides a...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

AgenticTyper: Automated Typing of Legacy Software Projects Using Agentic AI

arXiv:2602.21251v1 Announce Type: cross Abstract: Legacy JavaScript systems lack type safety, making maintenance risky. While TypeScript can help, manually adding types is expensive. Previous automated typing research focuses on type inference but rarely addresses type checking setup, definition generation, bug...

1 min 1 month, 3 weeks ago

ear

Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

PyVision-RL: Forging Open Agentic Vision Models via RL

POMDPPlanners: Open-Source Package for POMDP Planning

Pressure Reveals Character: Behavioural Alignment Evaluation at Depth

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs

Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

Tool Building as a Path to "Superintelligence"

The Initial Exploration Problem in Knowledge Graph Exploration

Interpretable Medical Image Classification using Prototype Learning and Privileged Information

ConceptRM: The Quest to Mitigate Alert Fatigue through Consensus-Based Purity-Driven Data Cleaning for Reflection Modelling

Benchmarking Early Deterioration Prediction Across Hospital-Rich and MCI-Like Emergency Triage Under Constrained Sensing

InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

No One Size Fits All: QueryBandits for Hallucination Mitigation

Disentangling Geometry, Performance, and Training in Language Models

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Distill and Align Decomposition for Enhanced Claim Verification

ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices

Inference-time Alignment via Sparse Junction Steering

EPSVec: Efficient and Private Synthetic Data Generation via Dataset Vectors

Field-Theoretic Memory for AI Agents: Continuous Dynamics for Context Preservation

Architecture-Agnostic Curriculum Learning for Document Understanding: Empirical Evidence from Text-Only and Multimodal

IslamicLegalBench: Evaluating LLMs Knowledge and Reasoning of Islamic Law Across 1,200 Years of Islamic Pluralist Legal Traditions

The Fundamental Right to Education

Gains, Losses, and Judges: Framing and the Judiciary

Corporate Governance in the Age of AI: Board Responsibilities and Best Practices

Budget-Aware Agentic Routing via Boundary-Guided Training

ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following

AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

AgenticTyper: Automated Typing of Legacy Software Projects Using Agentic AI

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.