International Law

LOW Academic International

The Trilingual Triad Framework: Integrating Design, AI, and Domain Knowledge in No-code AI Smart City Course

arXiv:2603.05036v1 Announce Type: new Abstract: This paper introduces the "Trilingual Triad" framework, a model that explains how students learn to design with generative artificial intelligence (AI) through the integration of Design, AI, and Domain Knowledge. As generative AI rapidly enters...

1 min 1 month, 2 weeks ago


LOW Academic International

WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

arXiv:2603.05044v1 Announce Type: new Abstract: Current paradigms for training GUI agents are fundamentally limited by a reliance on either unsafe, non-reproducible live web interactions or costly, scarce human-crafted data and environments. We argue this focus on data volume overlooks a...

1 min 1 month, 2 weeks ago


LOW Academic United States

Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning

arXiv:2603.05120v1 Announce Type: new Abstract: Enhancing mathematical reasoning in Large Language Models typically demands massive datasets, yet data efficiency remains a critical bottleneck. While Curriculum Learning attempts to structure this process, standard unidirectional approaches (simple-to-complex) suffer from inefficient sample utilization:...

1 min 1 month, 2 weeks ago


LOW Academic International

CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models

arXiv:2603.04406v1 Announce Type: new Abstract: With the growing use of Retrieval-Augmented Generation (RAG), training large language models (LLMs) for context-sensitive reasoning and faithfulness is increasingly important. Existing RAG-oriented reinforcement learning (RL) methods rely on external rewards that often fail to...

1 min 1 month, 2 weeks ago


LOW Academic United Kingdom

Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework

arXiv:2603.04409v1 Announce Type: new Abstract: The evaluation of large language models faces significant challenges. Technical benchmarks often lack real-world relevance, while existing human preference evaluations suffer from unrepresentative sampling, superficial assessment depth, and single-metric reductionism. To address these issues, we...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries

arXiv:2603.04413v1 Announce Type: new Abstract: Meaning in human language is relational, context dependent, and emergent, arising from dynamic systems of signs rather than fixed word-concept mappings. In computational settings, this semiotic and interpretive complexity complicates the generation and evaluation of...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Multiclass Hate Speech Detection with RoBERTa-OTA: Integrating Transformer Attention and Graph Convolutional Networks

arXiv:2603.04414v1 Announce Type: new Abstract: Multiclass hate speech detection across demographic categories remains computationally challenging due to implicit targeting strategies and linguistic variability in social media content. Existing approaches rely solely on learned representations from training data, without explicitly incorporating...

1 min 1 month, 2 weeks ago


LOW Academic International

Same Input, Different Scores: A Multi Model Study on the Inconsistency of LLM Judge

arXiv:2603.04417v1 Announce Type: new Abstract: Large language models are increasingly used as automated evaluators in research and enterprise settings, a practice known as LLM-as-a-judge. While prior work has examined accuracy, bias, and alignment with human preferences, far less attention has...

1 min 1 month, 2 weeks ago


LOW Academic International

Context-Dependent Affordance Computation in Vision-Language Models

arXiv:2603.04419v1 Announce Type: new Abstract: We characterize the phenomenon of context-dependent affordance computation in vision-language models (VLMs). Through a large-scale computational study (n=3,213 scene-context pairs from COCO-2017) using Qwen-VL 30B and LLaVA-1.5-13B subject to systematic context priming across 7 agentic...

1 min 1 month, 2 weeks ago


LOW Academic International

What Is Missing: Interpretable Ratings for Large Language Model Outputs

arXiv:2603.04429v1 Announce Type: new Abstract: Current Large Language Model (LLM) preference learning methods such as Proximal Policy Optimization and Direct Preference Optimization learn from direct rankings or numerical ratings of model outputs. These rankings are subjective, and a single numerical...

1 min 1 month, 2 weeks ago


LOW Academic International

From Static Inference to Dynamic Interaction: Navigating the Landscape of Streaming Large Language Models

arXiv:2603.04592v1 Announce Type: new Abstract: Standard Large Language Models (LLMs) are predominantly designed for static inference with pre-defined inputs, which limits their applicability in dynamic, real-time scenarios. To address this gap, the streaming LLM paradigm has emerged. However, existing definitions...

1 min 1 month, 2 weeks ago


LOW Academic International

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

arXiv:2603.04597v1 Announce Type: new Abstract: Large language models (LLMs) typically receive diverse natural language (NL) feedback through interaction with the environment. However, current reinforcement learning (RL) algorithms rely solely on scalar rewards, leaving the rich information in NL feedback underutilized...

1 min 1 month, 2 weeks ago


LOW Academic International

iAgentBench: Benchmarking Sensemaking Capabilities of Information-Seeking Agents on High-Traffic Topics

arXiv:2603.04656v1 Announce Type: new Abstract: With the emergence of search-enabled generative QA systems, users are increasingly turning to tools that browse, aggregate, and reconcile evidence across multiple sources on their behalf. Yet many widely used QA benchmarks remain answerable by...

1 min 1 month, 2 weeks ago


LOW Academic International

Stan: An LLM-based thermodynamics course assistant

arXiv:2603.04657v1 Announce Type: new Abstract: Discussions of AI in education focus predominantly on student-facing tools -- chatbots, tutors, and problem generators -- while the potential for the same infrastructure to support instructors remains largely unexplored. We describe Stan, a suite...

1 min 1 month, 2 weeks ago


LOW Academic International

Optimizing Language Models for Crosslingual Knowledge Consistency

arXiv:2603.04678v1 Announce Type: new Abstract: Large language models are known to often exhibit inconsistent knowledge. This is particularly problematic in multilingual scenarios, where models are likely to be asked similar questions in different languages, and inconsistent responses can undermine their...

1 min 1 month, 2 weeks ago


LOW Academic International

Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

arXiv:2603.04698v1 Announce Type: new Abstract: This paper evaluates data augmentation and feature enhancement techniques for hate speech detection, comparing traditional classifiers, e.g., Delta Term Frequency-Inverse Document Frequency (Delta TF-IDF), with transformer-based models (DistilBERT, RoBERTa, DeBERTa, Gemma-7B, gpt-oss-20b) across diverse datasets....

1 min 1 month, 2 weeks ago


LOW Academic United States

Detection of Illicit Content on Online Marketplaces using Large Language Models

arXiv:2603.04707v1 Announce Type: new Abstract: Online marketplaces, while revolutionizing global commerce, have inadvertently facilitated the proliferation of illicit activities, including drug trafficking, counterfeit sales, and cybercrimes. Traditional content moderation methods such as manual reviews and rule-based automated systems struggle with...

1 min 1 month, 2 weeks ago


LOW Academic United States

AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments

arXiv:2603.04718v1 Announce Type: new Abstract: In oral arguments, judges probe attorneys with questions about the factual record, legal claims, and the strength of their arguments. To prepare for this questioning, both law schools and practicing attorneys rely on moot courts:...

1 min 1 month, 2 weeks ago


LOW Academic International

TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

arXiv:2603.04772v1 Announce Type: new Abstract: Despite the exceptional reasoning capabilities of Multimodal Large Language Models (MLLMs), their adaptation into universal embedding models is significantly impeded by task conflict. To address this, we propose TSEmbed, a universal multimodal embedding framework that...

1 min 1 month, 2 weeks ago


LOW Academic International

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

arXiv:2603.04820v1 Announce Type: new Abstract: Automated short-answer scoring lags other LLM applications. We meta-analyze 890 culminating results across a systematic review of LLM short-answer scoring studies, modeling the traditional effect size of Quadratic Weighted Kappa (QWK) with mixed effects metaregression....

1 min 1 month, 2 weeks ago


LOW Academic International

SinhaLegal: A Benchmark Corpus for Information Extraction and Analysis in Sinhala Legislative Texts

arXiv:2603.04854v1 Announce Type: new Abstract: SinhaLegal introduces a Sinhala legislative text corpus containing approximately 2 million words across 1,206 legal documents. The dataset includes two types of legal documents: 1,065 Acts dated from 1981 to 2014 and 141 Bills from...

1 min 1 month, 2 weeks ago


LOW Academic European Union

HACHIMI: Scalable and Controllable Student Persona Generation via Orchestrated Agents

arXiv:2603.04855v1 Announce Type: new Abstract: Student Personas (SPs) are emerging as infrastructure for educational LLMs, yet prior work often relies on ad-hoc prompting or hand-crafted profiles with limited control over educational theory and population distributions. We formalize this as Theory-Aligned...

1 min 1 month, 2 weeks ago


LOW Academic International

Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

arXiv:2603.04893v1 Announce Type: new Abstract: Diverse outputs in text generation are necessary for effective exploration in complex reasoning tasks, such as code generation and mathematical problem solving. Such Pass@$k$ problems benefit from distinct candidates covering the solution space. However, traditional...

1 min 1 month, 2 weeks ago


LOW Academic United States

Can LLMs Capture Expert Uncertainty? A Comparative Analysis of Value Alignment in Ethnographic Qualitative Research

arXiv:2603.04897v1 Announce Type: new Abstract: Qualitative analysis of open-ended interviews plays a central role in ethnographic and economic research by uncovering individuals' values, motivations, and culturally embedded financial behaviors. While large language models (LLMs) offer promising support for automating and...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition

arXiv:2603.04945v1 Announce Type: new Abstract: Training automatic speech recognition (ASR) models increasingly relies on decentralized federated learning to ensure data privacy and accessibility, producing multiple local models that require effective merging. In hybrid ASR systems, while acoustic models can be...

1 min 1 month, 2 weeks ago


LOW Academic International

Decorrelating the Future: Joint Frequency Domain Learning for Spatio-temporal Forecasting

arXiv:2603.04418v1 Announce Type: new Abstract: Standard direct forecasting models typically rely on point-wise objectives such as Mean Squared Error, which fail to capture the complex spatio-temporal dependencies inherent in graph-structured signals. While recent frequency-domain approaches such as FreDF mitigate temporal...

1 min 1 month, 2 weeks ago


LOW Academic European Union

Machine Learning for Complex Systems Dynamics: Detecting Bifurcations in Dynamical Systems with Deep Neural Networks

arXiv:2603.04420v1 Announce Type: new Abstract: Critical transitions are the abrupt shifts between qualitatively different states of a system, and they are crucial to understanding tipping points in complex dynamical systems across ecology, climate science, and biology. Detecting these shifts typically...

1 min 1 month, 2 weeks ago


LOW Academic United States

FedEMA-Distill: Exponential Moving Average Guided Knowledge Distillation for Robust Federated Learning

arXiv:2603.04422v1 Announce Type: new Abstract: Federated learning (FL) often degrades when clients hold heterogeneous non-Independent and Identically Distributed (non-IID) data and when some clients behave adversarially, leading to client drift, slow convergence, and high communication overhead. This paper proposes FedEMA-Distill,...

1 min 1 month, 2 weeks ago


LOW Academic International

Delta-Crosscoder: Robust Crosscoder Model Diffing in Narrow Fine-Tuning Regimes

arXiv:2603.04426v1 Announce Type: new Abstract: Model diffing methods aim to identify how fine-tuning changes a model's internal representations. Crosscoders approach this by learning shared dictionaries of interpretable latent directions between base and fine-tuned models. However, existing formulations struggle with narrow...

1 min 1 month, 2 weeks ago


LOW Academic International

Thin Keys, Full Values: Reducing KV Cache via Low-Dimensional Attention Selection

arXiv:2603.04427v1 Announce Type: new Abstract: Standard transformer attention uses identical dimensionality for queries, keys, and values ($d_q = d_k = d_v = d_{\text{model}}$). Our insight is that these components serve fundamentally different roles, and this symmetry is unnecessary. Queries and...

1 min 1 month, 2 weeks ago

