International Law

LOW Academic International

LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models

arXiv:2603.19255v1 Announce Type: cross Abstract: Despite the strong performance of Large Language Models (LLMs) on complex instruction-following tasks, precise control of output length remains a persistent challenge. Existing methods primarily attempt to enforce length constraints by externally imposing length signals...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Breeze Taigi: Benchmarks and Models for Taiwanese Hokkien Speech Recognition and Synthesis

arXiv:2603.19259v1 Announce Type: cross Abstract: Taiwanese Hokkien (Taigi) presents unique opportunities for advancing speech technology methodologies that can generalize to diverse linguistic contexts. We introduce Breeze Taigi, a comprehensive framework centered on standardized benchmarks for evaluating Taigi speech recognition and...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Teaching an Agent to Sketch One Part at a Time

arXiv:2603.19500v1 Announce Type: new Abstract: We develop a method for producing vector sketches one part at a time. To do this, we train a multi-modal language model-based agent using a novel multi-turn process-reward reinforcement learning following supervised fine-tuning. Our approach...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

On the Ability of Transformers to Verify Plans

arXiv:2603.19954v1 Announce Type: new Abstract: Transformers have shown inconsistent success in AI planning tasks, and theoretical understanding of when generalization should be expected has been limited. We take important steps towards addressing this gap by analyzing the ability of decoder-only...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Pitfalls in Evaluating Interpretability Agents

arXiv:2603.20101v1 Announce Type: new Abstract: Automated interpretability systems aim to reduce the need for human labor and scale analysis to increasingly large models and diverse tasks. Recent efforts toward this goal leverage large language models (LLMs) at increasing levels of...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

arXiv:2603.19266v1 Announce Type: cross Abstract: Distilling robust reasoning capabilities from large language models (LLMs) into smaller, computationally efficient student models remains an unresolved challenge. Despite recent advances, distilled models frequently suffer from superficial pattern memorization and subpar generalization. To overcome...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

arXiv:2603.19268v1 Announce Type: cross Abstract: Large language models (LLMs) in the direction of task adaptation and capability enhancement for professional fields demonstrate significant application potential. Nevertheless, for complex physical systems such as combustion science, general-purpose LLMs often generate severe hallucinations...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

A Human-Centered Workflow for Using Large Language Models in Content Analysis

arXiv:2603.19271v1 Announce Type: cross Abstract: While many researchers use Large Language Models (LLMs) through chat-based access, their real potential lies in leveraging LLMs via application programming interfaces (APIs). This paper conceptualizes LLMs as universal text processing machines and presents a...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models

arXiv:2603.19275v1 Announce Type: cross Abstract: Automatic summarization of radiology reports is an essential application to reduce the burden on physicians. Previous studies have widely used the "pre-training, fine-tuning" strategy to adapt large language models (LLMs) for summarization. This study proposed...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning

arXiv:2603.19278v1 Announce Type: cross Abstract: Modern Transformer-based models frequently suffer from miscalibration, producing overconfident predictions that do not reflect true empirical frequencies. This work investigates the calibration dynamics of LoRA: Low-Rank Adaptation and a novel hyper-network-based adaptation framework as parameter-efficient...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Framing Effects in Independent-Agent Large Language Models: A Cross-Family Behavioral Analysis

arXiv:2603.19282v1 Announce Type: cross Abstract: In many real-world applications, large language models (LLMs) operate as independent agents without interaction, thereby limiting coordination. In this setting, we examine how prompt framing influences decisions in a threshold voting task involving individual-group interest...

1 min 4 weeks, 1 day ago

ear

LOW Academic European Union

CDEoH: Category-Driven Automatic Algorithm Design With Large Language Models

arXiv:2603.19284v1 Announce Type: cross Abstract: With the rapid advancement of large language models (LLMs), LLM-based heuristic search methods have demonstrated strong capabilities in automated algorithm generation. However, their evolutionary processes often suffer from instability and premature convergence. Existing approaches mainly...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

Joint Return and Risk Modeling with Deep Neural Networks for Portfolio Construction

arXiv:2603.19288v1 Announce Type: cross Abstract: Portfolio construction traditionally relies on separately estimating expected returns and covariance matrices using historical statistics, often leading to suboptimal allocation under time-varying market conditions. This paper proposes a joint return and risk modeling framework based...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Spelling Correction in Healthcare Query-Answer Systems: Methods, Retrieval Impact, and Empirical Evaluation

arXiv:2603.19249v1 Announce Type: new Abstract: Healthcare question-answering (QA) systems face a persistent challenge: users submit queries with spelling errors at rates substantially higher than those found in the professional documents they search. This paper presents the first controlled study of...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

From Comprehension to Reasoning: A Hierarchical Benchmark for Automated Financial Research Reporting

arXiv:2603.19254v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to generate financial research reports, shifting from auxiliary analytic tools to primary content producers. Yet recent real-world deployments reveal persistent failures--factual errors, numerical inconsistencies, fabricated references, and shallow...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

ShobdoSetu: A Data-Centric Framework for Bengali Long-Form Speech Recognition and Speaker Diarization

arXiv:2603.19256v1 Announce Type: new Abstract: Bengali is spoken by over 230 million people yet remains severely under-served in automatic speech recognition (ASR) and speaker diarization research. In this paper, we present our system for the DL Sprint 4.0 Bengali Long-Form...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Constraint-aware Path Planning from Natural Language Instructions Using Large Language Models

arXiv:2603.19257v1 Announce Type: new Abstract: Real-world path planning tasks typically involve multiple constraints beyond simple route optimization, such as the number of routes, maximum route length, depot locations, and task-specific requirements. Traditional approaches rely on dedicated formulations and algorithms for...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

Reviewing the Reviewer: Graph-Enhanced LLMs for E-commerce Appeal Adjudication

arXiv:2603.19267v1 Announce Type: new Abstract: Hierarchical review workflows, where a second-tier reviewer (Checker) corrects first-tier (Maker) decisions, generate valuable correction signals that encode why initial judgments failed. However, learning from these signals is hindered by information asymmetry: corrections often depend...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

From Tokens To Agents: A Researcher's Guide To Understanding Large Language Models

arXiv:2603.19269v1 Announce Type: new Abstract: Researchers face a critical choice: how to use -- or not use -- large language models in their work. Using them well requires understanding the mechanisms that shape what LLMs can and cannot do. This...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

Autonoma: A Hierarchical Multi-Agent Framework for End-to-End Workflow Automation

arXiv:2603.19270v1 Announce Type: new Abstract: The increasing complexity of user demands necessitates automation frameworks that can reliably translate open-ended instructions into robust, multi-step workflows. Current monolithic agent architectures often struggle with the challenges of scalability, error propagation, and maintaining focus...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering

arXiv:2603.19277v1 Announce Type: new Abstract: Reviews are central to how travelers evaluate products on online marketplaces, yet existing summarization research often emphasizes end-to-end quality while overlooking benchmark reliability and the practical utility of granular insights. To address this, we propose...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Multilingual Hate Speech Detection and Counterspeech Generation: A Comprehensive Survey and Practical Guide

arXiv:2603.19279v1 Announce Type: new Abstract: Combating online hate speech in multilingual settings requires approaches that go beyond English-centric models and capture the cultural and linguistic diversity of global online discourse. This paper presents a comprehensive survey and practical guide to...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Automated Motif Indexing on the Arabian Nights

arXiv:2603.19283v1 Announce Type: new Abstract: Motifs are non-commonplace, recurring narrative elements, often found originally in folk stories. In addition to being of interest to folklorists, motifs appear as metaphoric devices in modern news, literature, propaganda, and other cultural texts. Finding...

1 min 4 weeks, 1 day ago

ear

LOW Academic United States

PrefPO: Pairwise Preference Prompt Optimization

arXiv:2603.19311v1 Announce Type: new Abstract: Prompt engineering is effective but labor-intensive, motivating automated optimization methods. Existing methods typically require labeled datasets, which are often unavailable, and produce verbose, repetitive prompts. We introduce PrefPO, a minimal prompt optimization approach inspired by...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Prompt-tuning with Attribute Guidance for Low-resource Entity Matching

arXiv:2603.19321v1 Announce Type: new Abstract: Entity Matching (EM) is an important task that determines the logical relationship between two entities, such as Same, Different, or Undecidable. Traditional EM approaches rely heavily on supervised learning, which requires large amounts of high-quality...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure

arXiv:2603.19426v1 Announce Type: new Abstract: Prior work uses linear probes on benchmark prompts as evidence of evaluation awareness in large language models. Because evaluation context is typically entangled with benchmark format and genre, it is unclear whether probe-based signals reflect...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Vocabulary shapes cross-lingual variation of word-order learnability in language models

arXiv:2603.19427v1 Announce Type: new Abstract: Why do some languages like Czech permit free word order, while others like English do not? We address this question by pretraining transformer language models on a spectrum of synthetic word-order variants of natural languages....

1 min 4 weeks, 1 day ago

ear

LOW Academic European Union

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

arXiv:2603.19453v1 Announce Type: new Abstract: We study LLM policy synthesis: using a large language model to iteratively generate programmatic agent policies for multi-agent environments. Rather than training neural policies via reinforcement learning, our framework prompts an LLM to produce Python...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

Inducing Sustained Creativity and Diversity in Large Language Models

arXiv:2603.19519v1 Announce Type: new Abstract: We address a not-widely-recognized subset of exploratory search, where a user sets out on a typically long "search quest" for the perfect wedding dress, overlooked research topic, killer company idea, etc. The first few outputs...

1 min 4 weeks, 1 day ago

ear

LOW Academic International

EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

arXiv:2603.19532v1 Announce Type: new Abstract: Large Language Models (LLMs) are fluent but prone to hallucinations, producing answers that appear plausible yet are unsupported by available evidence. This failure is especially problematic in high-stakes domains where decisions must be justified by...

1 min 4 weeks, 1 day ago

ear

LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models

Breeze Taigi: Benchmarks and Models for Taiwanese Hokkien Speech Recognition and Synthesis

Teaching an Agent to Sketch One Part at a Time

On the Ability of Transformers to Verify Plans

Pitfalls in Evaluating Interpretability Agents

Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

A Human-Centered Workflow for Using Large Language Models in Content Analysis

Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models

HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning

Framing Effects in Independent-Agent Large Language Models: A Cross-Family Behavioral Analysis

CDEoH: Category-Driven Automatic Algorithm Design With Large Language Models

Joint Return and Risk Modeling with Deep Neural Networks for Portfolio Construction

Spelling Correction in Healthcare Query-Answer Systems: Methods, Retrieval Impact, and Empirical Evaluation

From Comprehension to Reasoning: A Hierarchical Benchmark for Automated Financial Research Reporting

ShobdoSetu: A Data-Centric Framework for Bengali Long-Form Speech Recognition and Speaker Diarization

Constraint-aware Path Planning from Natural Language Instructions Using Large Language Models

Reviewing the Reviewer: Graph-Enhanced LLMs for E-commerce Appeal Adjudication

From Tokens To Agents: A Researcher's Guide To Understanding Large Language Models

Autonoma: A Hierarchical Multi-Agent Framework for End-to-End Workflow Automation

MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering

Multilingual Hate Speech Detection and Counterspeech Generation: A Comprehensive Survey and Practical Guide

Automated Motif Indexing on the Arabian Nights

PrefPO: Pairwise Preference Prompt Optimization

Prompt-tuning with Attribute Guidance for Low-resource Entity Matching

Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure

Vocabulary shapes cross-lingual variation of word-order learnability in language models

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

Inducing Sustained Creativity and Diversity in Large Language Models

EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.