International Law

LOW Academic International

HumanLM: Simulating Users with State Alignment Beats Response Imitation

arXiv:2603.03303v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used to simulate how specific users respond to a given context, enabling more user-centric applications that rely on user feedback. However, existing user simulators mostly imitate surface-level patterns and...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation

arXiv:2603.03306v1 Announce Type: cross Abstract: Recently presented Token-Oriented Object Notation (TOON) aims to replace JSON as a serialization format for passing structured data to LLMs with significantly reduced token usage. While showing solid accuracy in LLM comprehension, there is a...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

How does fine-tuning improve sensorimotor representations in large language models?

arXiv:2603.03313v1 Announce Type: cross Abstract: Large Language Models (LLMs) exhibit a significant "embodiment gap", where their text-based representations fail to align with human sensorimotor experiences. This study systematically investigates whether and how task-specific fine-tuning can bridge this gap. Utilizing Representational...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

Towards Self-Robust LLMs: Intrinsic Prompt Noise Resistance via CoIPO

arXiv:2603.03314v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable and steadily improving performance across a wide range of tasks. However, LLM performance may be highly sensitive to prompt variations especially in scenarios with limited openness or strict...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

M-QUEST -- Meme Question-Understanding Evaluation on Semantics and Toxicity

arXiv:2603.03315v1 Announce Type: cross Abstract: Internet memes are a powerful form of online communication, yet their nature and reliance on commonsense knowledge make toxicity detection challenging. Identifying key features for meme interpretation and understanding, is a crucial task. Previous work...

1 min 1 month, 3 weeks ago

ear

LOW Academic United Kingdom

The Influence of Iconicity in Transfer Learning for Sign Language Recognition

arXiv:2603.03316v1 Announce Type: cross Abstract: Most sign language recognition research relies on Transfer Learning (TL) from vision-based datasets such as ImageNet. Some extend this to alternatively available language datasets, often focusing on signs with cross-linguistic similarities. This body of work...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Can Large Language Models Derive New Knowledge? A Dynamic Benchmark for Biological Knowledge Discovery

arXiv:2603.03322v1 Announce Type: cross Abstract: Recent advancements in Large Language Model (LLM) agents have demonstrated remarkable potential in automatic knowledge discovery. However, rigorously evaluating an AI's capacity for knowledge discovery remains a critical challenge. Existing benchmarks predominantly rely on static...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Discern Truth from Falsehood: Reducing Over-Refusal via Contrastive Refinement

arXiv:2603.03323v1 Announce Type: cross Abstract: Large language models (LLMs) aligned for safety often suffer from over-refusal, the tendency to reject seemingly toxic or benign prompts by misclassifying them as toxic. This behavior undermines models' helpfulness and restricts usability in sensitive...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Controlling Chat Style in Language Models via Single-Direction Editing

arXiv:2603.03324v1 Announce Type: cross Abstract: Controlling stylistic attributes in large language models (LLMs) remains challenging, with existing approaches relying on either prompt engineering or post-training alignment. This paper investigates this challenge through the lens of representation engineering, testing the hypothesis...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference

arXiv:2603.03325v1 Announce Type: cross Abstract: Large language models (LLMs) have become integral to modern Human-AI collaboration workflows, where accurately understanding user intent serves as a crucial step for generating satisfactory responses. Context-aware intent understanding, which involves inferring user intentions from...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

SE-Search: Self-Evolving Search Agent via Memory and Dense Reward

arXiv:2603.03293v1 Announce Type: new Abstract: Retrieval augmented generation (RAG) reduces hallucinations and factual errors in large language models (LLMs) by conditioning generation on retrieved external knowledge. Recent search agents further cast RAG as an autonomous, multi-turn information-seeking process. However, existing...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

How LLMs Cite and Why It Matters: A Cross-Model Audit of Reference Fabrication in AI-Assisted Academic Writing and Methods to Detect Phantom Citations

arXiv:2603.03299v1 Announce Type: new Abstract: Large language models (LLMs) have been noted to fabricate scholarly citations, yet the scope of this behavior across providers, domains, and prompting conditions remains poorly quantified. We present one of the largest citation hallucination audits...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

Benchmarking Legal RAG: The Promise and Limits of AI Statutory Surveys

arXiv:2603.03300v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) offers significant potential for legal AI, yet systematic benchmarks are sparse. Prior work introduced LaborBench to benchmark RAG models based on ostensible ground truth from an exhaustive, multi-month, manual enumeration of all...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

Combating data scarcity in recommendation services: Integrating cognitive types of VARK and neural network technologies (LLM)

arXiv:2603.03309v1 Announce Type: new Abstract: Cold start scenarios present fundamental obstacles to effective recommendation generation, particularly when dealing with users lacking interaction history or items with sparse metadata. This research proposes an innovative hybrid framework that leverages Large Language Models...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

Entropic-Time Inference: Self-Organizing Large Language Model Decoding Beyond Attention

arXiv:2603.03310v1 Announce Type: new Abstract: Modern large language model (LLM) inference engines optimize throughput and latency under fixed decoding rules, treating generation as a linear progression in token time. We propose a fundamentally different paradigm: entropic\-time inference, where decoding is...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

The Logovista English-Japanese Machine Translation System

arXiv:2603.03311v1 Announce Type: new Abstract: This paper documents the architecture, development practices, and preserved artifacts of the Logovista English--Japanese machine translation system, a large, explicitly rule-based MT system that was developed and sold commercially from the early 1990s through at...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

StructLens: A Structural Lens for Language Models via Maximum Spanning Trees

arXiv:2603.03328v1 Announce Type: new Abstract: Language exhibits inherent structures, a property that explains both language acquisition and language change. Given this characteristic, we expect language models to manifest internal structures as well. While interpretability research has investigated the components of...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

AutoHarness: improving LLM agents by automatically synthesizing a code harness

arXiv:2603.03329v1 Announce Type: new Abstract: Despite significant strides in language models in the last few years, when used as agents, such models often try to perform actions that are not just suboptimal for a given state, but are strictly prohibited...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

PulseLM: A Foundation Dataset and Benchmark for PPG-Text Learning

arXiv:2603.03331v1 Announce Type: new Abstract: Photoplethysmography (PPG) is a widely used non-invasive sensing modality for continuous cardiovascular and physiological monitoring across clinical, laboratory, and wearable settings. While existing PPG datasets support a broad range of downstream tasks, they typically provide...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

The CompMath-MCQ Dataset: Are LLMs Ready for Higher-Level Math?

arXiv:2603.03334v1 Announce Type: new Abstract: The evaluation of Large Language Models (LLMs) on mathematical reasoning has largely focused on elementary problems, competition-style questions, or formal theorem proving, leaving graduate-level and computational mathematics relatively underexplored. We introduce CompMath-MCQ, a new benchmark...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification

arXiv:2603.03336v1 Announce Type: new Abstract: Rankings derived from pairwise comparisons are central to many economic and computational systems. In the context of large language models (LLMs), rankings are typically constructed from human preference data and presented as leaderboards that guide...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Tracing Pharmacological Knowledge In Large Language Models

arXiv:2603.03407v1 Announce Type: new Abstract: Large language models (LLMs) have shown strong empirical performance across pharmacology and drug discovery tasks, yet the internal mechanisms by which they encode pharmacological knowledge remain poorly understood. In this work, we investigate how drug-group...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs

arXiv:2603.03415v1 Announce Type: new Abstract: In this work, we investigate how Large Language Models (LLMs) adapt their internal representations when encountering inputs of increasing difficulty, quantified as the degree of out-of-distribution (OOD) shift. We reveal a consistent and quantifiable phenomenon:...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi

arXiv:2603.03508v1 Announce Type: new Abstract: The dominance of large multilingual foundation models has widened linguistic inequalities in Natural Language Processing (NLP), often leaving low-resource languages underrepresented. This paper introduces LilMoo, a 0.6-billion-parameter Hindi language model trained entirely from scratch to...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

A theoretical model of dynamical grammatical gender shifting based on set-valued set function

arXiv:2603.03510v1 Announce Type: new Abstract: This study investigates the diverse characteristics of nouns, focusing on both semantic (e.g., countable/uncountable) and morphosyntactic (e.g., masculine/feminine) distinctions. We explore inter-word variations for gender markers in noun morphology. Grammatical gender shift is a widespread...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis

arXiv:2603.03378v1 Announce Type: new Abstract: Large language model (LLM) agents offer a promising data-driven approach to automating Site Reliability Engineering (SRE), yet their enterprise deployment is constrained by three challenges: restricted access to proprietary data, unsafe action execution under permission-governed...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

arXiv:2603.03388v1 Announce Type: new Abstract: Recent neural solvers have achieved strong performance on vehicle routing problems (VRPs), yet they mainly assume symmetric Euclidean distances, restricting applicability to real-world scenarios. A core challenge is encoding the relational features in asymmetric distance...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

Towards Improved Sentence Representations using Token Graphs

arXiv:2603.03389v1 Announce Type: new Abstract: Obtaining a single-vector representation from a Large Language Model's (LLM) token-level outputs is a critical step for nearly all sentence-level tasks. However, standard pooling methods like mean or max aggregation treat tokens as an independent...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

[Re] FairDICE: A Gap Between Theory And Practice

arXiv:2603.03454v1 Announce Type: new Abstract: Offline Reinforcement Learning (RL) is an emerging field of RL in which policies are learned solely from demonstrations. Within offline RL, some environments involve balancing multiple objectives, but existing multi-objective offline RL algorithms do not...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

arXiv:2603.03459v1 Announce Type: new Abstract: We investigate when transformer MLP nonlinearity is actually necessary. A gate with $d+1$ parameters decides when to replace the full MLP with a linear surrogate. Through systematic investigation across six models (162M-2.8B parameters), two architectures,...

1 min 1 month, 3 weeks ago

ear

HumanLM: Simulating Users with State Alignment Beats Response Imitation

Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation

How does fine-tuning improve sensorimotor representations in large language models?

Towards Self-Robust LLMs: Intrinsic Prompt Noise Resistance via CoIPO

M-QUEST -- Meme Question-Understanding Evaluation on Semantics and Toxicity

The Influence of Iconicity in Transfer Learning for Sign Language Recognition

Can Large Language Models Derive New Knowledge? A Dynamic Benchmark for Biological Knowledge Discovery

Discern Truth from Falsehood: Reducing Over-Refusal via Contrastive Refinement

Controlling Chat Style in Language Models via Single-Direction Editing

IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference

SE-Search: Self-Evolving Search Agent via Memory and Dense Reward

How LLMs Cite and Why It Matters: A Cross-Model Audit of Reference Fabrication in AI-Assisted Academic Writing and Methods to Detect Phantom Citations

Benchmarking Legal RAG: The Promise and Limits of AI Statutory Surveys

Combating data scarcity in recommendation services: Integrating cognitive types of VARK and neural network technologies (LLM)

Entropic-Time Inference: Self-Organizing Large Language Model Decoding Beyond Attention

The Logovista English-Japanese Machine Translation System

StructLens: A Structural Lens for Language Models via Maximum Spanning Trees

AutoHarness: improving LLM agents by automatically synthesizing a code harness

PulseLM: A Foundation Dataset and Benchmark for PPG-Text Learning

The CompMath-MCQ Dataset: Are LLMs Ready for Higher-Level Math?

Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification

Tracing Pharmacological Knowledge In Large Language Models

Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs

Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi

A theoretical model of dynamical grammatical gender shifting based on set-valued set function

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

Towards Improved Sentence Representations using Token Graphs

[Re] FairDICE: A Gap Between Theory And Practice

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.