Litigation

LOW Academic International

Language Model Goal Selection Differs from Humans' in an Open-Ended Task

arXiv:2603.03295v1 Announce Type: cross Abstract: As large language models (LLMs) get integrated into human decision-making, they are increasingly choosing goals autonomously rather than only completing human-defined ones, assuming they will reflect human preferences. However, human-LLM similarity in goal selection remains...

1 min 1 month, 1 week ago

discovery

LOW Academic International

HumanLM: Simulating Users with State Alignment Beats Response Imitation

arXiv:2603.03303v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used to simulate how specific users respond to a given context, enabling more user-centric applications that rely on user feedback. However, existing user simulators mostly imitate surface-level patterns and...

1 min 1 month, 1 week ago

motion

LOW Academic International

Quantum-Inspired Self-Attention in a Large Language Model

arXiv:2603.03318v1 Announce Type: cross Abstract: Recent advances in Natural Language Processing have been predominantly driven by transformer-based architectures, which rely heavily on self-attention mechanisms to model relationships between tokens in a sequence. Similarly, the field of Quantum Natural Language Processing,...

1 min 1 month, 1 week ago

standing

LOW Academic International

Can Large Language Models Derive New Knowledge? A Dynamic Benchmark for Biological Knowledge Discovery

arXiv:2603.03322v1 Announce Type: cross Abstract: Recent advancements in Large Language Model (LLM) agents have demonstrated remarkable potential in automatic knowledge discovery. However, rigorously evaluating an AI's capacity for knowledge discovery remains a critical challenge. Existing benchmarks predominantly rely on static...

1 min 1 month, 1 week ago

discovery

LOW Academic International

IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference

arXiv:2603.03325v1 Announce Type: cross Abstract: Large language models (LLMs) have become integral to modern Human-AI collaboration workflows, where accurately understanding user intent serves as a crucial step for generating satisfactory responses. Context-aware intent understanding, which involves inferring user intentions from...

1 min 1 month, 1 week ago

standing

LOW Academic International

SE-Search: Self-Evolving Search Agent via Memory and Dense Reward

arXiv:2603.03293v1 Announce Type: new Abstract: Retrieval augmented generation (RAG) reduces hallucinations and factual errors in large language models (LLMs) by conditioning generation on retrieved external knowledge. Recent search agents further cast RAG as an autonomous, multi-turn information-seeking process. However, existing...

1 min 1 month, 1 week ago

evidence

LOW Academic International

StructLens: A Structural Lens for Language Models via Maximum Spanning Trees

arXiv:2603.03328v1 Announce Type: new Abstract: Language exhibits inherent structures, a property that explains both language acquisition and language change. Given this characteristic, we expect language models to manifest internal structures as well. While interpretability research has investigated the components of...

1 min 1 month, 1 week ago

standing

LOW Academic International

Tracing Pharmacological Knowledge In Large Language Models

arXiv:2603.03407v1 Announce Type: new Abstract: Large language models (LLMs) have shown strong empirical performance across pharmacology and drug discovery tasks, yet the internal mechanisms by which they encode pharmacological knowledge remain poorly understood. In this work, we investigate how drug-group...

1 min 1 month, 1 week ago

discovery

LOW Academic International

A theoretical model of dynamical grammatical gender shifting based on set-valued set function

arXiv:2603.03510v1 Announce Type: new Abstract: This study investigates the diverse characteristics of nouns, focusing on both semantic (e.g., countable/uncountable) and morphosyntactic (e.g., masculine/feminine) distinctions. We explore inter-word variations for gender markers in noun morphology. Grammatical gender shift is a widespread...

1 min 1 month, 1 week ago

standing

LOW Academic International

Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

arXiv:2603.03535v1 Announce Type: new Abstract: While large language models (LLMs) fine-tuned with lightweight adapters achieve strong performance across diverse tasks, their performance on individual tasks depends on the fine-tuning strategy. Fusing independently trained models with different strengths has shown promise...

1 min 1 month, 1 week ago

standing

LOW Academic International

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

arXiv:2603.03756v1 Announce Type: new Abstract: While large language models (LLMs) show promise in scientific discovery, existing research focuses on inference or feedback-driven training, leaving the direct modeling of the generative reasoning process, $P(\text{hypothesis}|\text{background})$ ($P(h|b)$), unexplored. We demonstrate that directly training...

1 min 1 month, 1 week ago

discovery

LOW Academic International

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

arXiv:2603.03818v1 Announce Type: new Abstract: Continual learning is a long-standing challenge in robot policy learning, where a policy must acquire new skills over time without catastrophically forgetting previously learned ones. While prior work has extensively studied continual learning in relatively...

1 min 1 month, 1 week ago

standing

LOW Academic International

LaTeX Compilation: Challenges in the Era of LLMs

arXiv:2603.02873v1 Announce Type: new Abstract: As large language models (LLMs) increasingly assist scientific writing, limitations and the significant token cost of TeX become more and more visible. This paper analyzes TeX's fundamental defects in compilation and user experience design to...

1 min 1 month, 2 weeks ago

appeal

LOW Academic International

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

arXiv:2603.03202v1 Announce Type: new Abstract: As large language models (LLMs) advance their mathematical capabilities toward the IMO level, the scarcity of challenging, high-quality problems for training and evaluation has become a significant bottleneck. Simultaneously, recent code agents have demonstrated sophisticated...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

arXiv:2603.03205v1 Announce Type: new Abstract: Agentic language models operate in a fundamentally different safety regime than chat models: they must plan, call tools, and execute long-horizon actions where a single misstep, such as accessing files or entering credentials, can cause...

1 min 1 month, 2 weeks ago

class action

LOW Academic International

Using Learning Progressions to Guide AI Feedback for Science Learning

arXiv:2603.03249v1 Announce Type: new Abstract: Generative artificial intelligence (AI) offers scalable support for formative feedback, yet most AI-generated feedback relies on task-specific rubrics authored by domain experts. While effective, rubric authoring is time-consuming and limits scalability across instructional contexts. Learning...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat

arXiv:2603.02227v1 Announce Type: cross Abstract: Can a transformer learn which attention entries matter during training? In principle, yes: attention distributions are highly concentrated, and a small gate network can identify the important entries post-hoc with near-perfect accuracy. In practice, barely....

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

Safety Training Persists Through Helpfulness Optimization in LLM Agents

arXiv:2603.02229v1 Announce Type: cross Abstract: Safety post-training has been studied extensively in single-step "chat" settings where safety typically refers to refusing harmful requests. We study an "agentic" (i.e., multi-step, tool-use) setting where safety refers to harmful actions directly taken by...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

arXiv:2603.02406v1 Announce Type: new Abstract: Generative models have recently advanced $\textit{de novo}$ protein design by learning the statistical regularities of natural structures. However, current approaches face three key limitations: (1) Existing methods cannot jointly learn protein geometry and design tasks,...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

A Unified Revisit of Temperature in Classification-Based Knowledge Distillation

arXiv:2603.02430v1 Announce Type: new Abstract: A central idea of knowledge distillation is to expose relational structure embedded in the teacher's weights for the student to learn, which is often facilitated using a temperature parameter. Despite its widespread use, there remains...

1 min 1 month, 2 weeks ago

standing

LOW News International

Lawsuit: Google Gemini sent man on violent missions, set suicide "countdown"

Gemini allegedly called man its "husband," said they could be together in death.

1 min 1 month, 2 weeks ago

lawsuit

LOW Academic International

Distribution-Aware Companding Quantization of Large Language Models

arXiv:2603.00364v1 Announce Type: new Abstract: Large language models such as GPT and Llama are trained with a next-token prediction loss. In this work, we suggest that training language models to predict multiple future tokens at once results in higher sample...

1 min 1 month, 2 weeks ago

appeal

LOW Academic International

CIRCUS: Circuit Consensus under Uncertainty via Stability Ensembles

arXiv:2603.00523v1 Announce Type: new Abstract: Mechanistic circuit discovery is notoriously sensitive to arbitrary analyst choices, especially pruning thresholds and feature dictionaries, often yielding brittle "one-shot" explanations with no principled notion of uncertainty. We reframe circuit discovery as an uncertainty-quantification problem...

1 min 1 month, 2 weeks ago

discovery

LOW Academic International

Super Research: Answering Highly Complex Questions with Large Language Models through Super Deep and Super Wide Research

arXiv:2603.00582v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated proficiency in Deep Research or Wide Search, their capacity to solve highly complex questions (those requiring long-horizon planning, massive evidence gathering, and synthesis across heterogeneous sources) remains largely unexplored. We...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

From Literature to Hypotheses: An AI Co-Scientist System for Biomarker-Guided Drug Combination Hypothesis Generation

arXiv:2603.00612v1 Announce Type: new Abstract: The rapid growth of biomedical literature and curated databases has made it increasingly difficult for researchers to systematically connect biomarker mechanisms to actionable drug combination hypotheses. We present AI Co-Scientist (CoDHy), an interactive, human-in-the-loop system...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs

arXiv:2603.00669v1 Announce Type: new Abstract: Sustainability disclosure standards (e.g., GRI, SASB, TCFD, IFRS S2) are comprehensive yet lengthy, terminology-dense, and highly cross-referential, hindering structured analysis and downstream use. We present SSKG Hub (Sustainability Standards Knowledge Graph Hub), a research prototype...

1 min 1 month, 2 weeks ago

evidence

LOW Academic International

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

arXiv:2603.00686v1 Announce Type: new Abstract: Large Language Models have evolved from single-round generators into long-horizon agents, capable of complex text synthesis scenarios. However, current evaluation frameworks lack the ability to assess the actual synthesis operations, such as outlining, drafting, and...

1 min 1 month, 2 weeks ago

standing

LOW Academic International

Thoth: Mid-Training Bridges LLMs to Time Series Understanding

arXiv:2603.01042v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable success in general-purpose reasoning. However, they still struggle to understand and reason about time series data, which limits their effectiveness in decision-making scenarios that depend on temporal dynamics....

1 min 1 month, 2 weeks ago

standing

LOW Academic International

M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

arXiv:2603.00055v1 Announce Type: new Abstract: Although multimodal large language models (MLLMs) have advanced industrial anomaly detection toward a zero-shot paradigm, they still tend to produce high-confidence yet unreliable decisions in fine-grained and structurally complex industrial scenarios, and lack effective self-corrective...

1 min 1 month, 2 weeks ago

trial

LOW Academic International

Certainty-Validity: A Diagnostic Framework for Discrete Commitment Systems

arXiv:2603.00070v1 Announce Type: new Abstract: Standard evaluation metrics for machine learning -- accuracy, precision, recall, and AUROC -- assume that all errors are equivalent: a confident incorrect prediction is penalized identically to an uncertain one. For discrete commitment systems (architectures...

1 min 1 month, 2 weeks ago

evidence
