Intellectual Property

LOW Academic International

Elaborating a Human Rights-Friendly Copyright Framework for Generative AI

1 min 1 month, 1 week ago

copyright

LOW Academic United States

Language Shapes Mental Health Evaluations in Large Language Models

arXiv:2603.06910v1 Announce Type: new Abstract: This study investigates whether large language models (LLMs) exhibit cross-linguistic differences in mental health evaluations. Focusing on Chinese and English, we examine two widely used models, GPT-4o and Qwen3, to assess whether prompt language systematically...

1 min 1 month, 2 weeks ago

ip

LOW Conference International

BROADENING PARTICIPATION (BP)

1 min 1 month, 2 weeks ago

ip

LOW Academic International

"Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior

arXiv:2603.06816v1 Announce Type: new Abstract: The alignment problem refers to concerns regarding powerful intelligences, ensuring compatibility with human preferences and values as capabilities increase. Current large language models (LLMs) show misaligned behaviors, such as strategic deception, manipulation, and reward-seeking, that...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Rethinking Personalization in Large Language Models at the Token Level

arXiv:2603.06595v1 Announce Type: new Abstract: With large language models (LLMs) now performing strongly across diverse tasks, there is growing demand for them to personalize outputs for individual users. Personalization is typically framed as an additional layer on top of a...

1 min 1 month, 2 weeks ago

ip

LOW Academic United States

Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation

arXiv:2603.06593v1 Announce Type: new Abstract: Retrieval-augmented code generation often conditions the decoder on large retrieved code snippets. This ties online inference cost to repository size and introduces noise from long contexts. We present Hierarchical Embedding Fusion (HEF), a two-stage approach...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Deep Research, Shallow Evaluation: A Case Study in Meta-Evaluation for Long-Form QA Benchmarks

arXiv:2603.06942v1 Announce Type: new Abstract: Recent advances have made long-form report-generating systems widely available. This has prompted evaluation frameworks that use LLM-as-judge protocols and claim verification, along with meta-evaluation frameworks that seek to validate these methods. Many of the meta-evaluations...

1 min 1 month, 2 weeks ago

nda

LOW Academic International

Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialogues

arXiv:2603.06974v1 Announce Type: new Abstract: We present Elenchus, a dialogue system for knowledge base construction grounded in inferentialist semantics, where knowledge engineering is re-conceived as explicitation rather than extraction from expert testimony or textual content. A human expert develops a...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models

arXiv:2603.07017v1 Announce Type: new Abstract: Safety alignment is critical for deploying large language models (LLMs) in real-world applications, yet most existing approaches rely on large human-annotated datasets and static red-teaming benchmarks that are costly, difficult to scale, and slow to...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge

arXiv:2603.07019v1 Announce Type: new Abstract: Checklists have emerged as a popular approach for interpretable and fine-grained evaluation, particularly with LLM-as-a-Judge. Beyond evaluation, these structured criteria can serve as signals for model alignment, reinforcement learning, and self-correction. To support these use...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment

arXiv:2603.07023v1 Announce Type: new Abstract: Despite the promise of Retrieval-Augmented Generation in grounding Multimodal Large Language Models with external knowledge, the transition to extensive contexts often leads to significant attention dilution and reasoning hallucinations. The surge in information density causes...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language

arXiv:2603.07138v1 Announce Type: new Abstract: Emotion Recognition in Conversation (ERC) is critical for enabling natural human-machine interactions. However, existing methods predominantly employ categorical or dimensional emotion annotations, which often fail to adequately represent complex, subtle, or culturally specific emotional nuances....

1 min 1 month, 2 weeks ago

ip

LOW Academic European Union

Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing

arXiv:2603.07202v1 Announce Type: new Abstract: As Large Language Models (LLMs) transition into autonomous agentic roles, the risk of deception-defined behaviorally as the systematic provision of false information to satisfy external incentives-poses a significant challenge to AI safety. Existing benchmarks often...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Scaling Self-Supervised Speech Models Uncovers Deep Linguistic Relationships: Evidence from the Pacific Cluster

arXiv:2603.07238v1 Announce Type: new Abstract: Similarities between language representations derived from Self-Supervised Speech Models (S3Ms) have been observed to primarily reflect geographic proximity or surface typological similarities driven by recent expansion or contact, potentially missing deeper genealogical signals. We investigate...

1 min 1 month, 2 weeks ago

ip

LOW Academic European Union

RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts

arXiv:2603.07366v1 Announce Type: new Abstract: Many errors in student essays can be explained by influence from the native language (L1). L1 interference refers to errors influenced by a speaker's first language, such as using stadion instead of stadium, reflecting lexical...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios

arXiv:2603.07372v1 Announce Type: new Abstract: Quality Estimation (QE) is essential for assessing machine translation quality in reference-less settings, particularly for domain-specific and low-resource language scenarios. In this paper, we investigate sentence-level QE for English to Indic machine translation across four...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

arXiv:2603.07392v1 Announce Type: new Abstract: LLMs operating in dynamic real-world contexts often encounter knowledge that evolves continuously or emerges incrementally. To remain accurate and effective, models must adapt to newly arriving information on the fly. We introduce Online Adaptation to...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling

arXiv:2603.07461v1 Announce Type: new Abstract: Standard transformers entangle all computation in a single residual stream, obscuring which components perform which functions. We introduce the Dual-Stream Transformer, which decomposes the residual stream into two functionally distinct components: a token stream updated...

1 min 1 month, 2 weeks ago

nda

LOW Academic European Union

A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text

arXiv:2603.07487v1 Announce Type: new Abstract: Clinical information extraction (e.g., 2010 i2b2/VA challenge) usually presents tasks of concept recognition, assertion classification, and relation extraction. Jointly modeling the multi-stage tasks in the clinical domain is an underexplored topic. The existing independent task...

1 min 1 month, 2 weeks ago

ip

LOW Academic European Union

Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech

arXiv:2603.07513v1 Announce Type: new Abstract: Kashmiri is spoken by around 7 million people but remains critically underserved in speech technology, despite its official status and rich linguistic heritage. The lack of robust Text-to-Speech (TTS) systems limits digital accessibility and inclusive...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

arXiv:2603.07534v1 Announce Type: new Abstract: Accent is an integral part of society, reflecting multiculturalism and shaping how individuals express identity. The majority of English speakers are non-native (L2) speakers, yet current Text-To-Speech (TTS) systems primarily model American-accented English due limited...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

MAWARITH: A Dataset and Benchmark for Legal Inheritance Reasoning with LLMs

arXiv:2603.07539v1 Announce Type: new Abstract: Islamic inheritance law ('ilm al-mawarith) is challenging for large language models because solving inheritance cases requires complex, structured multi-step reasoning and the correct application of juristic rules to compute heirs' shares. We introduce MAWARITH, a...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Learning-free L2-Accented Speech Generation using Phonological Rules

arXiv:2603.07550v1 Announce Type: new Abstract: Accent plays a crucial role in speaker identity and inclusivity in speech technologies. Existing accented text-to-speech (TTS) systems either require large-scale accented datasets or lack fine-grained phoneme-level controllability. We propose a accented TTS framework that...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

arXiv:2603.07779v1 Announce Type: new Abstract: Training next-generation code generation models requires high-quality datasets, yet existing datasets face difficulty imbalance, format inconsistency, and data quality problems. We address these challenges through systematic data processing and difficulty scaling. We introduce a four-stage...

1 min 1 month, 2 weeks ago

ip

LOW Academic United States

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

arXiv:2603.07792v1 Announce Type: new Abstract: Large language models (LLMs) increasingly influence global digital ecosystems, yet their potential to perpetuate social and cultural biases remains poorly understood in underrepresented contexts. This study presents a systematic analysis of representational biases in seven...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

FuzzingRL: Reinforcement Fuzz-Testing for Revealing VLM Failures

arXiv:2603.06600v1 Announce Type: new Abstract: Vision Language Models (VLMs) are prone to errors, and identifying where these errors occur is critical for ensuring the reliability and safety of AI systems. In this paper, we propose an approach that automatically generates...

1 min 1 month, 2 weeks ago

ip

LOW Academic International

Khatri-Rao Clustering for Data Summarization

arXiv:2603.06602v1 Announce Type: new Abstract: As datasets continue to grow in size and complexity, finding succinct yet accurate data summaries poses a key challenge. Centroid-based clustering, a widely adopted approach to address this challenge, finds informative summaries of datasets in...

1 min 1 month, 2 weeks ago

nda

LOW Academic United States

Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

arXiv:2603.06604v1 Announce Type: new Abstract: As large language models (LLMs) are increasingly deployed in critical decision-making systems, the lack of reliable methods to measure their uncertainty presents a fundamental trustworthiness risk. We introduce a normalized confidence score based on output...

1 min 1 month, 2 weeks ago

nda

LOW Academic International

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

arXiv:2603.06605v1 Announce Type: new Abstract: Electronic health records (EHR) are irregular, asynchronous multivariate time series. As time-series foundation models increasingly tokenize events rather than discretizing time, the input layout becomes a key design choice. Grids expose time$\times$variable structure but require...

1 min 1 month, 2 weeks ago

nda

LOW Academic International

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

arXiv:2603.06609v1 Announce Type: new Abstract: Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or p-values for assessing whether individual...

1 min 1 month, 2 weeks ago

nda

Elaborating a Human Rights-Friendly Copyright Framework for Generative AI

Language Shapes Mental Health Evaluations in Large Language Models

BROADENING PARTICIPATION (BP)

"Dark Triad" Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior

Rethinking Personalization in Large Language Models at the Token Level

Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation

Deep Research, Shallow Evaluation: A Case Study in Meta-Evaluation for Long-Form QA Benchmarks

Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialogues

Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models

AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge

Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment

Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language

Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing

Scaling Self-Supervised Speech Models Uncovers Deep Linguistic Relationships: Evidence from the Pacific Cluster

RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling

A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text

Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech

Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

MAWARITH: A Dataset and Benchmark for Legal Inheritance Reasoning with LLMs

Learning-free L2-Accented Speech Generation using Phonological Rules

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

FuzzingRL: Reinforcement Fuzz-Testing for Revealing VLM Failures

Khatri-Rao Clustering for Data Summarization

Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.