International Law

LOW Academic International

Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge

arXiv:2604.02621v1 Announce Type: new Abstract: Reinforcement Learning (RL) has been shown to substantially improve the reasoning capability of small and large language models (LLMs), but existing approaches typically rely on verifiable rewards, hence ground truth labels. We propose an RL...

1 min 1 week, 5 days ago

ear

LOW Academic International

An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code

arXiv:2604.02352v1 Announce Type: cross Abstract: Although LLMs are capable of generating functionally correct code, they also tend to produce less energy-efficient code in comparison to human-written solutions. As these inefficiencies lead to higher computational overhead, they are in direct conflict...

1 min 1 week, 5 days ago

ear

LOW Academic United States

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking

arXiv:2604.02971v1 Announce Type: new Abstract: Recent agentic search systems have made substantial progress by emphasising deep, multi-step reasoning. However, this focus often overlooks the challenges of wide-scale information synthesis, where agents must aggregate large volumes of heterogeneous evidence across many...

1 min 1 week, 5 days ago

ear

LOW Academic United States

A Spectral Framework for Multi-Scale Nonlinear Dimensionality Reduction

arXiv:2604.02535v1 Announce Type: new Abstract: Dimensionality reduction (DR) is characterized by two longstanding trade-offs. First, there is a global-local preservation tension: methods such as t-SNE and UMAP prioritize local neighborhood preservation, yet may distort global manifold structure, while methods such...

1 min 1 week, 5 days ago

ear

LOW Academic United States

Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models

arXiv:2604.03157v1 Announce Type: new Abstract: The recent advancements in Vision Language Models (VLMs) have demonstrated progress toward true intelligence requiring robust reasoning capabilities. Beyond pattern recognition, linguistic reasoning must integrate with visual comprehension, particularly for Chart Question Answering (CQA) tasks...

1 min 1 week, 5 days ago

ear

LOW Academic International

OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration

arXiv:2604.02349v1 Announce Type: cross Abstract: Preference-based reinforcement learning (PbRL) can help avoid sophisticated reward designs and align better with human intentions, showing great promise in various real-world applications. However, obtaining human feedback for preferences can be expensive and time-consuming, which...

1 min 1 week, 5 days ago

ear

LOW Academic European Union

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

arXiv:2604.02651v1 Announce Type: new Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world scenarios. Learning from extremely large graphs requires distributed training, and mini-batching with sampling is a popular approach for parallelizing...

1 min 1 week, 5 days ago

ear

LOW Academic International

VALOR: Value-Aware Revenue Uplift Modeling with Treatment-Gated Representation for B2B Sales

arXiv:2604.02472v1 Announce Type: new Abstract: B2B sales organizations must identify "persuadable" accounts within zero-inflated revenue distributions to optimize expensive human resource allocation. Standard uplift frameworks struggle with treatment signal collapse in high-dimensional spaces and a misalignment between regression calibration and...

1 min 1 week, 5 days ago

ear

LOW Academic International

TRACE: Traceroute-based Internet Route change Analysis with Ensemble Learning

arXiv:2604.02361v1 Announce Type: cross Abstract: Detecting Internet routing instability is a critical yet challenging task, particularly when relying solely on endpoint active measurements. This study introduces TRACE, a MachineLearning (ML)pipeline designed to identify route changes using only traceroute latency data,...

1 min 1 week, 5 days ago

ear

LOW Academic International

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

arXiv:2604.02340v1 Announce Type: new Abstract: Recent advances in masked diffusion language models (MDLMs) narrow the quality gap to autoregressive LMs, but their sampling remains expensive because generation requires many full-sequence denoising passes with a large Transformer and, unlike autoregressive decoding,...

1 min 1 week, 5 days ago

ear

LOW Academic International

An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages

arXiv:2604.02596v1 Announce Type: new Abstract: In-context learning (ICL) allows large language models (LLMs) to adapt to new tasks from a few examples, making it promising for languages underrepresented in pre-training. Recent work on many-shot ICL suggests that modern LLMs can...

1 min 1 week, 5 days ago

ear

LOW Academic United States

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

arXiv:2604.02338v1 Announce Type: new Abstract: MoE-PEFT methods combine Mixture of Experts with parameter-efficient fine-tuning for multi-task adaptation, but require separate adapters per expert causing trainable parameters to scale linearly with expert count and limiting applicability to adapter-based architectures. We propose...

1 min 1 week, 5 days ago

ear

LOW Academic International

SWAY: A Counterfactual Computational Linguistic Approach to Measuring and Mitigating Sycophancy

arXiv:2604.02423v1 Announce Type: new Abstract: Large language models exhibit sycophancy: the tendency to shift outputs toward user-expressed stances, regardless of correctness or consistency. While prior work has studied this issue and its impacts, rigorous computational linguistic metrics are needed to...

1 min 1 week, 5 days ago

ear

LOW Academic European Union

NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons

arXiv:2604.02972v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) have recently achieved remarkable success in complex reasoning tasks. However, closer scrutiny reveals persistent failure modes compromising performance and cost: I) Intra-step level, marked by calculation or derivation errors; II) Inter-step...

1 min 1 week, 5 days ago

ear

LOW Academic European Union

Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy

arXiv:2604.02709v1 Announce Type: new Abstract: The formal reasoning capabilities of LLMs are crucial for advancing automated software engineering. However, existing benchmarks for LLMs lack systematic evaluation based on computation and complexity, leaving a critical gap in understanding their formal reasoning...

1 min 1 week, 5 days ago

ear

LOW Academic International

LLM Reasoning with Process Rewards for Outcome-Guided Steps

arXiv:2604.02341v1 Announce Type: cross Abstract: Mathematical reasoning in large language models has improved substantially with reinforcement learning using verifiable rewards, where final answers can be checked automatically and converted into reliable training signals. Most such pipelines optimize outcome correctness only,...

1 min 1 week, 5 days ago

ear

LOW Academic United States

Overcoming the "Impracticality" of RAG: Proposing a Real-World Benchmark and Multi-Dimensional Diagnostic Framework

arXiv:2604.02640v1 Announce Type: new Abstract: Performance evaluation of Retrieval-Augmented Generation (RAG) systems within enterprise environments is governed by multi-dimensional and composite factors extending far beyond simple final accuracy checks. These factors include reasoning complexity, retrieval difficulty, the diverse structure of...

1 min 1 week, 5 days ago

ear

LOW Academic International

ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents

arXiv:2604.02834v1 Announce Type: new Abstract: Longitudinal health agents must reason across multi-source trajectories that combine continuous device streams, sparse clinical exams, and episodic life events - yet evaluating them is hard: real-world data cannot be released at scale, and temporally...

1 min 1 week, 5 days ago

ear

LOW Academic International

YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

arXiv:2604.02378v1 Announce Type: new Abstract: Forecasting startup success is notoriously difficult, partly because meaningful outcomes, such as exits, large funding rounds, and sustained revenue growth, are rare and can take years to materialize. As a result, signals are sparse and...

1 min 1 week, 5 days ago

ear

LOW Academic United States

Compositional Neuro-Symbolic Reasoning

arXiv:2604.02434v1 Announce Type: new Abstract: We study structured abstraction-based reasoning for the Abstraction and Reasoning Corpus (ARC) and compare its generalization to test-time approaches. Purely neural architectures lack reliable combinatorial generalization, while strictly symbolic systems struggle with perceptual grounding. We...

1 min 1 week, 5 days ago

ear

LOW Academic International

Do We Need Frontier Models to Verify Mathematical Proofs?

arXiv:2604.02450v1 Announce Type: new Abstract: Advances in training, post-training, and inference-time methods have enabled frontier reasoning models to win gold medals in math competitions and settle challenging open problems. Gaining trust in the responses of these models requires that natural...

1 min 1 week, 5 days ago

ear

LOW Academic International

Verbalizing LLMs' assumptions to explain and control sycophancy

arXiv:2604.03058v1 Announce Type: new Abstract: LLMs can be socially sycophantic, affirming users when they ask questions like "am I in the wrong?" rather than providing genuine assessment. We hypothesize that this behavior arises from incorrect assumptions about the user, like...

1 min 1 week, 5 days ago

ear

LOW Academic International

Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks

arXiv:2604.02795v1 Announce Type: new Abstract: Rubric-based Reinforcement Learning (RL) has emerged as a promising approach for aligning Large Language Models (LLMs) with complex, open-domain instruction following tasks. However, existing methods predominantly rely on response-level rewards, introducing severe reward sparsity and...

1 min 1 week, 5 days ago

ear

LOW Academic United States

Homophily-aware Supervised Contrastive Counterfactual Augmented Fair Graph Neural Network

arXiv:2604.02342v1 Announce Type: new Abstract: In recent years, Graph Neural Networks (GNNs) have achieved remarkable success in tasks such as node classification, link prediction, and graph representation learning. However, they remain susceptible to biases that can arise not only from...

1 min 1 week, 5 days ago

ear

LOW Academic International

R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning

arXiv:2604.03004v1 Announce Type: new Abstract: While deep reasoning with long chain-of-thought has dramatically improved large language models in verifiable domains like mathematics, its effectiveness for open-ended tasks such as writing remains unexplored. In this paper, we conduct a systematic investigation...

1 min 1 week, 5 days ago

ear

LOW Academic South Korea

Time-Warping Recurrent Neural Networks for Transfer Learning

arXiv:2604.02474v1 Announce Type: new Abstract: Dynamical systems describe how a physical system evolves over time. Physical processes can evolve faster or slower in different environmental conditions. We use time-warping as rescaling the time in a model of a physical system....

1 min 1 week, 5 days ago

ear

LOW Academic International

When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs

arXiv:2604.02778v1 Announce Type: new Abstract: Real-world multimodal knowledge graphs (MMKGs) are dynamic, with new entities, relations, and multimodal knowledge emerging over time. Existing continual knowledge graph reasoning (CKGR) methods focus on structural triples and cannot fully exploit multimodal signals from...

1 min 1 week, 5 days ago

ear

LOW Academic International

BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition

arXiv:2604.02904v1 Announce Type: new Abstract: In this article, we present a gold-standard benchmark dataset for Biomedical Urdu Named Entity Recognition (BioUNER), developed by crawling health-related articles from online Urdu news portals, medical prescriptions, and hospital health blogs and websites. After...

1 min 1 week, 5 days ago

ear

LOW Academic United States

Communication-Efficient Distributed Learning with Differential Privacy

arXiv:2604.02558v1 Announce Type: new Abstract: We address nonconvex learning problems over undirected networks. In particular, we focus on the challenge of designing an algorithm that is both communication-efficient and that guarantees the privacy of the agents' data. The first goal...

1 min 1 week, 5 days ago

ear

LOW Academic International

Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation

arXiv:2604.03174v1 Announce Type: new Abstract: Large language models (LLMs) encode vast world knowledge in their parameters, yet they remain fundamentally limited by static knowledge, finite context windows, and weakly structured causal reasoning. This survey provides a unified account of augmentation...

1 min 1 week, 5 days ago

ear

Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge

An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking

A Spectral Framework for Multi-Scale Nonlinear Dimensionality Reduction

Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models

OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

VALOR: Value-Aware Revenue Uplift Modeling with Treatment-Gated Representation for B2B Sales

TRACE: Traceroute-based Internet Route change Analysis with Ensemble Learning

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

SWAY: A Counterfactual Computational Linguistic Approach to Measuring and Mitigating Sycophancy

NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons

Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy

LLM Reasoning with Process Rewards for Outcome-Guided Steps

Overcoming the "Impracticality" of RAG: Proposing a Real-World Benchmark and Multi-Dimensional Diagnostic Framework

ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents

YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

Compositional Neuro-Symbolic Reasoning

Do We Need Frontier Models to Verify Mathematical Proofs?

Verbalizing LLMs' assumptions to explain and control sycophancy

Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks

Homophily-aware Supervised Contrastive Counterfactual Augmented Fair Graph Neural Network

R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning

Time-Warping Recurrent Neural Networks for Transfer Learning

When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs

BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition

Communication-Efficient Distributed Learning with Differential Privacy

Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.