All Practice Areas

International Law

국제법

Jurisdiction: All US KR EU Intl
LOW Academic International

AllMem: A Memory-centric Recipe for Efficient Long-context Modeling

arXiv:2602.13680v1 Announce Type: new Abstract: Large Language Models (LLMs) encounter significant performance bottlenecks in long-sequence tasks due to the computational complexity and memory overhead inherent in the self-attention mechanism. To address these challenges, we introduce \textsc{AllMem}, a novel and efficient...

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

PhGPO: Pheromone-Guided Policy Optimization for Long-Horizon Tool Planning

arXiv:2602.13691v1 Announce Type: new Abstract: Recent advancements in Large Language Model (LLM) agents have demonstrated strong capabilities in executing complex tasks through tool use. However, long-horizon multi-step tool planning is challenging, because the exploration space suffers from a combinatorial explosion....

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Using Machine Learning to Enhance the Detection of Obfuscated Abusive Words in Swahili: A Focus on Child Safety

arXiv:2602.13455v1 Announce Type: new Abstract: The rise of digital technology has dramatically increased the potential for cyberbullying and online abuse, necessitating enhanced measures for detection and prevention, especially among children. This study focuses on detecting abusive obfuscated language in Swahili,...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Language Model Memory and Memory Models for Language

arXiv:2602.13466v1 Announce Type: new Abstract: The ability of machine learning models to store input information in hidden layer vector embeddings, analogous to the concept of `memory', is widely employed but not well characterized. We find that language model embeddings typically...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier

arXiv:2602.13504v1 Announce Type: new Abstract: The rapid integration of large language models into newsroom workflows has raised urgent questions about the prevalence of AI-generated content in online media. While computational studies have begun to quantify this phenomenon in English-language outlets,...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens

arXiv:2602.13517v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities by scaling test-time compute via long Chain-of-Thought (CoT). However, recent findings suggest that raw token counts are unreliable proxies for reasoning quality: increased generation length does...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Elo-Evolve: A Co-evolutionary Framework for Language Model Alignment

arXiv:2602.13575v1 Announce Type: new Abstract: Current alignment methods for Large Language Models (LLMs) rely on compressing vast amounts of human preference data into static, absolute reward functions, leading to data scarcity, noise sensitivity, and training instability. We introduce Elo-Evolve, a...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Metaphors' journeys across time and genre: tracking the evolution of literary metaphors with temporal embeddings

arXiv:2602.13701v1 Announce Type: new Abstract: Metaphors are a distinctive feature of literary language, yet they remain less studied experimentally than everyday metaphors. Moreover, previous psycholinguistic and computational approaches overlooked the temporal dimension, although many literary metaphors were coined centuries apart...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

On Theoretically-Driven LLM Agents for Multi-Dimensional Discourse Analysis

arXiv:2602.13713v1 Announce Type: new Abstract: Identifying the strategic uses of reformulation in discourse remains a key challenge for computational argumentation. While LLMs can detect surface-level similarity, they often fail to capture the pragmatic functions of rephrasing, such as its role...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

RMPL: Relation-aware Multi-task Progressive Learning with Stage-wise Training for Multimedia Event Extraction

arXiv:2602.13748v1 Announce Type: new Abstract: Multimedia Event Extraction (MEE) aims to identify events and their arguments from documents that contain both text and images. It requires grounding event semantics across different modalities. Progress in MEE is limited by the lack...

1 min 1 month, 2 weeks ago
ear
LOW Academic United States

The acquisition of English irregular inflections by Yemeni L1 Arabic learners: A Universal Grammar approach

arXiv:2602.13816v1 Announce Type: new Abstract: This study examines the acquisition of English irregular inflections by Yemeni learners of English as a second language (L2), utilizing a Universal Grammar (UG) approach. Within the UG approach, the study considers Feature Reassembly Hypothesis...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind

arXiv:2602.13832v1 Announce Type: new Abstract: Large Language Models (LLMs) have developed rapidly and are widely applied to both general-purpose and professional tasks to assist human users. However, they still struggle to comprehend and respond to the true user needs when...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training

arXiv:2602.13840v1 Announce Type: new Abstract: Large language model (LLM) agents are increasingly deployed in personalized tasks involving sensitive, context-dependent information, where privacy violations may arise in agents' action due to the implicitness of contextual privacy. Existing approaches rely on external,...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe

arXiv:2602.13860v1 Announce Type: new Abstract: The overarching research direction of this work is the development of a ''Responsible Intelligence'' framework designed to reconcile the immense generative power of Large Language Models (LLMs) with the stringent requirements of real-world deployment. As...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Bridging the Multilingual Safety Divide: Efficient, Culturally-Aware Alignment for Global South Languages

arXiv:2602.13867v1 Announce Type: new Abstract: Large language models (LLMs) are being deployed across the Global South, where everyday use involves low-resource languages, code-mixing, and culturally specific norms. Yet safety pipelines, benchmarks, and alignment still largely target English and a handful...

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

ADAB: Arabic Dataset for Automated Politeness Benchmarking -- A Large-Scale Resource for Computational Sociopragmatics

arXiv:2602.13870v1 Announce Type: new Abstract: The growing importance of culturally-aware natural language processing systems has led to an increasing demand for resources that capture sociopragmatic phenomena across diverse languages. Nevertheless, Arabic-language resources for politeness detection remain under-explored, despite the rich...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Evaluating Prompt Engineering Techniques for RAG in Small Language Models: A Multi-Hop QA Approach

arXiv:2602.13890v1 Announce Type: new Abstract: Retrieval Augmented Generation (RAG) is a powerful approach for enhancing the factual grounding of language models by integrating external knowledge. While widely studied for large language models, the optimization of RAG for Small Language Models...

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

Chain-of-Thought Reasoning with Large Language Models for Clinical Alzheimer's Disease Assessment and Diagnosis

arXiv:2602.13979v1 Announce Type: new Abstract: Alzheimer's disease (AD) has become a prevalent neurodegenerative disease worldwide. Traditional diagnosis still relies heavily on medical imaging and clinical assessment by physicians, which is often time-consuming and resource-intensive in terms of both human expertise...

1 min 1 month, 2 weeks ago
ear
LOW Academic United States

Geometry-Preserving Aggregation for Mixture-of-Experts Embedding Models

arXiv:2602.14039v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) embedding models combine expert outputs using weighted linear summation, implicitly assuming a linear subspace structure in the embedding space. This assumption is shown to be inconsistent with the geometry of expert representations. Geometric...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Context Shapes LLMs Retrieval-Augmented Fact-Checking Effectiveness

arXiv:2602.14044v1 Announce Type: new Abstract: Large language models (LLMs) show strong reasoning abilities across diverse tasks, yet their performance on extended contexts remains inconsistent. While prior research has emphasized mid-context degradation in question answering, this study examines the impact of...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation

arXiv:2602.14054v1 Announce Type: new Abstract: Code generation remains a challenging task that requires precise and structured reasoning. Existing Test Time Scaling (TTS) methods, including structured tree search, have made progress in exploring reasoning paths but still face two major challenges:...

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

arXiv:2602.14060v1 Announce Type: new Abstract: We introduce LM-Lexicon, an innovative definition modeling approach that incorporates data clustering, semantic expert learning, and model merging using a sparse mixture-of-experts architecture. By decomposing the definition modeling task into specialized semantic domains, where small...

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survival prognosis

arXiv:2602.15067v1 Announce Type: new Abstract: Gliomas, among the most common primary brain tumors, vary widely in aggressiveness, prognosis, and histology, making treatment challenging due to complex and time-intensive surgical interventions. This study presents an Attention-Gated Recurrent Residual U-Net (R2U-Net) based...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

arXiv:2602.15112v1 Announce Type: new Abstract: We introduce ResearchGym, a benchmark and execution environment for evaluating AI agents on end-to-end research. To instantiate this, we repurpose five oral and spotlight papers from ICML, ICLR, and ACL. From each paper's repository, we...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Panini: Continual Learning in Token Space via Structured Memory

arXiv:2602.15156v1 Announce Type: new Abstract: Language models are increasingly used to reason over content they were not trained on, such as new documents, evolving knowledge, and user-specific data. A common approach is retrieval-augmented generation (RAG), which stores verbatim documents externally...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

da Costa and Tarski meet Goguen and Carnap: a novel approach for ontological heterogeneity based on consequence systems

arXiv:2602.15158v1 Announce Type: new Abstract: This paper presents a novel approach for ontological heterogeneity that draws heavily from Carnapian-Goguenism, as presented by Kutz, Mossakowski and L\"ucke (2010). The approach is provisionally designated da Costian-Tarskianism, named after da Costa's Principle of...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogorov Arnold Networks), and Ensemble Models

arXiv:2602.15248v1 Announce Type: new Abstract: Invoice or payment dilution is the gap between the approved invoice amount and the actual collection is a significant source of non credit risk and margin loss in supply chain finance. Traditionally, this risk is...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Epistemic Traps: Rational Misalignment Driven by Model Misspecification

arXiv:2602.17676v1 Announce Type: new Abstract: The rapid deployment of Large Language Models and AI agents across critical societal and technical domains is hindered by persistent behavioral pathologies including sycophancy, hallucination, and strategic deception that resist mitigation via reinforcement learning. Current...

1 min 1 month, 2 weeks ago
ear
LOW Academic International

Cross-Embodiment Offline Reinforcement Learning for Heterogeneous Robot Datasets

arXiv:2602.18025v1 Announce Type: new Abstract: Scalable robot policy pre-training has been hindered by the high cost of collecting high-quality demonstrations for each platform. In this study, we address this issue by uniting offline reinforcement learning (offline RL) with cross-embodiment learning....

1 min 1 month, 2 weeks ago
ear
LOW Academic European Union

SOMtime the World Ain$'$t Fair: Violating Fairness Using Self-Organizing Maps

arXiv:2602.18201v1 Announce Type: new Abstract: Unsupervised representations are widely assumed to be neutral with respect to sensitive attributes when those attributes are withheld from training. We show that this assumption is false. Using SOMtime, a topology-preserving representation method based on...

1 min 1 month, 2 weeks ago
ear
Previous Page 89 of 135 Next