Litigation

LOW Academic International

HippoCamp: Benchmarking Contextual Agents on Personal Computers

arXiv:2604.01221v1 Announce Type: new Abstract: We present HippoCamp, a new benchmark designed to evaluate agents' capabilities on multimodal file management. Unlike existing agent benchmarks that focus on tasks like web interaction, tool use, or software automation in generic settings, HippoCamp...

1 min 2 weeks, 1 day ago

evidence

LOW Academic United Kingdom

Phonological Fossils: Machine Learning Detection of Non-Mainstream Vocabulary in Sulawesi Basic Lexicon

arXiv:2604.00023v1 Announce Type: new Abstract: Basic vocabulary in many Sulawesi Austronesian languages includes forms resisting reconstruction to any proto-form with phonological patterns inconsistent with inherited roots, but whether this non-conforming vocabulary represents pre-Austronesian substrate or independent innovation has not been...

1 min 2 weeks, 1 day ago

evidence

LOW Academic International

Improvisational Games as a Benchmark for Social Intelligence of AI Agents: The Case of Connections

arXiv:2604.00284v1 Announce Type: new Abstract: We formally introduce an improvisational wordplay game called Connections to explore reasoning capabilities of AI agents. Playing Connections combines skills in knowledge retrieval, summarization and awareness of cognitive states of other agents. We show how...

1 min 2 weeks, 1 day ago

standing

LOW Academic International

Agent psychometrics: Task-level performance prediction in agentic coding benchmarks

arXiv:2604.00594v1 Announce Type: new Abstract: As the focus in LLM-based coding shifts from static single-step code generation to multi-step agentic interaction with tools and environments, understanding which tasks will challenge agents and why becomes increasingly difficult. This is compounded by...

1 min 2 weeks, 1 day ago

standing

LOW Academic International

CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning

arXiv:2604.01634v1 Announce Type: new Abstract: Real-world reasoning often requires combining information across modalities, connecting textual context with visual cues in a multi-hop process. Yet, most multimodal benchmarks fail to capture this ability: they typically rely on single images or set...

1 min 2 weeks, 1 day ago

evidence

LOW Academic European Union

One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction

arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consistent outputs, while complex cases produce divergent predictions under minor prompt changes. Existing single-agent strategies sample from one role-conditioned distribution, and multi-agent...

1 min 2 weeks, 1 day ago

evidence

LOW Academic International

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

arXiv:2604.01151v1 Announce Type: new Abstract: As LLM agents are increasingly deployed in multi-agent systems, they introduce risks of covert coordination that may evade standard forms of human oversight. While linear probes on model activations have shown promise for detecting deception...

1 min 2 weeks, 1 day ago

evidence

LOW Academic International

MSA-Thinker: Discrimination-Calibration Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis

arXiv:2604.00013v1 Announce Type: cross Abstract: Multimodal sentiment analysis aims to understand human emotions by integrating textual, auditory, and visual modalities. Although Multimodal Large Language Models (MLLMs) have achieved state-of-the-art performance via supervised fine-tuning (SFT), their end-to-end "black-box" nature limits interpretability....

1 min 2 weeks, 1 day ago

motion

LOW Academic European Union

Cognitive Energy Modeling for Neuroadaptive Human-Machine Systems using EEG and WGAN-GP

arXiv:2604.01653v1 Announce Type: new Abstract: Electroencephalography (EEG) provides a non-invasive insight into the brain's cognitive and emotional dynamics. However, modeling how these states evolve in real time and quantifying the energy required for such transitions remains a major challenge. The...

1 min 2 weeks, 1 day ago

motion

LOW Academic European Union

Artificial Intelligence and International Law: Legal Implications of AI Development and Global Regulation

This paper examines the legal implications of artificial intelligence (AI) development within the framework of public international law. Employing a doctrinal and comparative legal methodology, it surveys the principal international and regional regulatory instruments currently governing AI — including the...

1 min 2 weeks, 2 days ago

jurisdiction

LOW Conference South Korea

About the Association for the Advancement of Artificial Intelligence (AAAI)

AAAI is an artificial intelligence organization dedicated to advancing the scientific understanding of AI.

2 min 2 weeks, 6 days ago

standing

LOW News International

AV1’s open, royalty-free promise in question as Dolby sues Snapchat over codec

Big Tech declaring AV1 royalty-free “doesn’t mean that it is.”

1 min 2 weeks, 6 days ago

lawsuit

LOW News International

Elon Musk loses big in court; X boycott perfectly legal

X admonished for "fishing expedition" as judge dismisses ad boycott lawsuit.

1 min 2 weeks, 6 days ago

lawsuit

LOW News International

OpenAI shuts down Sora while Meta gets shut out in court

When an 82-year-old Kentucky woman was offered $26 million from an AI company that wanted to build a data center on her land, she said no. Sure, that same company can try to rezone 2,000 acres nearby anyway, but as...

1 min 2 weeks, 6 days ago

lawsuit

LOW Academic International

The Compression Paradox in LLM Inference: Provider-Dependent Energy Effects of Prompt Compression

arXiv:2603.23528v1 Announce Type: new Abstract: The rapid proliferation of Large Language Models has created an environmental paradox: the very technology that could help solve climate challenges is itself becoming a significant contributor to global carbon emissions. We test whether prompt...

1 min 3 weeks, 2 days ago

trial

LOW Academic European Union

Prompt Compression in Production Task Orchestration: A Pre-Registered Randomized Trial

arXiv:2603.23525v1 Announce Type: new Abstract: The economics of prompt compression depend not only on reducing input tokens but on how compression changes output length, which is typically priced several times higher. We evaluate this in a pre-registered six-arm randomized controlled...

1 min 3 weeks, 2 days ago

trial

LOW Academic International

Visuospatial Perspective Taking in Multimodal Language Models

arXiv:2603.23510v1 Announce Type: new Abstract: As multimodal language models (MLMs) are increasingly used in social and collaborative settings, it is crucial to evaluate their perspective-taking abilities. Existing benchmarks largely rely on text-based vignettes or static scene understanding, leaving visuospatial perspective-taking...

1 min 3 weeks, 2 days ago

standing

LOW Academic United States

S-Path-RAG: Semantic-Aware Shortest-Path Retrieval Augmented Generation for Multi-Hop Knowledge Graph Question Answering

arXiv:2603.23512v1 Announce Type: new Abstract: We present S-Path-RAG, a semantic-aware shortest-path Retrieval-Augmented Generation framework designed to improve multi-hop question answering over large knowledge graphs. S-Path-RAG departs from one-shot, text-heavy retrieval by enumerating bounded-length, semantically weighted candidate paths using a hybrid...

1 min 3 weeks, 2 days ago

evidence

LOW Academic International

Fast and Faithful: Real-Time Verification for Long-Document Retrieval-Augmented Generation Systems

arXiv:2603.23508v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) is increasingly deployed in enterprise search and document-centric assistants, where responses must be grounded in long and complex source materials. In practice, verifying that generated answers faithfully reflect retrieved documents is difficult:...

1 min 3 weeks, 2 days ago

evidence

LOW Academic International

Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes

arXiv:2603.23507v1 Announce Type: new Abstract: While Masked Diffusion Language Models (MDLMs) relying on token masking and unmasking have shown promise in language modeling, their computational efficiency and generation flexibility remain constrained by the masking paradigm. In this paper, we propose...

1 min 3 weeks, 2 days ago

mdl

LOW Academic International

Do 3D Large Language Models Really Understand 3D Spatial Relationships?

arXiv:2603.23523v1 Announce Type: new Abstract: Recent 3D Large-Language Models (3D-LLMs) claim to understand 3D worlds, especially spatial relationships among objects. Yet, we find that simply fine-tuning a language model on text-only question-answer pairs can perform comparably or even surpass these...

1 min 3 weeks, 2 days ago

standing

LOW Academic International

Navigating the Concept Space of Language Models

arXiv:2603.23524v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) trained on large language model activations output thousands of features that enable mapping to human-interpretable concepts. The current practice for analyzing these features primarily relies on inspecting top-activating examples, manually browsing individual...

1 min 3 weeks, 2 days ago

discovery

LOW Academic International

Chitrakshara: A Large Multilingual Multimodal Dataset for Indian languages

arXiv:2603.23521v1 Announce Type: new Abstract: Multimodal research has predominantly focused on single-image reasoning, with limited exploration of multi-image scenarios. Recent models have sought to enhance multi-image understanding through large-scale pretraining on interleaved image-text datasets. However, most Vision-Language Models (VLMs) are...

1 min 3 weeks, 2 days ago

standing

LOW Academic International

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

arXiv:2603.23516v1 Announce Type: new Abstract: Long-term memory is a cornerstone of human intelligence. Enabling AI to process lifetime-scale information remains a long-standing pursuit in the field. Due to the constraints of full-attention architectures, the effective context length of large language...

1 min 3 weeks, 2 days ago

standing

LOW Academic European Union

Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths

arXiv:2603.23624v1 Announce Type: new Abstract: Digging-in effects, where disambiguation difficulty increases with longer ambiguous regions, have been cited as evidence for self-organized sentence processing, in which structural commitments strengthen over time. In contrast, surprisal theory predicts no such effect unless...

1 min 3 weeks, 2 days ago

evidence

LOW Academic International

Language Model Planners do not Scale, but do Formalizers?

arXiv:2603.23844v1 Announce Type: new Abstract: Recent work shows overwhelming evidence that LLMs, even those trained to scale their reasoning trace, perform unsatisfactorily when solving planning problems too complex. Whether the same conclusion holds for LLM formalizers that generate solver-oriented programs...

1 min 3 weeks, 2 days ago

evidence

LOW Academic International

BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents

arXiv:2603.23848v1 Announce Type: new Abstract: LLMs are increasingly used as long-running conversational agents, yet every major benchmark evaluating their memory treats user information as static facts to be stored and retrieved. That's the wrong model. People change their minds, and...

1 min 3 weeks, 2 days ago

evidence

LOW Academic International

Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development

arXiv:2603.23937v1 Announce Type: new Abstract: Evidence-based medicine (EBM) is central to high-quality care, but remains difficult to implement in fast-paced primary care settings. Physicians face short consultations, increasing patient loads, and lengthy guideline documents that are impractical to consult in...

1 min 3 weeks, 2 days ago

evidence

LOW Academic European Union

Thinking with Tables: Enhancing Multi-Modal Tabular Understanding via Neuro-Symbolic Reasoning

arXiv:2603.24004v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have demonstrated remarkable reasoning capabilities across modalities such as images and text. However, tabular data, despite being a critical real-world modality, remains relatively underexplored in multimodal learning. In this paper,...

1 min 3 weeks, 2 days ago

standing

LOW Academic European Union

Dual-Criterion Curriculum Learning: Application to Temporal Data

arXiv:2603.23573v1 Announce Type: new Abstract: Curriculum Learning (CL) is a meta-learning paradigm that trains a model by feeding the data instances incrementally according to a schedule, which is based on difficulty progression. Defining meaningful difficulty assessment measures is crucial and...

1 min 3 weeks, 2 days ago

evidence
