Academic

A Dataset for Named Entity Recognition and Relation Extraction from Art-historical Image Descriptions

arXiv:2602.19133v1 Announce Type: new Abstract: This paper introduces FRAME (Fine-grained Recognition of Art-historical Metadata and Entities), a manually annotated dataset of art-historical image descriptions for Named Entity Recognition (NER) and Relation Extraction (RE). Descriptions were collected from museum catalogs, auction listings, open-access platforms, and scholarly databases, then filtered to ensure that each text focuses on a single artwork and contains explicit statements about its material, composition, or iconography. FRAME provides stand-off annotations in three layers: a metadata layer for object-level properties, a content layer for depicted subjects and motifs, and a co-reference layer linking repeated mentions. Across layers, entity spans are labeled with 37 types and connected by typed RE links between mentions. Entity types are aligned with Wikidata to support Named Entity Linking (NEL) and downstream knowledge-graph construction. The dataset is released as UIMA

Stefanie Schneider, Miriam G\"oldl, Julian Stalter, Ricarda Vollmer · February 25, 2026 · 1 min read · 5 views

#cs.CL

Executive Summary

The article introduces FRAME, a meticulously annotated dataset designed for Named Entity Recognition (NER) and Relation Extraction (RE) within the domain of art-historical image descriptions. Collected from diverse sources such as museum catalogs and scholarly databases, the dataset focuses on single-artwork descriptions, providing annotations across three layers: metadata, content, and co-reference. The dataset includes 37 entity types aligned with Wikidata, facilitating Named Entity Linking (NEL) and knowledge-graph construction. Released in UIMA XMI CAS format, FRAME is intended to benchmark and fine-tune NER and RE systems, including applications with Large Language Models (LLMs).

Key Points

▸ Introduction of FRAME dataset for NER and RE in art-historical image descriptions.
▸ Annotations across three layers: metadata, content, and co-reference.
▸ Alignment with Wikidata for NEL and knowledge-graph construction.
▸ Dataset released in UIMA XMI CAS format for benchmarking and fine-tuning.
▸ Potential applications with zero- and few-shot setups with LLMs.

Merits

Comprehensive Annotation

The dataset provides detailed annotations across multiple layers, ensuring a rich and nuanced understanding of art-historical descriptions.

Alignment with Wikidata

The alignment with Wikidata enhances the dataset's utility for Named Entity Linking and knowledge-graph construction, making it more versatile for various applications.

Diverse Data Sources

The data is collected from a variety of reputable sources, ensuring a broad and representative sample of art-historical descriptions.

Demerits

Limited Scope

The dataset focuses solely on single-artwork descriptions, which may limit its applicability to more complex or multi-artwork contexts.

Annotation Complexity

The three-layer annotation structure, while comprehensive, may introduce complexity that could be challenging for some users to navigate.

Potential Bias

The reliance on specific sources for data collection may introduce biases that could affect the generalizability of the dataset.

Expert Commentary

The introduction of the FRAME dataset represents a significant advancement in the field of art-historical NLP. By providing a meticulously annotated dataset that spans multiple layers of annotation, the authors have created a valuable resource for researchers and practitioners. The alignment with Wikidata is particularly noteworthy, as it facilitates the integration of the dataset with broader knowledge-graph initiatives. However, the focus on single-artwork descriptions may limit the dataset's applicability in more complex scenarios. Additionally, the three-layer annotation structure, while comprehensive, may present a steep learning curve for some users. Despite these limitations, the dataset's potential to enhance NER and RE systems in the art-historical domain is substantial. The practical implications of this work are evident in the improved automation of metadata extraction and knowledge organization, which can streamline the digitization and preservation of cultural heritage. Policy-wise, the dataset underscores the importance of AI in cultural heritage preservation, potentially influencing future policies in this area.

Recommendations

✓ Expand the dataset to include multi-artwork descriptions to broaden its applicability.
✓ Provide additional documentation and tutorials to help users navigate the complex annotation structure.

Sources

arXiv - cs.CL

Something extraordinary is coming.

A Dataset for Named Entity Recognition and Relation Extraction from Art-historical Image Descriptions

AI Commentary

Executive Summary

Key Points

Merits

Comprehensive Annotation

Alignment with Wikidata

Diverse Data Sources

Demerits

Limited Scope

Annotation Complexity

Potential Bias

Expert Commentary

Recommendations

Sources

Related Articles

Uncovering Context Reliance in Unstructured Knowledge Editing

Using AI in Dance Notation and Copyright Infringement Prevention: Enhancing …

Multilevel Determinants of Overweight and Obesity Among U.S. Children Aged …

An artificial intelligence framework for end-to-end rare disease phenotyping from …

JCG, PC

HSOLLC Co., Ltd.