From Few-Shot to Zero-Shot: Towards Generalist Graph Anomaly Detection
arXiv:2602.18793v1 Announce Type: new Abstract: Graph anomaly detection (GAD) is critical for identifying abnormal nodes in graph-structured data from diverse domains, including cybersecurity and social networks. The existing GAD methods often focus on the learning paradigms of "one-model-for-one-dataset", requiring dataset-specific training for each dataset to achieve optimal performance. However, this paradigm suffers from significant limitations, such as high computational and data costs, limited generalization and transferability to new datasets, and challenges in privacy-sensitive scenarios where access to full datasets or sufficient labels is restricted. To address these limitations, we propose a novel generalist GAD paradigm that aims to develop a unified model capable of detecting anomalies on multiple unseen datasets without extensive retraining/fine-tuning or dataset-specific customization. To this end, we propose ARC, a few-shot generalist GAD method that leverages in-conte
arXiv:2602.18793v1 Announce Type: new Abstract: Graph anomaly detection (GAD) is critical for identifying abnormal nodes in graph-structured data from diverse domains, including cybersecurity and social networks. The existing GAD methods often focus on the learning paradigms of "one-model-for-one-dataset", requiring dataset-specific training for each dataset to achieve optimal performance. However, this paradigm suffers from significant limitations, such as high computational and data costs, limited generalization and transferability to new datasets, and challenges in privacy-sensitive scenarios where access to full datasets or sufficient labels is restricted. To address these limitations, we propose a novel generalist GAD paradigm that aims to develop a unified model capable of detecting anomalies on multiple unseen datasets without extensive retraining/fine-tuning or dataset-specific customization. To this end, we propose ARC, a few-shot generalist GAD method that leverages in-context learning and requires only a few labeled normal samples at inference time. Specifically, ARC consists of three core modules: a feature Alignment module to unify and align features across datasets, a Residual GNN encoder to capture dataset-agnostic anomaly representations, and a cross-attentive in-Context learning module to score anomalies using few-shot normal context. Building on ARC, we further introduce ARC_zero for the zero-shot generalist GAD setting, which selects representative pseudo-normal nodes via a pseudo-context mechanism and thus enables fully label-free inference on unseen datasets. Extensive experiments on 17 real-world graph datasets demonstrate that both ARC and ARC_zero effectively detect anomalies, exhibit strong generalization ability, and perform efficiently under few-shot and zero-shot settings.
Executive Summary
This article proposes a novel generalist graph anomaly detection (GAD) paradigm that aims to develop a unified model capable of detecting anomalies on multiple unseen datasets without extensive retraining or dataset-specific customization. The proposed method, ARC, leverages in-context learning and requires only a few labeled normal samples at inference time. A further extension, ARC_zero, enables fully label-free inference on unseen datasets by selecting representative pseudo-normal nodes. Extensive experiments demonstrate the effectiveness of both methods in detecting anomalies, exhibiting strong generalization ability, and performing efficiently under few-shot and zero-shot settings. This paradigm has significant potential for real-world applications, particularly in scenarios where access to full datasets or sufficient labels is restricted.
Key Points
- ▸ Development of a generalist GAD paradigm to address limitations of existing methods
- ▸ Introduction of ARC, a few-shot generalist GAD method based on in-context learning
- ▸ Extension to zero-shot generalist GAD setting with ARC_zero
- ▸ Demonstration of effectiveness and efficiency in detecting anomalies on multiple unseen datasets
Merits
Strength in addressing limitations of existing GAD methods
The proposed paradigm effectively addresses the limitations of existing GAD methods, including high computational and data costs, limited generalization and transferability, and challenges in privacy-sensitive scenarios.
Efficiency and scalability
The proposed methods exhibit strong generalization ability and perform efficiently under few-shot and zero-shot settings, making them suitable for real-world applications.
Potential for real-world applications
The proposed paradigm has significant potential for real-world applications, particularly in scenarios where access to full datasets or sufficient labels is restricted.
Demerits
Limited evaluation on diverse datasets
The evaluation of the proposed methods is limited to 17 real-world graph datasets, and it would be beneficial to evaluate their performance on a more diverse set of datasets.
Lack of comprehensive comparison with existing methods
A more comprehensive comparison with existing GAD methods would be beneficial to demonstrate the superiority of the proposed paradigm.
Expert Commentary
The proposed generalist graph anomaly detection paradigm has the potential to revolutionize the field of graph-based machine learning. By developing models that can generalize across multiple datasets and scenarios, the proposed paradigm addresses the significant limitations of existing GAD methods. The introduction of ARC and ARC_zero, which leverage in-context learning and few-shot normal context, respectively, demonstrates the effectiveness and efficiency of the proposed paradigm. However, further evaluation and comparison with existing methods are necessary to fully demonstrate the superiority of the proposed paradigm. Additionally, the implications of the proposed paradigm for real-world applications and policy-making are significant and warrant further exploration.
Recommendations
- ✓ Further evaluation and comparison with existing GAD methods to demonstrate the superiority of the proposed paradigm
- ✓ Exploration of the implications of the proposed paradigm for real-world applications and policy-making