Skip to main content
Academic

Neural Proposals, Symbolic Guarantees: Neuro-Symbolic Graph Generation with Hard Constraints

arXiv:2602.16954v1 Announce Type: new Abstract: We challenge black-box purely deep neural approaches for molecules and graph generation, which are limited in controllability and lack formal guarantees. We introduce Neuro-Symbolic Graph Generative Modeling (NSGGM), a neurosymbolic framework that reapproaches molecule generation as a scaffold and interaction learning task with symbolic assembly. An autoregressive neural model proposes scaffolds and refines interaction signals, and a CPU-efficient SMT solver constructs full graphs while enforcing chemical validity, structural rules, and user-specific constraints, yielding molecules that are correct by construction and interpretable control that pure neural methods cannot provide. NSGGM delivers strong performance on both unconstrained generation and constrained generation tasks, demonstrating that neuro-symbolic modeling can match state-of-the-art generative performance while offering explicit controllability and guarantees. To evaluate

C
Chuqin Geng, Li Zhang, Mark Zhang, Haolin Ye, Ziyu Zhao, Xujie Si
· · 1 min read · 5 views

arXiv:2602.16954v1 Announce Type: new Abstract: We challenge black-box purely deep neural approaches for molecules and graph generation, which are limited in controllability and lack formal guarantees. We introduce Neuro-Symbolic Graph Generative Modeling (NSGGM), a neurosymbolic framework that reapproaches molecule generation as a scaffold and interaction learning task with symbolic assembly. An autoregressive neural model proposes scaffolds and refines interaction signals, and a CPU-efficient SMT solver constructs full graphs while enforcing chemical validity, structural rules, and user-specific constraints, yielding molecules that are correct by construction and interpretable control that pure neural methods cannot provide. NSGGM delivers strong performance on both unconstrained generation and constrained generation tasks, demonstrating that neuro-symbolic modeling can match state-of-the-art generative performance while offering explicit controllability and guarantees. To evaluate more nuanced controllability, we also introduce a Logical-Constraint Molecular Benchmark, designed to test strict hard-rule satisfaction in workflows that require explicit, interpretable specifications together with verifiable compliance.

Executive Summary

This study introduces Neuro-Symbolic Graph Generative Modeling (NSGGM), a novel framework that leverages the strengths of both symbolic and neural approaches to tackle the challenges of molecule and graph generation. NSGGM combines an autoregressive neural model with a CPU-efficient SMT solver to generate molecules that are correct by construction and interpretable. The framework demonstrates strong performance on both unconstrained and constrained generation tasks, while offering explicit controllability and guarantees. The study also introduces a Logical-Constraint Molecular Benchmark to evaluate nuanced controllability. The findings suggest that neuro-symbolic modeling can match state-of-the-art generative performance while providing explicit control and interpretability, which is crucial in various real-world applications.

Key Points

  • NSGGM combines symbolic and neural approaches to tackle molecule and graph generation challenges.
  • The framework leverages an autoregressive neural model and a CPU-efficient SMT solver to generate molecules.
  • NSGGM demonstrates strong performance on both unconstrained and constrained generation tasks.

Merits

Explicit Controllability and Guarantees

NSGGM offers explicit controllability and guarantees, which is crucial in various real-world applications, such as drug discovery and materials science.

Demerits

Complexity and Computational Cost

The framework may require significant computational resources and expertise in symbolic and neural modeling to implement and optimize.

Expert Commentary

The study's findings are significant, as they demonstrate the potential of neuro-symbolic modeling to address the limitations of purely deep neural approaches in molecule and graph generation. The introduction of NSGGM as a novel framework for controllable and interpretable generation is a major contribution. However, the complexity and computational cost of the framework may limit its adoption in certain contexts. Nevertheless, the study's implications for the development of explainable and transparent AI systems are substantial, and its findings have the potential to inform policy decisions on the responsible development and deployment of AI.

Recommendations

  • Future research should focus on optimizing the computational efficiency and scalability of NSGGM to enable its widespread adoption.
  • The development of similar neuro-symbolic frameworks for other AI applications, such as natural language processing and computer vision, is warranted to explore the broader potential of this approach.

Sources