Academic

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

Ran Li, Shimin Di, Haowei LI, Luanshi Bu, Jiachuan Wang, Wangze Ni, Lei Chen · March 5, 2026 · 1 min read · 33 views

#cs.LG #cs.AI

arXiv:2603.02215v1 Announce Type: new Abstract: Chemical reaction prediction is pivotal for accelerating drug discovery and synthesis planning. Despite advances in data-driven models, current approaches are hindered by an overemphasis on parameter and dataset scaling. Some methods coupled with evaluation techniques that bypass fundamental challenges in reaction representation and fail to capture deep chemical intuition like reaction common sense and {topological atom mapping logic}. We argue that the core challenge lies in instilling these knowledge into the models. To this end, we propose a unified framework that prioritizes chemical understanding over scale through three key innovations: (1) a {Latent Chemical Consistency} objective that models reactions as movements on a continuous chemical manifold, ensuring reversible and physically plausible transformations; (2) a {Hierarchical Cognitive Curriculum} that trains the model through progressive stages, from syntax mastery to semantic reasoning, building robust chemical intuition; (3) {Atom-Map Permutation Invariance (AMPI)}, which force the model to learn invariant relational topology and balance multi-task learning. (4)and structured plan-based reasoning to improve the performance of the LLMs. Our compact {0.5B-parameter model}, \textbf{RxnNano} significantly outperforms fine-tuned LLMs ten times larger (>7B) and all the domain baselines, achieving a 23.5\% Top-1 accuracy improvement on rigorous benchmarks without test-time augmentation. https://github.com/rlisml/RxnNano.

Executive Summary

This article presents RxnNano, a novel approach to training compact Large Language Models (LLMs) for chemical reaction and retrosynthesis prediction. The authors propose a unified framework that prioritizes chemical understanding over scale, incorporating three key innovations: Latent Chemical Consistency, Hierarchical Cognitive Curriculum, Atom-Map Permutation Invariance, and structured plan-based reasoning. The compact 0.5B-parameter model, RxnNano, significantly outperforms larger fine-tuned LLMs and domain baselines, achieving a 23.5% Top-1 accuracy improvement on rigorous benchmarks. This breakthrough has significant implications for accelerating drug discovery and synthesis planning.

Key Points

▸ RxnNano is a compact LLM that prioritizes chemical understanding over scale
▸ The model incorporates three key innovations: Latent Chemical Consistency, Hierarchical Cognitive Curriculum, and Atom-Map Permutation Invariance
▸ RxnNano achieves a 23.5% Top-1 accuracy improvement on rigorous benchmarks

Merits

Strength in Chemical Understanding

RxnNano's focus on chemical understanding enables it to capture deep chemical intuition and reaction common sense, addressing a fundamental challenge in reaction representation.

Improved Performance

RxnNano's compact size and improved performance make it a more efficient and effective tool for chemical reaction and retrosynthesis prediction.

Demerits

Limited Generalizability

RxnNano's performance may be limited to specific chemical domains or tasks, requiring further adaptation and fine-tuning for broader applicability.

Computational Resource Requirements

Training and deploying RxnNano may require significant computational resources, potentially limiting its adoption in resource-constrained settings.

Expert Commentary

RxnNano represents a significant breakthrough in the development of compact LLMs for chemical reaction and retrosynthesis prediction. The authors' focus on chemical understanding and prioritization of compact size over scale enable RxnNano to outperform larger fine-tuned LLMs and domain baselines. While RxnNano's performance may be limited to specific chemical domains or tasks, its compact size and improved performance make it a more efficient and effective tool for chemical reaction and retrosynthesis prediction.

Recommendations

✓ Further research is needed to explore RxnNano's applicability to broader chemical domains and tasks.
✓ Investigating the potential of RxnNano in other areas of chemistry, such as materials science and chemical engineering, could lead to significant advancements in these fields.

Sources

arXiv - cs.LG

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

AI Commentary

Executive Summary

Key Points

Merits

Strength in Chemical Understanding

Improved Performance

Demerits

Limited Generalizability

Computational Resource Requirements

Expert Commentary

Recommendations

Sources

Related Articles

ConstitutionGPT: An AI-Powered Multilingual Legal Assistance System for Indian Citizens

AI Copyright Infringement: Navigating the Legal Risks of AI-Generated Content

The Rhetoric of Machine Learning

Busemann energy-based attention for emotion analysis in Poincar\'e discs

JCG, PC

HSOLLC Co., Ltd.

Related Articles

ConstitutionGPT: An AI-Powered Multilingual Legal Assistance System for Indian Citizens

AI Copyright Infringement: Navigating the Legal Risks of AI-Generated Content

The Rhetoric of Machine Learning

Busemann energy-based attention for emotion analysis in Poincar\'e discs