Academic

Texo: Formula Recognition within 20M Parameters

arXiv:2602.17189v1 Announce Type: new Abstract: In this paper we present Texo, a minimalist yet highperformance formula recognition model that contains only 20 million parameters. By attentive design, distillation and transfer of the vocabulary and the tokenizer, Texo achieves comparable performance to state-of-the-art models such as UniMERNet-T and PPFormulaNet-S, while reducing the model size by 80% and 65%, respectively. This enables real-time inference on consumer-grade hardware and even in-browser deployment. We also developed a web application to demonstrate the model capabilities and facilitate its usage for end users.

Sicheng Mao · February 22, 2026 · 1 min read · 7 views

#cs.AI #cs.CV

Executive Summary

This article presents Texo, a novel formula recognition model that achieves state-of-the-art performance while significantly reducing the model size to 20 million parameters. By employing attentive design, distillation, and transfer learning, Texo outperforms existing models such as UniMERNet-T and PPFormulaNet-S, while decreasing their respective sizes by 80% and 65%. This breakthrough enables real-time inference on consumer-grade hardware and even in-browser deployment. The authors also develop a web application to demonstrate the model's capabilities and facilitate its usage for end-users. This advancement has significant implications for real-world applications, particularly in the fields of scientific research, education, and industry.

Key Points

▸ Texo achieves comparable performance to state-of-the-art models while reducing model size by 80% and 65%
▸ Attentive design, distillation, and transfer learning are employed to achieve significant reductions in model size
▸ Texo enables real-time inference on consumer-grade hardware and in-browser deployment

Merits

Strength in Performance

Texo achieves comparable performance to state-of-the-art models, indicating its effectiveness in formula recognition tasks.

Demerits

Limited Dataset

The article does not provide information on the dataset used to train and evaluate Texo, which may limit its generalizability to other scenarios.

Expert Commentary

The authors' approach to designing Texo, a minimalist yet high-performance formula recognition model, is a significant advancement in the field of deep learning. By employing attentive design, distillation, and transfer learning, they have managed to achieve state-of-the-art performance while significantly reducing the model size. This breakthrough has significant implications for real-world applications, particularly in the fields of scientific research, education, and industry. However, the article's limitations, such as the lack of information on the dataset used to train and evaluate Texo, may limit its generalizability to other scenarios.

Recommendations

✓ Future research should focus on exploring the potential applications of Texo in various industries and developing strategies for its deployment and maintenance.
✓ The authors should provide more information on the dataset used to train and evaluate Texo to facilitate a more comprehensive understanding of its generalizability and limitations.

Sources

arXiv - cs.AI

Something extraordinary is coming.

Texo: Formula Recognition within 20M Parameters

AI Commentary

Executive Summary

Key Points

Merits

Strength in Performance

Demerits

Limited Dataset

Expert Commentary

Recommendations

Sources

Related Articles

Humans and LLMs Diverge on Probabilistic Inferences

France or Spain or Germany or France: A Neural Account …

Multi-Agent Causal Reasoning for Suicide Ideation Detection Through Online Conversations

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of …

JCG, PC

HSOLLC Co., Ltd.