Rethinking the Harmonic Loss via Non-Euclidean Distance Layers
arXiv:2603.10225v1

Abstract: Cross-entropy loss has long been the standard choice for training deep neural networks, yet it suffers from interpretability limitations, unbounded weight growth, and inefficiencies that can contribute to costly training dynamics. The harmonic loss is a distance-based alternative grounded in Euclidean geometry that improves interpretability and mitigates phenomena such as grokking, or delayed generalization on the test set. However, the study of harmonic loss remains narrow: only Euclidean distance has been explored, and no systematic evaluation of computational efficiency or sustainability has been conducted. We extend harmonic loss by systematically investigating a broad spectrum of distance metrics as replacements for the Euclidean distance. We comprehensively evaluate distance-tailored harmonic losses on both vision backbones and large language models. Our analysis is framed around a three-way evaluation of model performance, interpretability, and sustainability. On vision tasks, cosine distances provide the most favorable trade-off, consistently improving accuracy while lowering carbon emissions, whereas Bray-Curtis and Mahalanobis further enhance interpretability at varying efficiency costs. On language models, cosine-based harmonic losses improve gradient and learning stability, strengthen representation structure, and reduce emissions relative to cross-entropy and Euclidean heads. Our code is available at: https://anonymous.4open.science/r/rethinking-harmonic-loss-5BAB/.
Executive Summary
This article presents a comprehensive reevaluation of harmonic loss, a distance-based alternative to cross-entropy loss in deep neural networks. The authors extend the harmonic loss framework by investigating a range of non-Euclidean distance metrics, including cosine, Bray-Curtis, and Mahalanobis distances. The study evaluates the performance, interpretability, and sustainability of distance-tailored harmonic losses on both vision and language tasks. The findings suggest that cosine distances offer a favorable trade-off between accuracy and efficiency, while Bray-Curtis and Mahalanobis distances enhance interpretability at varying efficiency costs. The article highlights the potential of harmonic loss to improve model performance, stability, and sustainability, particularly in large language models.
Key Points
- ▸ Harmonic loss offers improved interpretability and stability compared to cross-entropy loss.
- ▸ Non-Euclidean distance metrics, such as cosine, Bray-Curtis, and Mahalanobis distances, can enhance harmonic loss performance.
- ▸ Distance-tailored harmonic losses show promising results on both vision and language tasks.
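To make the idea behind these points concrete, the sketch below illustrates a harmonic classification head in which the distance metric is a swappable component. It follows the harmonic formulation in which class probabilities are proportional to an inverse power of the distance between the input representation and per-class weight vectors, p_i ∝ 1/d_i^n; the specific function names, the exponent default, and the exact treatment of the non-Euclidean variants are illustrative assumptions, not the paper's implementation (see the linked repository for that).

```python
import numpy as np

def harmonic_probs(x, W, n=2.0, metric="euclidean", eps=1e-12):
    """Distance-based class probabilities in the style of harmonic loss.

    x      : (d,) input representation
    W      : (C, d) per-class weight vectors acting as class centers
    n      : harmonic exponent controlling how sharply probability
             concentrates on the nearest class
    metric : distance used in place of the original Euclidean choice
    """
    if metric == "euclidean":
        d = np.linalg.norm(W - x, axis=1)
    elif metric == "cosine":
        # cosine distance = 1 - cosine similarity
        sim = (W @ x) / (np.linalg.norm(W, axis=1) * np.linalg.norm(x) + eps)
        d = 1.0 - sim
    elif metric == "braycurtis":
        # Bray-Curtis dissimilarity: sum|w - x| / sum|w + x|
        d = np.abs(W - x).sum(axis=1) / (np.abs(W + x).sum(axis=1) + eps)
    else:
        raise ValueError(f"unknown metric: {metric}")
    inv = 1.0 / (d + eps) ** n          # harmonic "logit": closer => larger
    return inv / inv.sum()              # normalize to a distribution

def harmonic_loss(x, W, target, n=2.0, metric="euclidean"):
    """Negative log-probability of the target class under harmonic_probs."""
    p = harmonic_probs(x, W, n=n, metric=metric)
    return -np.log(p[target])
```

Because the normalization depends only on the distance vector, swapping in a different metric changes the geometry of the decision boundaries (and, per the paper's findings, the interpretability and efficiency profile) without altering the rest of the training loop.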
Merits
Systematic Evaluation
The article provides a comprehensive evaluation of harmonic loss with a range of non-Euclidean distance metrics, filling a significant gap in the existing literature.
Practical Implications
The study highlights the potential of harmonic loss to improve model performance, stability, and sustainability, particularly in large language models, with significant practical implications for industry and research.
Methodological Contributions
The article introduces a three-way evaluation protocol spanning model performance, interpretability, and sustainability (including carbon emissions), a methodology that can be applied to assess other loss functions and machine learning frameworks.
Demerits
Limited Generalizability
The study is limited to vision and language tasks, and it is unclear whether the findings generalize to other domains or applications.
Computational Complexity
Some distance-tailored harmonic losses carry extra computational cost; Mahalanobis distance in particular requires maintaining and inverting a covariance estimate, which can offset the efficiency gains reported for the cosine variants in resource-constrained settings.
Scalability
The study does not investigate the scalability of harmonic loss to very large datasets or complex models.
Expert Commentary
This article makes a significant contribution to the field of machine learning, demonstrating that distance-tailored harmonic losses can improve model performance, stability, and sustainability. The comprehensive evaluation offers valuable insight into the trade-offs among accuracy, interpretability, and efficiency. The study's limitations, notably the restricted generalizability beyond vision and language tasks and the untested scalability to very large models, should be acknowledged. Nevertheless, the findings carry meaningful implications for building more efficient and sustainable training pipelines, and the three-way evaluation methodology transfers readily to other loss functions and frameworks.
Recommendations
- ✓ Future studies should investigate the scalability and generalizability of distance-tailored harmonic losses to very large datasets and complex models.
- ✓ The development of more efficient and sustainable machine learning frameworks should be prioritized, with a focus on reducing carbon emissions and improving model interpretability.