Skip to main content
Academic

TCM-DiffRAG: Personalized Syndrome Differentiation Reasoning Method for Traditional Chinese Medicine based on Knowledge Graph and Chain of Thought

arXiv:2602.22828v1 Announce Type: new Abstract: Background: Retrieval augmented generation (RAG) technology can empower large language models (LLMs) to generate more accurate, professional, and timely responses without fine tuning. However, due to the complex reasoning processes and substantial individual differences involved in traditional Chinese medicine (TCM) clinical diagnosis and treatment, traditional RAG methods often exhibit poor performance in this domain. Objective: To address the limitations of conventional RAG approaches in TCM applications, this study aims to develop an improved RAG framework tailored to the characteristics of TCM reasoning. Methods: We developed TCM-DiffRAG, an innovative RAG framework that integrates knowledge graphs (KG) with chains of thought (CoT). TCM-DiffRAG was evaluated on three distinctive TCM test datasets. Results: The experimental results demonstrated that TCM-DiffRAG achieved significant performance improvements over native LLMs. For exampl

arXiv:2602.22828v1 Announce Type: new Abstract: Background: Retrieval augmented generation (RAG) technology can empower large language models (LLMs) to generate more accurate, professional, and timely responses without fine tuning. However, due to the complex reasoning processes and substantial individual differences involved in traditional Chinese medicine (TCM) clinical diagnosis and treatment, traditional RAG methods often exhibit poor performance in this domain. Objective: To address the limitations of conventional RAG approaches in TCM applications, this study aims to develop an improved RAG framework tailored to the characteristics of TCM reasoning. Methods: We developed TCM-DiffRAG, an innovative RAG framework that integrates knowledge graphs (KG) with chains of thought (CoT). TCM-DiffRAG was evaluated on three distinctive TCM test datasets. Results: The experimental results demonstrated that TCM-DiffRAG achieved significant performance improvements over native LLMs. For example, the qwen-plus model achieved scores of 0.927, 0.361, and 0.038, which were significantly enhanced to 0.952, 0.788, and 0.356 with TCM-DiffRAG. The improvements were even more pronounced for non-Chinese LLMs. Additionally, TCM-DiffRAG outperformed directly supervised fine-tuned (SFT) LLMs and other benchmark RAG methods. Conclusions: TCM-DiffRAG shows that integrating structured TCM knowledge graphs with Chain of Thought based reasoning substantially improves performance in individualized diagnostic tasks. The joint use of universal and personalized knowledge graphs enables effective alignment between general knowledge and clinical reasoning. These results highlight the potential of reasoning-aware RAG frameworks for advancing LLM applications in traditional Chinese medicine.

Executive Summary

This study presents TCM-DiffRAG, a novel Retrieval-Augmented Generation (RAG) framework tailored to Traditional Chinese Medicine (TCM) clinical diagnosis and treatment. By integrating knowledge graphs and chain of thought, TCM-DiffRAG demonstrates significant performance improvements over native Large Language Models (LLMs) and benchmark RAG methods. The framework's ability to align general knowledge with clinical reasoning showcases its potential for advancing LLM applications in TCM. The study's results are promising, but further evaluation on diverse datasets and real-world scenarios is necessary to establish TCM-DiffRAG's robustness and generalizability.

Key Points

  • TCM-DiffRAG integrates knowledge graphs and chain of thought to improve performance in TCM clinical diagnosis and treatment
  • Significant performance enhancements over native LLMs and benchmark RAG methods
  • Potential for advancing LLM applications in Traditional Chinese Medicine

Merits

Strength in Personalized Reasoning

TCM-DiffRAG's ability to incorporate personalized knowledge graphs enables effective alignment between general knowledge and clinical reasoning, leading to improved diagnostic accuracy.

Improved Performance over Benchmark Methods

TCM-DiffRAG outperforms directly supervised fine-tuned (SFT) LLMs and other benchmark RAG methods, demonstrating its potential for real-world applications.

Demerits

Limited Evaluation on Real-World Scenarios

The study's results are promising, but further evaluation on diverse datasets and real-world scenarios is necessary to establish TCM-DiffRAG's robustness and generalizability.

Dependence on High-Quality Knowledge Graphs

The effectiveness of TCM-DiffRAG relies heavily on the quality and accuracy of the knowledge graphs used, which may be challenging to maintain and update in real-world applications.

Expert Commentary

While TCM-DiffRAG represents a significant advancement in the application of RAG to TCM, it is essential to recognize the limitations and challenges associated with this framework. Future research should focus on addressing these limitations, including the development of high-quality knowledge graphs, evaluation on diverse datasets, and exploration of real-world scenarios. Additionally, the integration of TCM-DiffRAG with other AI and NLP techniques may further enhance its performance and potential for real-world applications.

Recommendations

  • Future research should prioritize the development of high-quality knowledge graphs and evaluation on diverse datasets and real-world scenarios
  • The integration of TCM-DiffRAG with other AI and NLP techniques may further enhance its performance and potential for real-world applications

Sources