Healthcare systems around the world are grappling with issues like inefficient diagnostics, rising costs, and limited access to specialists. These problems often lead to delays in treatment and poor health outcomes. Most current AI and deep learning diagnostic systems are not very interactive or transparent, making them less effective in real-world, patient-centered environments. This research introduces a diagnostic chatbot powered by a Large Language Model (LLM), using GPT-4o, Retrieval-Augmented Generation, and explainable AI techniques. The chatbot engages patients in a dynamic conversation, helping to extract and normalize symptoms while prioritizing potential diagnoses through similarity matching and adaptive questioning. With Chain-of-Thought prompting, the system also offers more transparent reasoning behind its diagnoses. When tested against traditional machine learning models like Naive Bayes, Logistic Regression, SVM, Random Forest, and KNN, the LLM-based system delivered impressive results, achieving an accuracy of 90% and Top-3 accuracy of 100%. These findings offer a promising outlook for more transparent, interactive, and clinically relevant AI in healthcare.
翻译:全球医疗系统正面临诊断效率低下、成本上升及专科医生资源有限等问题,这些挑战常导致治疗延误与不良健康结局。当前多数基于人工智能与深度学习的诊断系统缺乏交互性与透明度,难以在实际以患者为中心的场景中有效应用。本研究提出一种基于大语言模型(LLM)的诊断对话系统,融合GPT-4o、检索增强生成及可解释人工智能技术。该系统通过动态对话引导患者,协助提取并规范化症状描述,同时借助相似性匹配与自适应提问机制对潜在诊断进行优先级排序。结合思维链提示技术,该系统能够提供更具透明度的诊断推理过程。在与朴素贝叶斯、逻辑回归、支持向量机、随机森林及K近邻等传统机器学习模型的对比测试中,基于大语言模型的系统表现出卓越性能,其诊断准确率达90%,Top-3准确率达100%。这些发现为开发更透明、交互性更强且具临床实用价值的医疗人工智能系统提供了前瞻性视角。