The diagnosis of most mental disorders, including psychiatric evaluations, primarily depends on dialogues between psychiatrists and patients. This subjective process can lead to variability in diagnoses across clinicians and patients, resulting in inconsistencies and challenges in achieving reliable outcomes. To address these issues and standardize psychiatric diagnoses, we propose a Fine-Tuned Large Language Model (LLM) Consortium and OpenAI-gpt-oss Reasoning LLM-enabled Decision Support System for the clinical diagnosis of mental disorders. Our approach leverages fine-tuned LLMs trained on conversational datasets involving psychiatrist-patient interactions focused on mental health conditions (e.g., depression). The diagnostic predictions from individual models are aggregated through a consensus-based decision-making process, refined by the OpenAI-gpt-oss reasoning LLM. We propose a novel method for deploying LLM agents that orchestrate communication between the LLM consortium and the reasoning LLM, ensuring transparency, reliability, and responsible AI across the entire diagnostic workflow. Experimental results demonstrate the transformative potential of combining fine-tuned LLMs with a reasoning model to create a robust and highly accurate diagnostic system for mental health assessment. A prototype of the proposed platform, integrating three fine-tuned LLMs with the OpenAI-gpt-oss reasoning LLM, was developed in collaboration with the U.S. Army Medical Research Team in Norfolk, Virginia, USA. To the best of our knowledge, this work represents the first application of a fine-tuned LLM consortium integrated with a reasoning LLM for clinical mental health diagnosis paving the way for next-generation AI-powered eHealth systems aimed at standardizing psychiatric diagnoses.
翻译:大多数精神障碍(包括精神科评估)的诊断主要依赖于精神科医生与患者之间的对话。这种主观过程可能导致不同临床医生和患者之间的诊断存在差异,从而造成结果不一致且难以获得可靠结论。为解决这些问题并实现精神疾病诊断的标准化,我们提出了一种基于微调大语言模型联盟与OpenAI-gpt-oss推理大语言模型的决策支持系统,用于精神障碍的临床诊断。我们的方法利用在精神科医患对话数据集上微调的大语言模型,这些数据集专注于心理健康状况(如抑郁症)。各模型的诊断预测通过基于共识的决策流程进行聚合,并由OpenAI-gpt-oss推理大语言模型进行优化。我们提出了一种新颖的大语言模型智能体部署方法,用于协调大语言模型联盟与推理大语言模型之间的通信,确保整个诊断工作流程的透明度、可靠性及负责任的人工智能应用。实验结果表明,将微调大语言模型与推理模型相结合,能够构建出稳健且高精度的心理健康评估诊断系统,具有变革性潜力。我们与美国弗吉尼亚州诺福克陆军医学研究团队合作,开发了一个集成三个微调大语言模型与OpenAI-gpt-oss推理大语言模型的平台原型。据我们所知,本研究首次将微调大语言模型联盟与推理大语言模型集成应用于临床心理健康诊断,为旨在标准化精神疾病诊断的下一代人工智能驱动电子健康系统开辟了道路。