multiMentalRoBERTa：一种用于心理健康障碍细粒度多类别分类的微调模型 (multiMentalRoBERTa: A Fine-tuned Multiclass Classifier for Mental Health Disorder)

The early detection of mental health disorders from social media text is critical for enabling timely support, risk assessment, and referral to appropriate resources. This work introduces multiMentalRoBERTa, a fine-tuned RoBERTa model designed for multiclass classification of common mental health conditions, including stress, anxiety, depression, post-traumatic stress disorder (PTSD), suicidal ideation, and neutral discourse. Drawing on multiple curated datasets, data exploration is conducted to analyze class overlaps, revealing strong correlations between depression and suicidal ideation as well as anxiety and PTSD, while stress emerges as a broad, overlapping category. Comparative experiments with traditional machine learning methods, domain-specific transformers, and prompting-based large language models demonstrate that multiMentalRoBERTa achieves superior performance, with macro F1-scores of 0.839 in the six-class setup and 0.870 in the five-class setup (excluding stress), outperforming both fine-tuned MentalBERT and baseline classifiers. Beyond predictive accuracy, explainability methods, including Layer Integrated Gradients and KeyBERT, are applied to identify lexical cues that drive classification, with a particular focus on distinguishing depression from suicidal ideation. The findings emphasize the effectiveness of fine-tuned transformers for reliable and interpretable detection in sensitive contexts, while also underscoring the importance of fairness, bias mitigation, and human-in-the-loop safety protocols. Overall, multiMentalRoBERTa is presented as a lightweight, robust, and deployable solution for enhancing support in mental health platforms.

翻译：基于社交媒体文本早期检测心理健康障碍对于及时提供支持、风险评估及转介至适当资源至关重要。本研究提出multiMentalRoBERTa，这是一种基于RoBERTa架构微调的模型，专为常见心理健康状况的多类别分类而设计，涵盖压力、焦虑、抑郁、创伤后应激障碍（PTSD）、自杀意念及中性话语。通过整合多个精选数据集进行数据探索，分析类别重叠现象，发现抑郁与自杀意念、焦虑与PTSD之间存在强相关性，而压力则呈现为广泛且重叠的类别。与传统机器学习方法、领域专用Transformer模型及基于提示的大语言模型对比实验表明，multiMentalRoBERTa在六类别设置中宏观F1分数达0.839，在五类别设置（排除压力）中达0.870，其性能优于微调版MentalBERT及基线分类器。除预测准确性外，研究采用层集成梯度与KeyBERT等可解释性方法识别驱动分类的词汇线索，特别聚焦于区分抑郁与自杀意念。研究结果强调了微调Transformer模型在敏感场景中实现可靠、可解释检测的有效性，同时凸显了公平性、偏见缓解及人机协同安全协议的重要性。总体而言，multiMentalRoBERTa作为一种轻量级、鲁棒且可部署的解决方案，为提升心理健康平台的支持能力提供了有效工具。