Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clues about physical and mental health conditions. In this paper, we argue that it is feasible to identify depression at an early stage by mining online social behaviours. Our approach, which is innovative to the practice of depression detection, does not rely on the extraction of numerous or complicated features to achieve accurate depression detection. Instead, we propose a novel classifier, namely, Cost-sensitive Boosting Pruning Trees (CBPT), which demonstrates a strong classification ability on two publicly accessible Twitter depression detection datasets. To comprehensively evaluate the classification capability of the CBPT, we use additional three datasets from the UCI machine learning repository and the CBPT obtains appealing classification results against several state of the arts boosting algorithms. Finally, we comprehensively explore the influence factors of model prediction, and the results manifest that our proposed framework is promising for identifying Twitter users with depression.
翻译:抑郁症是最常见的心理健康疾病之一,大量抑郁症患者每年自杀。潜在的抑郁患者通常不咨询心理医生,因为他们感到羞耻或不知道任何抑郁症,这可能导致诊断和治疗严重延误。与此同时,有证据表明社交媒体数据为身心健康状况提供了宝贵的线索。在本文中,我们争辩说,通过开发在线社会行为来早期识别抑郁症是可行的。我们的方法是针对抑郁症检测做法的创新做法,并不依靠提取众多或复杂的特征来实现准确的抑郁症检测。相反,我们提议了一个新的分类器,即成本敏感型的缓冲树(CBPT),它展示了两种公众可访问的Twitter抑郁症检测数据集的强大分类能力。为了全面评估CBPT的分类能力,我们使用UCI机器学习库和CBPT的另外三个数据集,对艺术增强算法的若干状态产生了吸引力。最后,我们全面探索了模型预测的影响因素,并表明我们提议的框架有望确定患有抑郁症的Twitter用户。