Building a socially intelligent agent involves many challenges, one of which is to teach the agent to speak guided by its value like a human. However, value-driven chatbots are still understudied in the area of dialogue systems. Most existing datasets focus on commonsense reasoning or social norm modeling. In this work, we present a new large-scale human value dataset called ValueNet, which contains human attitudes on 21,374 text scenarios. The dataset is organized in ten dimensions that conform to the basic human value theory in intercultural research. We further develop a Transformer-based value regression model on ValueNet to learn the utility distribution. Comprehensive empirical results show that the learned value model could benefit a wide range of dialogue tasks. For example, by teaching a generative agent with reinforcement learning and the rewards from the value model, our method attains state-of-the-art performance on the personalized dialog generation dataset: Persona-Chat. With values as additional features, existing emotion recognition models enable capturing rich human emotions in the context, which further improves the empathetic response generation performance in the EmpatheticDialogues dataset. To the best of our knowledge, ValueNet is the first large-scale text dataset for human value modeling, and we are the first one trying to incorporate a value model into emotionally intelligent dialogue systems. The dataset is available at https://liang-qiu.github.io/ValueNet/.
翻译:建设社会智能剂涉及许多挑战,其中之一是教导代理商以其价值如人的价值来说话。然而,价值驱动的聊天机仍然在对话系统领域研究不足。大多数现有数据集侧重于常识推理或社会规范建模。在这项工作中,我们展示了一个新的大规模人类价值数据集,称为价值网,其中包含21 374个文本情景的人类态度。数据集以十个维度组成,这符合文化间研究中基本的人类价值理论。我们在价值网上进一步开发一个基于变异器的价值回归模型,以学习实用性分布。综合经验结果显示,学习的价值模型可以有益于广泛的对话任务。例如,通过教授一个带有强化学习和来自价值模型的奖励的基因化工具,我们的方法在个人化对话生成数据集上取得了最先进的表现:人文-Chat。现有的情感识别模型作为附加特征,可以捕捉到环境中的丰富的人类情感。这进一步改进了模型Dalder-Dialogs的第一个模型/智能数据生成性能,这是我们最高级的智能数据,我们最能测试的模型/智能系统。