We present HPD, the Harry Potter Dialogue Dataset, to facilitate research on building dialogue agents for characters in a story. It differs from existing dialogue datasets in two respects: 1) HPD provides rich background information about the Harry Potter novels, including scenes, character attributes, and character relations; 2) all of this background information changes as the story progresses. In other words, each dialogue session in HPD is tied to a different background, and the storyline determines how that background changes. We evaluate several baselines (e.g., GPT-2, BOB) with both automatic and human metrics to determine how well they can generate Harry Potter-like responses. Experimental results indicate that although the generated responses are fluent and relevant to the dialogue history, they still sound out of character for Harry, which leaves substantial headroom for future studies. Our dataset is available.
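To make the idea of a session-specific background concrete, the following is a minimal sketch of how one might represent such data in Python. It is not the released HPD schema: the class names, the field names (`scene`, `attributes`, `relations`, `position`), the helper `background_at`, and all placeholder values are assumptions introduced only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Background:
    """Story state attached to one dialogue session (hypothetical schema)."""
    scene: str
    attributes: Dict[str, str] = field(default_factory=dict)
    relations: Dict[str, str] = field(default_factory=dict)


@dataclass
class DialogueSession:
    """One session: its utterances plus the background valid at that point."""
    position: int                  # hypothetical ordering index along the storyline
    turns: List[Tuple[str, str]]   # (speaker, utterance) pairs
    background: Background


def background_at(sessions: List[DialogueSession], position: int) -> Background:
    """Return the background of the latest session at or before `position`."""
    eligible = [s for s in sessions if s.position <= position]
    if not eligible:
        raise ValueError("no session at or before this position")
    return max(eligible, key=lambda s: s.position).background


# Placeholder data only; none of these values come from the dataset.
sessions = [
    DialogueSession(
        position=1,
        turns=[("Harry", "<utterance>")],
        background=Background(scene="<scene A>", relations={"Ron": "<early relation>"}),
    ),
    DialogueSession(
        position=2,
        turns=[("Harry", "<utterance>")],
        background=Background(scene="<scene B>", relations={"Ron": "<later relation>"}),
    ),
]

print(background_at(sessions, 1).relations["Ron"])
print(background_at(sessions, 2).relations["Ron"])
```

Storing a full background snapshot with every session, rather than a single global character profile, is one simple way to reflect that the same speaker pair can have a different scene and relation at different points in the story.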