We explore Boccaccio's Decameron to see how digital humanities tools can be used for tasks that have limited data in a language no longer in contemporary use: medieval Italian. We focus our analysis on the question: Do the different storytellers in the text exhibit distinct personalities? To answer this question, we curate and release a dataset based on the authoritative edition of the text. We use supervised classification methods to predict storytellers based on the stories they tell, confirming the difficulty of the task, and demonstrate that topic modeling can extract thematic storyteller "profiles."
翻译:我们探索Boccaccio的Decameron, 看看如何将数字人文学工具用于那些在当代不再使用的语言(中世纪意大利语)中数据有限的任务。 我们集中分析一个问题: 文本中不同的讲故事者是否具有不同的个性? 为了回答这个问题, 我们根据权威文本编辑并发布数据集。 我们使用监管分类方法根据他们讲述的故事预测讲故事者, 证实任务难度, 并证明主题建模可以提取专题故事家的“ 描述 ” 。