Online encyclopedias, such as Wikipedia, have been well-developed and researched in the last two decades. One can find any attributes or other information of a wiki item on a wiki page edited by a community of volunteers. However, the traditional text along with images can hardly express some other aspects of an item. For example, when we talk about "Shiba Inu", one may care more about "How to feed it" or "How to train it to not protect its food". Currently, short-video platforms have become a hallmark in the online world. Whether you're on TikTok, Instagram, Kuaishou, or YouTube Shorts, short-video apps have changed how we consume and create content today. Except for entertainment short videos, we can find more and more authors sharing insightful knowledge widely across all walks of life. These short videos, which we call knowledge videos, can easily express any aspects (E.g. hair or how-to-feed) consumers want to know about an item (E.g. Shiba Inu), and they can be systematically analyzed and organized like an online encyclopedia. In this paper, we propose Kuaipedia, a massive multi-modal encyclopedia consisting of items, aspects, and short videos linking to them, which is extracted from billions of videos of Kuaishou, a well-known short-video platform in China. We first collected items from multiple sources and mined user-centered aspects from millions of users' queries to build an item-aspect tree. Then we propose a new task called "multi-modal item-aspect linking" as an expansion of "entity linking" to link short videos into item-aspect pairs and build the whole short video encyclopedia. Intrinsic evaluations show that our encyclopedia is of large scale and highly accurate.
翻译:在线百科全书( 如维基百科) 等在线百科全书( 维基百科) 在过去二十年中已经得到了完善和研究。 人们可以在由志愿者团体编辑的维基页面上找到任何维基内容的属性或其他信息。 然而, 传统文本和图像几乎无法表达某项内容的其他方面。 例如, 当我们谈论“ 希巴伊努 ” 时, 人们可能更关心“ 如何喂养它” 或“ 如何训练它来保护食物 ” 。 目前, 短视平台已经成为了在线世界的标志。 您是否在TikTok、 Instagram、 Kuaishou 或 YouTube Shorts 上找到任何维基内容的属性或其他信息。 短视应用程序改变了我们今天如何消费和创建内容。 除了娱乐短视视频之外, 我们能找到越来越多的作者在生活各行各行各业之间分享深刻的知识。 这些短视视频可以轻松地表达任何方面( E. g. ha ) 。 短视平台的短视网站和短视数据从我们可以系统分析并组织成一个链接。