通过异种网络嵌入的对身份敏感的单词 (Identity-sensitive Word Embedding through Heterogeneous Networks)

Most existing word embedding approaches do not distinguish the same words in different contexts, therefore ignoring their contextual meanings. As a result, the learned embeddings of these words are usually a mixture of multiple meanings. In this paper, we acknowledge multiple identities of the same word in different contexts and learn the \textbf{identity-sensitive} word embeddings. Based on an identity-labeled text corpora, a heterogeneous network of words and word identities is constructed to model different-levels of word co-occurrences. The heterogeneous network is further embedded into a low-dimensional space through a principled network embedding approach, through which we are able to obtain the embeddings of words and the embeddings of word identities. We study three different types of word identities including topics, sentiments and categories. Experimental results on real-world data sets show that the identity-sensitive word embeddings learned by our approach indeed capture different meanings of words and outperforms competitive methods on tasks including text classification and word similarity computation.

翻译：大多数现有的嵌入字词方法在不同背景中并不区分相同的词,因此忽略了它们的背景含义。因此, 这些字所学的嵌入通常是多种含义的混合体。在本文中, 我们承认不同背景中同一词的多重特性, 并学习了 \ textbf{ 身份敏感} 字嵌入。基于身份标签的文本组合, 构建了一个不同的词和字身份网络, 以模拟不同层次的单词共发。混成的网络通过一个原则性网络嵌入法进一步嵌入一个低维空间, 通过这个方法, 我们能够获得文字嵌入和单词身份嵌入。我们研究了三种不同的单词特性, 包括主题、情绪和类别。真实世界数据集的实验结果显示, 通过我们的方法所学到的身份敏感字嵌入的词确实捕捉了不同的文字含义, 并且超越了包括文本分类和类似词的计算在内的任务的竞争方法。

相关内容

异构网络

关注 5

在计算机网络中，异构网络是一种连接计算机和其他设备的网络，其中操作系统和协议有显著差异。例如，将基于微软Windows和Linux的个人计算机与苹果Macintosh计算机连接起来的局域网(LANs)是异构的。异构网络也被用于使用不同接入技术的无线网络中。例如，通过无线局域网提供服务并在切换到蜂窝网络时能够维持服务的无线网络称为无线异构网络。

【AAAI 2019】双曲异构信息网络嵌入，Hyperbolic Heterogeneous Information Network Embedding

专知会员服务

59+阅读 · 2020年6月28日

【AAAI 2020】双曲图注意力网络，Hyperbolic Graph Attention Network

专知会员服务

93+阅读 · 2020年6月15日

【WWW 2019】异质图注意力网络，Heterogeneous Graph Attention Network

专知会员服务

74+阅读 · 2020年6月14日

【ACL 2020】低维双曲知识图谱嵌入，Low-Dimensional Hyperbolic Knowledge Graph Embeddings

专知会员服务

74+阅读 · 2020年6月14日