主题模型论文 - 专知

会员服务 ·

主题模型

主题模型，顾名思义，就是对文字中隐含主题的一种建模方法。“苹果”这个词的背后既包含是苹果公司这样一个主题，也包括了水果的主题。　　在这里，我们先定义一下主题究竟是什么。主题就是一个概念、一个方面。它表现为一系列相关的词语。比如一个文章如果涉及到“百度”这个主题，那么“中文搜索”、“李彦宏”等词语就会以较高的频率出现，而如果涉及到“IBM”这个主题，那么“笔记本”等就会出现的很频繁。如果用数学来描述一下的话，主题就是词汇表上词语的条件概率分布。与主题关系越密切的词语，它的条件概率越大，反之则越小。

The Deep Latent Position Topic Model for Clustering and Representation of Networks with Textual Edges

Arxiv

0+阅读 · 2023年4月14日

G2T: A Simple but Effective Framework for Topic Modeling based on Pretrained Language Model and Community Detection

Arxiv

0+阅读 · 2023年4月14日

G2T: A simple but versatile framework for topic modeling based on pretrained language model and community detection

G2T: A simple but versatile framework for topic modeling based on pretrained language model and community detection

Arxiv

0+阅读 · 2023年4月13日

A User-Centered, Interactive, Human-in-the-Loop Topic Modelling System

Arxiv

0+阅读 · 2023年4月4日

What Does the Indian Parliament Discuss? An Exploratory Analysis of the Question Hour in the Lok Sabha

Arxiv

0+阅读 · 2023年4月1日

Do Neural Topic Models Really Need Dropout? Analysis of the Effect of Dropout in Topic Modeling

Arxiv

0+阅读 · 2023年3月28日

Improving Contextualized Topic Models with Negative Sampling

Arxiv

0+阅读 · 2023年3月27日

参考链接

父主题

概率图模型

子主题

微信扫码咨询专知VIP会员