Human annotation for syntactic parsing is expensive, and large resources are available only for a fraction of languages. A question we ask is whether one can leverage abundant unlabeled texts to improve syntactic parsers, beyond just using the texts to obtain more generalisable lexical features (i.e., beyond word embeddings). To this end, we propose a novel latent-variable generative model for semi-supervised syntactic dependency parsing. As exact inference is intractable, we introduce a differentiable relaxation to obtain approximate samples and compute gradients with respect to the parser parameters. Our method (Differentiable Perturb-and-Parse) relies on differentiable dynamic programming over stochastically perturbed edge scores. We demonstrate the effectiveness of our approach with experiments on English, French and Swedish.
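The core idea of Perturb-and-Parse, as stated above, is to add stochastic noise to edge scores and then run a differentiable parsing procedure on them. The following is a minimal illustrative sketch, not the paper's implementation: the function name is invented, and the relaxed parser here is a simple per-dependent softmax over candidate heads, which stands in for the paper's actual differentiable dynamic program (a continuously relaxed Eisner algorithm) and ignores the tree constraint.

```python
import numpy as np

def perturb_and_soft_parse(scores, temperature=1.0, rng=None):
    """Illustrative sketch of the perturb-then-relax idea.

    scores[i, j] is the score of word j heading dependent i. We add
    Gumbel(0, 1) noise to every edge score (Perturb-and-MAP style),
    then replace the discrete argmax over heads with a temperature-
    controlled softmax, so the output is differentiable in `scores`.
    NOTE: the real method relaxes a dynamic program over projective
    trees; the softmax here is a simplified stand-in.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    perturbed = scores + rng.gumbel(size=scores.shape)
    z = perturbed / temperature
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)  # each row sums to 1
```

As the temperature approaches zero, each row approaches a one-hot head choice (a hard sample); larger temperatures give softer, lower-variance relaxations, which is the usual trade-off when back-propagating through discrete structure.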