Tighter Bounds on the Expressivity of Transformer Encoders - 专知 (Zhuanzhi) Papers


January 25, 2023


David Chiang, Peter Cholak, Anand Pillay

Characterizing neural networks in terms of better-understood formal systems has the potential to yield new insights into the power and limitations of these networks. Doing so for transformers remains an active area of research. Bhattamishra and others have shown that transformer encoders are at least as expressive as a certain kind of counter machine, while Merrill and Sabharwal have shown that fixed-precision transformer encoders recognize only languages in uniform $TC^0$. We connect and strengthen these results by identifying a variant of first-order logic with counting quantifiers that is simultaneously an upper bound for fixed-precision transformer encoders and a lower bound for transformer encoders. This brings us much closer than before to an exact characterization of the languages that transformer encoders recognize.
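The relationships stated in the abstract can be summarized as a chain of inclusions between language classes. This is only a sketch: the symbols $\mathcal{L}(\cdot)$ (languages recognized by a model class), $\mathrm{CM}$ (the counter machines in question), $\mathrm{TE}$ (transformer encoders), $\mathrm{TE}_{\mathrm{fp}}$ (fixed-precision transformer encoders), and $\mathrm{FOC}^{*}$ (the variant of first-order logic with counting quantifiers) are placeholder names assumed here for illustration, not notation taken from the paper.

```latex
\begin{align*}
\mathcal{L}(\mathrm{CM}) &\subseteq \mathcal{L}(\mathrm{TE})
  && \text{(Bhattamishra and others)} \\
\mathcal{L}(\mathrm{TE}_{\mathrm{fp}}) &\subseteq \text{uniform } TC^0
  && \text{(Merrill and Sabharwal)} \\
\mathcal{L}(\mathrm{TE}_{\mathrm{fp}}) &\subseteq \mathcal{L}(\mathrm{FOC}^{*}) \subseteq \mathcal{L}(\mathrm{TE})
  && \text{(this work)}
\end{align*}
```

Read this way, the logic sits between the two model classes: it bounds the fixed-precision encoders from above and the unrestricted encoders from below, which is the sense in which the abstract describes the result as closer to an exact characterization.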


