Tracr:编译变形器作为可解释性实验室 (Tracr: Compiled Transformers as a Laboratory for Interpretability) - 专知论文

会员服务 ·

0

变换 · 编译器 · MoDELS · 真实值 · TOOLS ·

2023 年 1 月 12 日

Tracr: Compiled Transformers as a Laboratory for Interpretability

翻译：Tracr:编译变形器作为可解释性实验室

David Lindner,János Kramár,Matthew Rahtz,Thomas McGrath,Vladimir Mikulik

Interpretability research aims to build tools for understanding machine learning (ML) models. However, such tools are inherently hard to evaluate because we do not have ground truth information about how ML models actually work. In this work, we propose to build transformer models manually as a testbed for interpretability research. We introduce Tracr, a "compiler" for translating human-readable programs into weights of a transformer model. Tracr takes code written in RASP, a domain-specific language (Weiss et al. 2021), and translates it into weights for a standard, decoder-only, GPT-like transformer architecture. We use Tracr to create a range of ground truth transformers that implement programs including computing token frequencies, sorting, and Dyck-n parenthesis checking, among others. To enable the broader research community to explore and use compiled models, we provide an open-source implementation of Tracr at https://github.com/deepmind/tracr.

翻译：解释性研究旨在建立理解机器学习(ML)模型的工具。但是,这些工具本身很难评估, 因为我们没有关于ML模型实际作用的地面真实信息。在这项工作中, 我们提议手工建立变压器模型, 作为可解释性研究的测试台。我们引入了Tracr, 将人读程序转换成变压器模型的重量的“ compiler ” 。 Tracr 使用一种域名语言( Weiss et al. 2021) 的 RASP 代码, 并将其转换成标准、解码器、类似 GPT 的变压器结构的重量。我们使用Tracr 创建了一系列地面变压器, 实施程序, 包括计算符号频率、排序和 Dyck- n 母体检查等。为了让更广泛的研究界探索和使用编译的模型, 我们在 https://github.com/ deepmind/tracr提供Tracr 的公开源实施。

0

相关内容

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

278+阅读 · 2020年11月26日

2020数据工程师成长路线图

专知会员服务

17+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

28+阅读 · 2019年10月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

144+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

49+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

35+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

1+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

28+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

通过工程化改造Oct4研究体细胞重编程过程的基因组调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

星形胶质细胞内源性PLD正性调控树突的发育

国家自然科学基金

0+阅读 · 2013年12月31日

IL-32/Integrins/FAK通路在肝纤维化形成中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

C1型尼曼-匹克氏症轴突发育异常的病理机制

国家自然科学基金

0+阅读 · 2013年12月31日

ROC1活化mTOR通路促进膀胱癌侵袭及转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肝移植术后缺血型胆道病变发生机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MRTF-A调控CYR61介导间充质干细胞向内皮细胞分化的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

HGF诱导NSCLC细胞对EGFR-TKIs耐药机制的研究。

国家自然科学基金

0+阅读 · 2011年12月31日

肝癌磁共振成像的高效靶向造影剂CS-LA@SPION的制备及应用基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

Knowledge-augmented Graph Machine Learning for Drug Discovery: A Survey from Precision to Interpretability

Arxiv

0+阅读 · 2023年3月7日

Interpretable Architecture Neural Networks for Function Visualization

Arxiv

0+阅读 · 2023年3月3日

Interpretable reduced-order modeling with time-scale separation

Arxiv

0+阅读 · 2023年3月3日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Interpretable and Efficient Heterogeneous Graph Convolutional Network

Arxiv

15+阅读 · 2021年9月8日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey of Transformers

Arxiv

102+阅读 · 2021年6月8日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

11+阅读 · 2020年6月23日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

18+阅读 · 2019年1月14日

VIP会员

文章信息

相关主题

相关VIP内容

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

278+阅读 · 2020年11月26日

2020数据工程师成长路线图

专知会员服务

17+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

28+阅读 · 2019年10月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

144+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

49+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

35+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

热门VIP内容

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

1+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

28+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

Knowledge-augmented Graph Machine Learning for Drug Discovery: A Survey from Precision to Interpretability

Arxiv

0+阅读 · 2023年3月7日

Interpretable Architecture Neural Networks for Function Visualization

Arxiv

0+阅读 · 2023年3月3日

Interpretable reduced-order modeling with time-scale separation

Arxiv

0+阅读 · 2023年3月3日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Interpretable and Efficient Heterogeneous Graph Convolutional Network

Arxiv

15+阅读 · 2021年9月8日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey of Transformers

Arxiv

102+阅读 · 2021年6月8日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

11+阅读 · 2020年6月23日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

18+阅读 · 2019年1月14日

相关基金

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

通过工程化改造Oct4研究体细胞重编程过程的基因组调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

星形胶质细胞内源性PLD正性调控树突的发育

国家自然科学基金

0+阅读 · 2013年12月31日

IL-32/Integrins/FAK通路在肝纤维化形成中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

C1型尼曼-匹克氏症轴突发育异常的病理机制

国家自然科学基金

0+阅读 · 2013年12月31日

ROC1活化mTOR通路促进膀胱癌侵袭及转移的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肝移植术后缺血型胆道病变发生机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MRTF-A调控CYR61介导间充质干细胞向内皮细胞分化的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

HGF诱导NSCLC细胞对EGFR-TKIs耐药机制的研究。

国家自然科学基金

0+阅读 · 2011年12月31日

肝癌磁共振成像的高效靶向造影剂CS-LA@SPION的制备及应用基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员