Drawing Causal Inferences About Performance Effects in NLP - 专知论文 (Zhuanzhi Papers)

September 14, 2022

Drawing Causal Inferences About Performance Effects in NLP

Sandra Wankmüller

From arXiv, 15 pages

This article emphasizes that NLP as a science seeks to make inferences about the performance effects that result from applying one method (compared to another method) in the processing of natural language. Yet NLP research in practice usually does not achieve this goal: In NLP research articles, typically only a few models are compared. Each model results from a specific procedural pipeline (here named processing system) that is composed of a specific collection of methods that are used in preprocessing, pretraining, hyperparameter tuning, and training on the target task. To make generalizing inferences about the performance effect that is caused by applying some method A vs. another method B, it is not sufficient to compare a few specific models that are produced by a few specific (probably incomparable) processing systems. Rather, the following procedure would allow drawing inferences about methods' performance effects:

(1) A population of processing systems that researchers seek to infer to has to be defined.
(2) A random sample of processing systems from this population is drawn. (The drawn processing systems in the sample will vary with regard to the methods they apply along their procedural pipelines and also will vary regarding the compositions of their training and test data sets used for training and evaluation.)
(3) Each processing system is applied once with method A and once with method B.
(4) Based on the sample of applied processing systems, the expected generalization errors of method A and method B are approximated.
(5) The difference between the expected generalization errors of method A and method B is the estimated average treatment effect due to applying method A compared to method B in the population of processing systems.
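
The five-step procedure in the abstract can be illustrated with a small simulation. The following is only a sketch under stated assumptions, not the paper's implementation: the `ProcessingSystem` fields, the option lists, and the error model inside `run_system` are all hypothetical stand-ins for real pipelines, and the test error is simulated rather than obtained by training a model.

```python
import random
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class ProcessingSystem:
    """Hypothetical description of one procedural pipeline."""
    preprocessing: str
    pretraining: str
    data_seed: int  # stands in for the train/test data composition


def sample_processing_system(rng: random.Random) -> ProcessingSystem:
    # Steps 1 and 2: the population is defined implicitly by these
    # (assumed) option lists; one system is drawn uniformly at random.
    return ProcessingSystem(
        preprocessing=rng.choice(["lowercase", "cased"]),
        pretraining=rng.choice(["none", "small_corpus", "large_corpus"]),
        data_seed=rng.randrange(10**6),
    )


def run_system(system: ProcessingSystem, method: str) -> float:
    # Step 3: simulated test error of one system under method "A" or "B".
    # In a real study this would train and evaluate an actual model.
    noise_rng = random.Random(system.data_seed)  # same data for A and B
    error = 0.30
    error -= {"none": 0.0, "small_corpus": 0.04, "large_corpus": 0.08}[system.pretraining]
    error += 0.01 if system.preprocessing == "lowercase" else 0.0
    error += noise_rng.gauss(0.0, 0.02)
    if method == "A":
        # Assumed heterogeneous effect: A helps more without pretraining.
        error -= {"none": 0.05, "small_corpus": 0.03, "large_corpus": 0.01}[system.pretraining]
    return min(1.0, max(0.0, error))


def estimate_ate(n_systems: int, seed: int = 0) -> tuple[float, float, float]:
    rng = random.Random(seed)
    sample = [sample_processing_system(rng) for _ in range(n_systems)]
    # Step 4: approximate the expected generalization error per method.
    err_a = mean(run_system(s, "A") for s in sample)
    err_b = mean(run_system(s, "B") for s in sample)
    # Step 5: the difference is the estimated average treatment effect.
    return err_a, err_b, err_a - err_b


if __name__ == "__main__":
    err_a, err_b, ate = estimate_ate(200, seed=1)
    print(f"error(A)={err_a:.4f}  error(B)={err_b:.4f}  ATE={ate:.4f}")
```

Because every sampled system is run once with each method on the same simulated data, the comparison is paired, and the estimate averages over pipeline variation instead of reflecting a single hand-picked configuration. A negative value here means that method A lowers the error relative to method B on average in the assumed population.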

