培训前语言模式的贬低性技术效果经验调查 (An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-Trained Language Models) - 专知论文

会员服务 ·

0

语言模型化 · 有偏 · MoDELS · Performer · NLU ·

2021 年 10 月 16 日

An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-Trained Language Models

翻译：培训前语言模式的贬低性技术效果经验调查

Nicholas Meade,Elinor Poole-Dayan,Siva Reddy

Recent work has shown that pre-trained language models capture social biases from the text corpora they are trained on. This has attracted attention to developing techniques that mitigate such biases. In this work, we perform a empirical survey of five recently proposed debiasing techniques: Counterfactual Data Augmentation (CDA), Dropout, Iterative Nullspace Projection, Self-Debias, and SentenceDebias. We quantify the effectiveness of each technique using three different bias benchmarks while also measuring the impact of these techniques on a model's language modeling ability, as well as its performance on downstream NLU tasks. We experimentally find that: (1) CDA and Self-Debias are the strongest of the debiasing techniques, obtaining improved scores on most of the bias benchmarks (2) Current debiasing techniques do not generalize well beyond gender bias; And (3) improvements on bias benchmarks such as StereoSet and CrowS-Pairs by using debiasing strategies are usually accompanied by a decrease in language modeling ability, making it difficult to determine whether the bias mitigation is effective.

翻译：最近的工作表明,经过培训的语文模式从它们所培训的文字体体中反映了社会偏见,这吸引了对减少这种偏见的技术的注意。在这项工作中,我们对最近提出的五种偏向性技术进行了实证调查:(1) 反事实数据增强技术(CDA)、辍学技术、循环性大气投影技术、自失能技术以及句子Debias技术。我们用三种不同的偏差基准来量化每种技术的有效性,同时衡量这些技术对模型语言建模能力的影响,以及这些技术对下游NLU任务的业绩。我们实验发现:(1) CDA和自毁技术是最强的偏向性技术,在多数偏向基准上获得更好的分数 (2) 目前的偏向性技术除了性别偏见之外没有普遍化;(3) 通过使用偏向战略改进StereoSet和CrowS-Pairs等偏向基准,通常伴随着语言建模能力的下降,因此难以确定偏见缓解是否有效。

0

相关内容

语言模型化

语言模型化

自然语言处理预训练技术综述

专知会员服务

80+阅读 · 2021年8月20日

移动数字广告与互联网反欺诈蓝皮报告

移动数字广告与互联网反欺诈蓝皮报告

专知会员服务

26+阅读 · 2021年5月13日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

专知会员服务

92+阅读 · 2020年4月13日

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

专知会员服务

96+阅读 · 2020年4月10日

【论文推荐】联邦学习的个性化技术综述，Survey of Personalization Techniques for Federated Learning

【论文推荐】联邦学习的个性化技术综述，Survey of Personalization Techniques for Federated Learning

专知会员服务

78+阅读 · 2020年3月19日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

12+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

32+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

169+阅读 · 2019年10月11日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Language Models are not Models of Language

Arxiv

0+阅读 · 2021年12月13日

The King is Naked: on the Notion of Robustness for Natural Language Processing

Arxiv

0+阅读 · 2021年12月13日

Probing Linguistic Information For Logical Inference In Pre-trained Language Models

Arxiv

5+阅读 · 2021年12月3日

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

Arxiv

30+阅读 · 2021年11月1日

Pre-trained Language Models in Biomedical Domain: A Systematic Survey

Arxiv

10+阅读 · 2021年10月12日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Investigating the Successes and Failures of BERT for Passage Re-Ranking

Investigating the Successes and Failures of BERT for Passage Re-Ranking

Arxiv

3+阅读 · 2019年5月5日

An Attentive Survey of Attention Models

Arxiv

19+阅读 · 2019年4月5日

When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?

Arxiv

3+阅读 · 2018年4月18日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

自然语言处理预训练技术综述

专知会员服务

80+阅读 · 2021年8月20日

移动数字广告与互联网反欺诈蓝皮报告

移动数字广告与互联网反欺诈蓝皮报告

专知会员服务

26+阅读 · 2021年5月13日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

【论文翻译】2020最新预训练语言模型综述：Pre-trained Models for Natural Language Processing: A Survey

专知会员服务

92+阅读 · 2020年4月13日

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

专知会员服务

96+阅读 · 2020年4月10日

【论文推荐】联邦学习的个性化技术综述，Survey of Personalization Techniques for Federated Learning

【论文推荐】联邦学习的个性化技术综述，Survey of Personalization Techniques for Federated Learning

专知会员服务

78+阅读 · 2020年3月19日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

12+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

32+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

169+阅读 · 2019年10月11日

热门VIP内容

相关资讯

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Language Models are not Models of Language

Arxiv

0+阅读 · 2021年12月13日

The King is Naked: on the Notion of Robustness for Natural Language Processing

Arxiv

0+阅读 · 2021年12月13日

Probing Linguistic Information For Logical Inference In Pre-trained Language Models

Arxiv

5+阅读 · 2021年12月3日

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

Arxiv

30+阅读 · 2021年11月1日

Pre-trained Language Models in Biomedical Domain: A Systematic Survey

Arxiv

10+阅读 · 2021年10月12日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Investigating the Successes and Failures of BERT for Passage Re-Ranking

Investigating the Successes and Failures of BERT for Passage Re-Ranking

Arxiv

3+阅读 · 2019年5月5日

An Attentive Survey of Attention Models

Arxiv

19+阅读 · 2019年4月5日

When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?

Arxiv

3+阅读 · 2018年4月18日

微信扫码咨询专知VIP会员