Pre-trained language models are known to propagate biases from their underlying training data to downstream tasks. However, these findings are predominantly based on monolingual English models; few studies investigate the biases encoded in language models for other languages. In this paper, we fill this gap by analysing gender bias in West Slavic language models. We introduce the first template-based dataset in Czech, Polish, and Slovak for measuring gender bias towards male, female, and non-binary subjects. We complete the sentences using both mono- and multilingual language models and assess their suitability for the masked language modelling objective. Next, we measure the gender bias encoded in West Slavic language models by quantifying the toxicity and genderness of the generated words. We find that these language models produce hurtful completions that depend on the subject's gender. Perhaps surprisingly, Czech, Slovak, and Polish language models produce more hurtful completions with men as subjects, which, upon inspection, we find is due to completions being related to violence, death, and sickness.