Language embeds information about the social, cultural, and political values people hold. Prior work has explored social and potentially harmful biases encoded in Pre-Trained Language Models (PTLMs). However, there has been no systematic study investigating how values embedded in these models vary across cultures. In this paper, we introduce probes to study which cross-cultural values are embedded in these models, and whether they align with existing theories and cross-cultural value surveys. We find that PTLMs capture differences in values across cultures, but these only weakly align with established value surveys. We discuss the implications of using misaligned models in cross-cultural settings, as well as ways of aligning PTLMs with value surveys.