新颖的多轴偏见评估度量标准Bipol及其NLP解释性 (Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP) - 专知论文

会员服务 ·

0

度量 · NLP · 数据集 · 文本数据 · 检测模型 ·

2023 年 4 月 8 日

Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP

翻译：新颖的多轴偏见评估度量标准Bipol及其NLP解释性

Lama Alkhaled,Tosin Adewumi,Sana Sabah Sabry

from arxiv, 12 pages, 4 images

We introduce bipol, a new metric with explainability, for estimating social bias in text data. Harmful bias is prevalent in many online sources of data that are used for training machine learning (ML) models. In a step to address this challenge we create a novel metric that involves a two-step process: corpus-level evaluation based on model classification and sentence-level evaluation based on (sensitive) term frequency (TF). After creating new models to detect bias along multiple axes using SotA architectures, we evaluate two popular NLP datasets (COPA and SQUAD). As additional contribution, we created a large dataset (with almost 2 million labelled samples) for training models in bias detection and make it publicly available. We also make public our codes.

翻译：我们介绍了Bipol，这是一种具有解释性的新指标，用于评估文本数据中的社会偏见。有害的偏见在许多在线数据来源中普遍存在，这些数据用于训练机器学习（ML）模型。为了解决这一挑战，我们创建了一种新的指标，它涉及两个步骤：基于模型分类的语料库级别评估和基于（敏感）词频（TF）的句子级别评估。创建了使用SotA架构检测多个轴上的偏差的新模型后，我们评估了两个流行的NLP数据集（COPA和SQUAD）。作为额外的贡献，我们创建了一个大型数据集（几乎有200万个标记样本）来训练偏见检测模型，并将其公开。我们还公开了我们的代码。

0

相关内容

谷歌教你学 AI -机器学习的7步骤

谷歌教你学 AI -机器学习的7步骤

专知会员服务

28+阅读 · 2022年3月13日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

专知会员服务

56+阅读 · 2020年2月10日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

专知会员服务

16+阅读 · 2019年12月18日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

102+阅读 · 2019年12月9日

253页通俗易懂最新的机器学习系统入门书籍（Machine-Learning-Systems）（附pdf下载）

253页通俗易懂最新的机器学习系统入门书籍（Machine-Learning-Systems）（附pdf下载）

专知会员服务

77+阅读 · 2019年10月27日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

AI可解释性文献列表

AI可解释性文献列表

专知

42+阅读 · 2019年10月7日

推荐：一文教你如何处理不平衡数据集（附代码）

推荐：一文教你如何处理不平衡数据集（附代码）

数据分析

20+阅读 · 2019年6月3日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【数据集】新的YELP数据集官方下载

【数据集】新的YELP数据集官方下载

机器学习研究会

16+阅读 · 2017年8月31日

基于在线消费者购买意向挖掘的个性化推荐研究

国家自然科学基金

0+阅读 · 2015年12月31日

可调控的功能性水相超分子聚合物的构建及性能

国家自然科学基金

0+阅读 · 2014年12月31日

多参数传热反问题的RBF-MLPG方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

乙肝病毒与弥漫性大B细胞淋巴瘤因果关联的生物学证据及其作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

miR-301a在缺氧诱导胰腺癌EMT中的功能机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

科研网络社区中社会化的知识推荐方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

微小RNA-1268在先天性心脏病发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

风险投资支持的企业IPO折价、择机与后管理问题研究

国家自然科学基金

0+阅读 · 2011年12月31日

关于流形学习的有效性算法与特征提取解释理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

p53对大肠癌中Numb/Notch信号通路调控的分子机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

Arxiv

0+阅读 · 2023年5月26日

DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation

Arxiv

0+阅读 · 2023年5月25日

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Arxiv

0+阅读 · 2023年5月24日

Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Arxiv

0+阅读 · 2023年5月24日

Benchmarking Arabic AI with Large Language Models

Arxiv

0+阅读 · 2023年5月24日

FITNESS: A Causal De-correlation Approach for Mitigating Bias in Machine Learning Software

Arxiv

0+阅读 · 2023年5月23日

A Survey of Explainable Graph Neural Networks: Taxonomy and Evaluation Metrics

Arxiv

14+阅读 · 2022年7月26日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey of the State of Explainable AI for Natural Language Processing

Arxiv

26+阅读 · 2020年10月1日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

VIP会员

文章信息

相关主题

相关VIP内容

谷歌教你学 AI -机器学习的7步骤

谷歌教你学 AI -机器学习的7步骤

专知会员服务

28+阅读 · 2022年3月13日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

专知会员服务

56+阅读 · 2020年2月10日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

【斯坦福大学ICLR2020】无任务的持续元学习，Continue Meta-learning without tasks

专知会员服务

16+阅读 · 2019年12月18日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

102+阅读 · 2019年12月9日

253页通俗易懂最新的机器学习系统入门书籍（Machine-Learning-Systems）（附pdf下载）

253页通俗易懂最新的机器学习系统入门书籍（Machine-Learning-Systems）（附pdf下载）

专知会员服务

77+阅读 · 2019年10月27日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能治理的未来

模态感知的特征匹配：单一模态与跨模态技术的全面综述

无监督行人重识别研究综述

【牛津博士论文】面向神经影像应用的可扩展且可解释的空间模型

相关资讯

AI可解释性文献列表

AI可解释性文献列表

专知

42+阅读 · 2019年10月7日

推荐：一文教你如何处理不平衡数据集（附代码）

推荐：一文教你如何处理不平衡数据集（附代码）

数据分析

20+阅读 · 2019年6月3日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【数据集】新的YELP数据集官方下载

【数据集】新的YELP数据集官方下载

机器学习研究会

16+阅读 · 2017年8月31日

相关论文

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

Arxiv

0+阅读 · 2023年5月26日

DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation

Arxiv

0+阅读 · 2023年5月25日

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Learning Answer Generation using Supervision from Automatic Question Answering Evaluators

Arxiv

0+阅读 · 2023年5月24日

Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Arxiv

0+阅读 · 2023年5月24日

Benchmarking Arabic AI with Large Language Models

Arxiv

0+阅读 · 2023年5月24日

FITNESS: A Causal De-correlation Approach for Mitigating Bias in Machine Learning Software

Arxiv

0+阅读 · 2023年5月23日

A Survey of Explainable Graph Neural Networks: Taxonomy and Evaluation Metrics

Arxiv

14+阅读 · 2022年7月26日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey of the State of Explainable AI for Natural Language Processing

Arxiv

26+阅读 · 2020年10月1日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

相关基金

基于在线消费者购买意向挖掘的个性化推荐研究

国家自然科学基金

0+阅读 · 2015年12月31日

可调控的功能性水相超分子聚合物的构建及性能

国家自然科学基金

0+阅读 · 2014年12月31日

多参数传热反问题的RBF-MLPG方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

乙肝病毒与弥漫性大B细胞淋巴瘤因果关联的生物学证据及其作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

miR-301a在缺氧诱导胰腺癌EMT中的功能机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

科研网络社区中社会化的知识推荐方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

微小RNA-1268在先天性心脏病发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

风险投资支持的企业IPO折价、择机与后管理问题研究

国家自然科学基金

0+阅读 · 2011年12月31日

关于流形学习的有效性算法与特征提取解释理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

p53对大肠癌中Numb/Notch信号通路调控的分子机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员