未筛选图像文本数据集：揭示人口统计偏见 (Uncurated Image-Text Datasets: Shedding Light on Demographic Bias) - 专知论文

会员服务 ·

0

注释（编程） · 数据集 · 文本数据 · 视觉语言模型 · 图像标注 ·

2023 年 4 月 6 日

Uncurated Image-Text Datasets: Shedding Light on Demographic Bias

翻译：未筛选图像文本数据集：揭示人口统计偏见

Noa Garcia,Yusuke Hirota,Yankun Wu,Yuta Nakashima

from arxiv, CVPR 2023

The increasing tendency to collect large and uncurated datasets to train vision-and-language models has raised concerns about fair representations. It is known that even small but manually annotated datasets, such as MSCOCO, are affected by societal bias. This problem, far from being solved, may be getting worse with data crawled from the Internet without much control. In addition, the lack of tools to analyze societal bias in big collections of images makes addressing the problem extremely challenging. Our first contribution is to annotate part of the Google Conceptual Captions dataset, widely used for training vision-and-language models, with four demographic and two contextual attributes. Our second contribution is to conduct a comprehensive analysis of the annotations, focusing on how different demographic groups are represented. Our last contribution lies in evaluating three prevailing vision-and-language tasks: image captioning, text-image CLIP embeddings, and text-to-image generation, showing that societal bias is a persistent problem in all of them.

翻译：收集庞大而未经筛选的数据集以训练视觉语言模型的趋势日益增多，引起了公平表示的担忧。已知即使是像MSCOCO这样的小型但手动注释的数据集也受到社会偏见的影响。这个问题远未得到解决，随着从互联网爬取数据而缺乏严格控制，可能会变得更加严重。此外，缺乏分析大量图像中社会偏见的工具也使解决问题变得极其具有挑战性。我们的第一项贡献是使用四个人口统计学和两个上下文属性对广泛用于训练视觉语言模型的Google Conceptual Captions数据集的部分进行注释。我们的第二项贡献是对注释进行全面分析，重点关注不同人口统计群体的代表性。我们的最后一项贡献在于评估三个普遍的视觉语言任务：图像标注、文本-图像CLIP嵌入和文本-图像生成，表明社会偏见在所有任务中都是一个持久的问题。

0

相关内容

注释（编程）

注释（编程）

注释（编程）

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

自然语言处理顶会EMNLP2021奖项公布，剑桥刘方宇、哥大杨子小帆一作论文分获最佳长、短论文奖

自然语言处理顶会EMNLP2021奖项公布，剑桥刘方宇、哥大杨子小帆一作论文分获最佳长、短论文奖

专知会员服务

14+阅读 · 2021年10月31日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

专知会员服务

26+阅读 · 2020年4月2日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

专知会员服务

92+阅读 · 2019年12月22日

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

专知会员服务

45+阅读 · 2019年11月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

半监督进化文本聚类算法在动态多源文本分析上的研究

国家自然科学基金

2+阅读 · 2014年12月31日

大气污染对人口迁移的影响研究

国家自然科学基金

0+阅读 · 2013年12月31日

PCV2感染猪肺泡巨噬细胞自噬过程中miRNA差异表达谱及靶基因功能调控网络研究

国家自然科学基金

0+阅读 · 2013年12月31日

污泥掺烧过程中Cl/S/P交互作用对重金属迁移转化和脱除影响的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新疆布鲁氏菌病传播机理的动力学模型和疾病预防控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

布鲁氏菌感染巨噬细胞诱导的自噬抑制细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

城市大气重金属干湿沉降对土壤-蔬菜系统的污染效应

国家自然科学基金

0+阅读 · 2012年12月31日

不同膳食对单纯餐后高血糖型糖尿病血清游离脂肪酸谱影响的代谢组学研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于fMRI的个性化图像情感标注及其本体库研究

国家自然科学基金

0+阅读 · 2009年12月31日

多文种文档图像识别的多层次马尔可夫随机场模型研究

国家自然科学基金

1+阅读 · 2008年12月31日

What You See is What You Read? Improving Text-Image Alignment Evaluation

Arxiv

0+阅读 · 2023年5月22日

On The Empirical Effectiveness of Unrealistic Adversarial Hardening Against Realistic Adversarial Attacks

Arxiv

0+阅读 · 2023年5月22日

Model Debiasing via Gradient-based Explanation on Representation

Arxiv

0+阅读 · 2023年5月20日

Survey of Automatic Plankton Image Recognition: Challenges, Existing Solutions and Future Perspectives

Arxiv

0+阅读 · 2023年5月19日

A Survey of Deep Graph Clustering: Taxonomy, Challenge, and Application

Arxiv

13+阅读 · 2022年11月23日

Multi-Modal Knowledge Graph Construction and Application: A Survey

Arxiv

79+阅读 · 2022年2月11日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Arxiv

17+阅读 · 2019年10月9日

VIP会员

文章信息

相关主题

注释（编程）

视觉语言模型

相关VIP内容

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

自然语言处理顶会EMNLP2021奖项公布，剑桥刘方宇、哥大杨子小帆一作论文分获最佳长、短论文奖

自然语言处理顶会EMNLP2021奖项公布，剑桥刘方宇、哥大杨子小帆一作论文分获最佳长、短论文奖

专知会员服务

14+阅读 · 2021年10月31日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

专知会员服务

26+阅读 · 2020年4月2日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

【AAAI2020】多模态注意力语义图嵌入多标签分类（Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification）

专知会员服务

92+阅读 · 2019年12月22日

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

【AAAI2020接受论文】利用图卷积网络将知识注入文本任务，Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

专知会员服务

45+阅读 · 2019年11月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

What You See is What You Read? Improving Text-Image Alignment Evaluation

Arxiv

0+阅读 · 2023年5月22日

On The Empirical Effectiveness of Unrealistic Adversarial Hardening Against Realistic Adversarial Attacks

Arxiv

0+阅读 · 2023年5月22日

Model Debiasing via Gradient-based Explanation on Representation

Arxiv

0+阅读 · 2023年5月20日

Survey of Automatic Plankton Image Recognition: Challenges, Existing Solutions and Future Perspectives

Arxiv

0+阅读 · 2023年5月19日

A Survey of Deep Graph Clustering: Taxonomy, Challenge, and Application

Arxiv

13+阅读 · 2022年11月23日

Multi-Modal Knowledge Graph Construction and Application: A Survey

Arxiv

79+阅读 · 2022年2月11日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Arxiv

17+阅读 · 2019年10月9日

相关基金

半监督进化文本聚类算法在动态多源文本分析上的研究

国家自然科学基金

2+阅读 · 2014年12月31日

大气污染对人口迁移的影响研究

国家自然科学基金

0+阅读 · 2013年12月31日

PCV2感染猪肺泡巨噬细胞自噬过程中miRNA差异表达谱及靶基因功能调控网络研究

国家自然科学基金

0+阅读 · 2013年12月31日

污泥掺烧过程中Cl/S/P交互作用对重金属迁移转化和脱除影响的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新疆布鲁氏菌病传播机理的动力学模型和疾病预防控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

布鲁氏菌感染巨噬细胞诱导的自噬抑制细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

城市大气重金属干湿沉降对土壤-蔬菜系统的污染效应

国家自然科学基金

0+阅读 · 2012年12月31日

不同膳食对单纯餐后高血糖型糖尿病血清游离脂肪酸谱影响的代谢组学研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于fMRI的个性化图像情感标注及其本体库研究

国家自然科学基金

0+阅读 · 2009年12月31日

多文种文档图像识别的多层次马尔可夫随机场模型研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员