培训后检测两系列和多种组合情景的后门袭击 (Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack Scenarios) - 专知论文

会员服务 ·

0

Performer · 训练集 · 情景 · 测试样本 · state-of-the-art ·

2022 年 1 月 20 日

Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack Scenarios

翻译：培训后检测两系列和多种组合情景的后门袭击

Zhen Xiang,David J. Miller,George Kesidis

from arxiv, Accepted to ICLR2022

Backdoor attacks (BAs) are an emerging threat to deep neural network classifiers. A victim classifier will predict to an attacker-desired target class whenever a test sample is embedded with the same backdoor pattern (BP) that was used to poison the classifier's training set. Detecting whether a classifier is backdoor attacked is not easy in practice, especially when the defender is, e.g., a downstream user without access to the classifier's training set. This challenge is addressed here by a reverse-engineering defense (RED), which has been shown to yield state-of-the-art performance in several domains. However, existing REDs are not applicable when there are only {\it two classes} or when {\it multiple attacks} are present. These scenarios are first studied in the current paper, under the practical constraints that the defender neither has access to the classifier's training set nor to supervision from clean reference classifiers trained for the same domain. We propose a detection framework based on BP reverse-engineering and a novel {\it expected transferability} (ET) statistic. We show that our ET statistic is effective {\it using the same detection threshold}, irrespective of the classification domain, the attack configuration, and the BP reverse-engineering algorithm that is used. The excellent performance of our method is demonstrated on six benchmark datasets. Notably, our detection framework is also applicable to multi-class scenarios with multiple attacks.

翻译：深神经网络分类器(BAs)正在对深神经网络分类器形成威胁。受害人分类器将向攻击者渴望的目标类别预测,只要测试样品嵌入的是一种用于毒害分类员训练的相同的后门模式(BP),即用于毒害分类员训练的后门模式(BP ) 。在实践中,检测分类器是否是后门攻击并非易事, 特别是在维护者既无法获得分类器训练的下游用户, 也得不到为同一领域培训的清洁分类器的监督的情况下。我们在此提出一个反向工程防御(RED)来应对这一挑战, 它已经显示在若干领域产生最先进的性能。然而,当只有 ~ ~两个类别或存在 ~ 多次攻击时, 现有的RED 不适用。这些情况首先在本文中研究, 在实际限制下, 特别是保护者既无法获得分类器的训练, 也没有接受过为同一领域培训的清洁的分类器师培训。我们提议了一个基于BP反向- 工程和新预期的可转移性(ET) (ET) 统计, 我们展示了可应用的快速操作的系统测试的系统测试模型的系统测试, 我们的系统测试, 测试是使用BVI- 矩阵的多重攻击的快速分析方法。

0

相关内容

Performer

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

季铵生物碱的氧化石墨烯基新型萃取材料及色谱分离材料研究

国家自然科学基金

0+阅读 · 2014年12月31日

建筑生命周期评价的时间有效性研究

国家自然科学基金

1+阅读 · 2013年12月31日

周期结构中电磁反散射问题的理论与数值算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

甘肃河西走廊盐碱土壤中放线菌生态分布及物种多样性研究

国家自然科学基金

0+阅读 · 2012年12月31日

小麦抗麦红吸浆虫QTL定位与关联分析

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

高分辨率SAR图像典型地物目标样本特征提取和识别研究

国家自然科学基金

2+阅读 · 2012年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

抗原特异性和非抗原特异性CD4+CD25+ Treg细胞对Th1细胞分化、效应功能和记忆Th1细胞形成的影响

国家自然科学基金

0+阅读 · 2008年12月31日

Adversarial Scratches: Deployable Attacks to CNN Classifiers

Arxiv

0+阅读 · 2022年4月20日

Cyber-Forensic Review of Human Footprint and Gait for Personal Identification

Arxiv

0+阅读 · 2022年4月20日

Robustness Testing of Data and Knowledge Driven Anomaly Detection in Cyber-Physical Systems

Arxiv

0+阅读 · 2022年4月20日

Indiscriminate Data Poisoning Attacks on Neural Networks

Arxiv

0+阅读 · 2022年4月19日

A Novel Sybil Attack Detection Scheme Based on Edge Computing for Mobile IoT Environment

Arxiv

0+阅读 · 2022年4月19日

Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks

Arxiv

0+阅读 · 2022年4月19日

Dual-Key Multimodal Backdoors for Visual Question Answering

Arxiv

1+阅读 · 2022年4月18日

Distributed Learning of Deep Neural Networks using Independent Subnet Training

Arxiv

2+阅读 · 2022年4月18日

Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information

Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information

Arxiv

0+阅读 · 2022年4月15日

Backdoor Learning: A Survey

Arxiv

15+阅读 · 2020年10月26日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

全球AI工具市场发展现状与趋势分析2025

自动驾驶地图：全流程综述与前沿进展

协同智能体：多智能体人工智能系统如何变革军事训练及其他领域

【NeurIPS2025】TITAN：一种面向轨迹感知的大规模 VQE 自适应参数冻结技术

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Adversarial Scratches: Deployable Attacks to CNN Classifiers

Arxiv

0+阅读 · 2022年4月20日

Cyber-Forensic Review of Human Footprint and Gait for Personal Identification

Arxiv

0+阅读 · 2022年4月20日

Robustness Testing of Data and Knowledge Driven Anomaly Detection in Cyber-Physical Systems

Arxiv

0+阅读 · 2022年4月20日

Indiscriminate Data Poisoning Attacks on Neural Networks

Arxiv

0+阅读 · 2022年4月19日

A Novel Sybil Attack Detection Scheme Based on Edge Computing for Mobile IoT Environment

Arxiv

0+阅读 · 2022年4月19日

Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks

Arxiv

0+阅读 · 2022年4月19日

Dual-Key Multimodal Backdoors for Visual Question Answering

Arxiv

1+阅读 · 2022年4月18日

Distributed Learning of Deep Neural Networks using Independent Subnet Training

Arxiv

2+阅读 · 2022年4月18日

Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information

Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information

Arxiv

0+阅读 · 2022年4月15日

Backdoor Learning: A Survey

Arxiv

15+阅读 · 2020年10月26日

相关基金

季铵生物碱的氧化石墨烯基新型萃取材料及色谱分离材料研究

国家自然科学基金

0+阅读 · 2014年12月31日

建筑生命周期评价的时间有效性研究

国家自然科学基金

1+阅读 · 2013年12月31日

周期结构中电磁反散射问题的理论与数值算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

甘肃河西走廊盐碱土壤中放线菌生态分布及物种多样性研究

国家自然科学基金

0+阅读 · 2012年12月31日

小麦抗麦红吸浆虫QTL定位与关联分析

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

高分辨率SAR图像典型地物目标样本特征提取和识别研究

国家自然科学基金

2+阅读 · 2012年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

抗原特异性和非抗原特异性CD4+CD25+ Treg细胞对Th1细胞分化、效应功能和记忆Th1细胞形成的影响

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员