Availability Attacks Against Neural Network Certifiers Based on Backdoors - 专知 Papers

2022 年 10 月 2 日

Availability Attacks Against Neural Network Certifiers Based on Backdoors

Tobias Lorenz, Marta Kwiatkowska, Mario Fritz

To achieve reliable, robust, and safe AI systems, it is important to implement fallback strategies for cases where AI predictions cannot be trusted. Certifiers for neural networks are a reliable way to check the robustness of these predictions. For some predictions, they guarantee that a certain class of manipulations or attacks could not have changed the outcome. For the remaining predictions without guarantees, the method abstains from making a prediction and a fallback strategy needs to be invoked, which typically incurs additional costs, can require a human operator, or may even fail to provide any prediction. While this is a key concept towards safe and secure AI, we show for the first time that this approach comes with its own security risks, as such fallback strategies can be deliberately triggered by an adversary. Using training-time attacks, the adversary can significantly reduce the certified robustness of the model, making it unavailable. This transfers the main system load onto the fallback, reducing the overall system's integrity and availability. We design two novel backdoor attacks which show the practical relevance of these threats. For example, adding 1% poisoned data during training is sufficient to reduce certified robustness by up to 95 percentage points. Our extensive experiments across multiple datasets, model architectures, and certifiers demonstrate the wide applicability of these attacks. A first investigation into potential defenses shows that current approaches are insufficient to mitigate the issue, highlighting the need for new, more specific solutions.
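The certify-or-abstain pipeline described in the abstract can be illustrated with a toy sketch. This is not the paper's method or any of the certifiers it evaluates: it assumes a plain linear classifier, for which robustness to L-infinity perturbations of size `eps` can be checked exactly, and the names `certify_or_abstain` and `serve` are hypothetical. It only shows how uncertified inputs are routed to a fallback, and how the share of such inputs (the fallback load an availability attack would try to inflate) can be measured.

```python
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def logits(weights: List[Vector], biases: Vector, x: Vector) -> List[float]:
    """Scores of a linear classifier: one weight row and one bias per class."""
    return [sum(w_i * x_i for w_i, x_i in zip(w, x)) + b
            for w, b in zip(weights, biases)]


def certify_or_abstain(weights: List[Vector], biases: Vector,
                       x: Vector, eps: float) -> Optional[int]:
    """Return the predicted class if it provably cannot change for any input
    within L-infinity distance eps of x; otherwise return None (abstain)."""
    scores = logits(weights, biases, x)
    pred = max(range(len(scores)), key=scores.__getitem__)
    for other in range(len(scores)):
        if other == pred:
            continue
        # For a linear model, the worst-case loss of margin under an
        # eps-bounded perturbation is eps times the L1 norm of the
        # difference between the two weight rows.
        l1 = sum(abs(a - b) for a, b in zip(weights[pred], weights[other]))
        if scores[pred] - scores[other] - eps * l1 <= 0:
            return None  # no guarantee: hand over to the fallback
    return pred


def serve(weights: List[Vector], biases: Vector,
          inputs: List[Vector], eps: float) -> Tuple[List[Optional[int]], float]:
    """Answer certified inputs directly and report the fraction of inputs
    that had to be sent to the (costly) fallback."""
    decisions = [certify_or_abstain(weights, biases, x, eps) for x in inputs]
    fallback_load = sum(d is None for d in decisions) / len(decisions)
    return decisions, fallback_load


if __name__ == "__main__":
    # Hypothetical two-class model and three inputs, chosen for illustration.
    W = [[1.0, 0.0], [0.0, 1.0]]
    b = [0.0, 0.0]
    xs = [[2.0, 0.0], [0.55, 0.45], [0.0, 3.0]]
    decisions, load = serve(W, b, xs, eps=0.1)
    print(decisions, load)
```

In this toy setting, the second input lies too close to the decision boundary to be certified at `eps=0.1`, so it is the only one routed to the fallback. An attack of the kind the abstract describes would aim to push many more inputs into that abstaining case.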
