Backdoor Cleansing with Unlabeled Data (专知论文)

Unlabeled data · DNN · Attacks · Backdoor attacks · Initialization

April 6, 2023

Backdoor Cleansing with Unlabeled Data

Lu Pang, Tao Sun, Haibin Ling, Chao Chen

Due to the increasing computational demand of Deep Neural Networks (DNNs), companies and organizations have begun to outsource the training process. However, externally trained DNNs may have been backdoor attacked. It is crucial to defend against such attacks, i.e., to postprocess a suspicious model so that its backdoor behavior is mitigated while its normal prediction power on clean inputs remains uncompromised. To remove the abnormal backdoor behavior, existing methods mostly rely on additional labeled clean samples. However, such a requirement may be unrealistic, as the training data are often unavailable to end users. In this paper, we investigate the possibility of circumventing this barrier. We propose a novel defense method that does not require training labels. Through a carefully designed layer-wise weight re-initialization and knowledge distillation, our method can effectively cleanse backdoor behaviors of a suspicious network with negligible compromise in its normal behavior. In experiments, we show that our method, trained without labels, is on par with state-of-the-art defense methods trained using labels. We also observe promising defense results even on out-of-distribution data, which makes our method very practical. Code is available at: https://github.com/luluppang/BCU.
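The abstract names two ingredients: layer-wise weight re-initialization and knowledge distillation. The toy sketch below illustrates both ideas in plain Python. It is not the authors' implementation: the flat weight lists, the Gaussian redraw, the per-layer `ratios` schedule, the temperature, and all function names are assumptions made for illustration. The point it shows is that the distillation loss compares teacher and student outputs only, so no labels are needed.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened outputs; uses no labels."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    # Clamp q to avoid division by zero if a probability underflows.
    return sum(pi * math.log(pi / max(qi, 1e-12)) for pi, qi in zip(p, q) if pi > 0.0)


def reinitialize_layers(layers, ratios, rng, scale=0.1):
    """Copy `layers`, redrawing a random fraction of each layer's weights.

    `ratios[k]` is the fraction of layer k that is redrawn from N(0, scale^2).
    The original `layers` are left unchanged.
    """
    student = []
    for weights, ratio in zip(layers, ratios):
        count = round(ratio * len(weights))
        chosen = set(rng.sample(range(len(weights)), count))
        student.append([
            rng.gauss(0.0, scale) if i in chosen else w
            for i, w in enumerate(weights)
        ])
    return student


# Hypothetical suspicious model (teacher): two "layers" stored as flat weight lists.
teacher = [[0.5, -0.2, 0.1, 0.9], [1.5, -0.7, 0.3, 0.0]]
# Assumed schedule: keep the first layer, redraw half of the last one.
student = reinitialize_layers(teacher, ratios=[0.0, 0.5], rng=random.Random(0))

# The loss is zero when outputs match and positive when they differ.
same = distillation_loss([2.0, 0.5, -1.0], [2.0, 0.5, -1.0])
different = distillation_loss([2.0, 0.5, -1.0], [0.0, 0.0, 0.0])
print(same, different)
```

In a real defense the student would then be trained to minimize this loss on unlabeled inputs, with the teacher's outputs as targets. The training loop is omitted here because it depends on the network and framework in use.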
