Cassandra:从对地扰动中探测铁轨网络 (Cassandra: Detecting Trojaned Networks from Adversarial Perturbations)

Deep neural networks are being widely deployed for many critical tasks due to their high classification accuracy. In many cases, pre-trained models are sourced from vendors who may have disrupted the training pipeline to insert Trojan behaviors into the models. These malicious behaviors can be triggered at the adversary's will and hence, cause a serious threat to the widespread deployment of deep models. We propose a method to verify if a pre-trained model is Trojaned or benign. Our method captures fingerprints of neural networks in the form of adversarial perturbations learned from the network gradients. Inserting backdoors into a network alters its decision boundaries which are effectively encoded in their adversarial perturbations. We train a two stream network for Trojan detection from its global ($L_\infty$ and $L_2$ bounded) perturbations and the localized region of high energy within each perturbation. The former encodes decision boundaries of the network and latter encodes the unknown trigger shape. We also propose an anomaly detection method to identify the target class in a Trojaned network. Our methods are invariant to the trigger type, trigger size, training data and network architecture. We evaluate our methods on MNIST, NIST-Round0 and NIST-Round1 datasets, with up to 1,000 pre-trained models making this the largest study to date on Trojaned network detection, and achieve over 92\% detection accuracy to set the new state-of-the-art.

翻译：深心神经网络因其高分类精确度而被广泛用于许多关键任务。在许多情况下, 预先培训的模型来自供应商, 供应商可能中断了培训管道, 将特洛伊人的行为插入模型中。这些恶意行为可以由对手的意志触发, 从而对广泛部署深心模型构成严重威胁。我们提出一个方法来核实预培训模式是否是Trojan或良型的。我们的方法是捕捉神经网络的指纹, 其形式是从网络梯度中学习的对抗性扰动。将后门插入网络改变了其决定界限, 而这些界限在对抗性扰动中被有效编码。我们从全球敌人的意志中训练两条Trojan探测的流网络, 从而对深度模型造成严重威胁。我们的方法是将网络的检测类型、大小和 2美元约束值。之前的网络决定范围, 并随后将新的触发形状编码为未知的触发形状。我们还提议了一个异常检测方法, 用来在Trojan网络中识别目标班级的准确度。我们的方法是触发型的网络,, 设置了最大规模的网络, 和最大规模的数据结构, 我们的网络, 设置了我们的数据结构,, 设置了我们的数据结构, 我们的网络, 的网络, 设置了比, 的网络, 的网络到的的的网络,, 和网络的的的的的的的的的的,,,,, 和网络, 的和网络的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的和的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的和

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

38+阅读 · 2020年7月3日