对采用转移进化战略的微调模型的跨部交叉跨结构跨结构黑箱袭击 (Cross-domain Cross-architecture Black-box Attacks on Fine-tuned Models with Transferred Evolutionary Strategies)

Fine-tuning can be vulnerable to adversarial attacks. Existing works about black-box attacks on fine-tuned models (BAFT) are limited by strong assumptions. To fill the gap, we propose two novel BAFT settings, cross-domain and cross-domain cross-architecture BAFT, which only assume that (1) the target model for attacking is a fine-tuned model, and (2) the source domain data is known and accessible. To successfully attack fine-tuned models under both settings, we propose to first train an adversarial generator against the source model, which adopts an encoder-decoder architecture and maps a clean input to an adversarial example. Then we search in the low-dimensional latent space produced by the encoder of the adversarial generator. The search is conducted under the guidance of the surrogate gradient obtained from the source model. Experimental results on different domains and different network architectures demonstrate that the proposed attack method can effectively and efficiently attack the fine-tuned models.

翻译：微调很容易受到对抗性攻击。微调模型( BAFT) 黑盒攻击黑盒攻击的现有工作受到强烈的假设的限制。为了填补空白, 我们提议了两种新型的BAFT设置, 跨域和跨域跨结构跨结构BAFT, 仅假设:(1) 攻击的目标模式是一个微调模型, 以及(2) 源域数据是已知和可获取的。为了在两种情况下成功攻击微调模型, 我们提议首先训练一个对源模型的对称生成器, 该源模型采用编码- 解码器结构, 并绘制对对抗性模型的清洁输入图。然后我们搜索由对称生成的对称发电机编码器生成的低维潜在空间。搜索是在从源模型获得的代谢梯度指导下进行的。不同领域和不同网络结构的实验结果显示, 拟议的攻击方法能够有效和高效地攻击微调模型。

相关内容

MoDELS

关注 30

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

专知会员服务

36+阅读 · 2022年7月25日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

44+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

76+阅读 · 2020年7月26日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

59+阅读 · 2020年3月19日