The rapid proliferation of Multimodal Large Language Models (MLLMs) has introduced unprecedented security challenges, particularly for phishing detection in academic environments. Academic institutions and researchers are high-value targets, facing dynamic, multilingual, and context-dependent threats that leverage research backgrounds, academic collaborations, and personal information to craft highly tailored attacks. Existing security benchmarks largely rely on datasets that lack academic background information, making them inadequate for capturing the evolving attack patterns and human-centric vulnerability factors unique to academia. To address this gap, we present AdapT-Bench, a unified methodological framework and benchmark suite for systematically evaluating MLLM defense capabilities against dynamic phishing attacks in academic settings.