ASTRIDE：面向智能体AI应用的安全威胁建模平台 (ASTRIDE: A Security Threat Modeling Platform for Agentic-AI Applications)

Eranga Bandara,Amin Hass,Ross Gore,Sachin Shetty,Ravi Mukkamala,Safdar H. Bouk,Xueping Liang,Ng Wee Keong,Kasun De Zoysa,Aruna Withanage,Nilaan Loganathan

AI agent-based systems are becoming increasingly integral to modern software architectures, enabling autonomous decision-making, dynamic task execution, and multimodal interactions through large language models (LLMs). However, these systems introduce novel and evolving security challenges, including prompt injection attacks, context poisoning, model manipulation, and opaque agent-to-agent communication, that are not effectively captured by traditional threat modeling frameworks. In this paper, we introduce ASTRIDE, an automated threat modeling platform purpose-built for AI agent-based systems. ASTRIDE extends the classical STRIDE framework by introducing a new threat category, A for AI Agent-Specific Attacks, which encompasses emerging vulnerabilities such as prompt injection, unsafe tool invocation, and reasoning subversion, unique to agent-based applications. To automate threat modeling, ASTRIDE combines a consortium of fine-tuned vision-language models (VLMs) with the OpenAI-gpt-oss reasoning LLM to perform end-to-end analysis directly from visual agent architecture diagrams, such as data flow diagrams(DFDs). LLM agents orchestrate the end-to-end threat modeling automation process by coordinating interactions between the VLM consortium and the reasoning LLM. Our evaluations demonstrate that ASTRIDE provides accurate, scalable, and explainable threat modeling for next-generation intelligent systems. To the best of our knowledge, ASTRIDE is the first framework to both extend STRIDE with AI-specific threats and integrate fine-tuned VLMs with a reasoning LLM to fully automate diagram-driven threat modeling in AI agent-based applications.

翻译：基于AI智能体的系统正日益成为现代软件架构的核心组成部分，其通过大型语言模型（LLMs）实现了自主决策、动态任务执行与多模态交互。然而，这些系统引入了新颖且不断演进的安全挑战，包括提示注入攻击、上下文污染、模型操纵以及不透明的智能体间通信，这些挑战无法被传统威胁建模框架有效捕捉。本文提出ASTRIDE，一个专为基于AI智能体的系统设计的自动化威胁建模平台。ASTRIDE通过引入新的威胁类别——A（AI智能体特定攻击），扩展了经典的STRIDE框架。该类别涵盖了智能体应用特有的新兴漏洞，如提示注入、不安全工具调用和推理颠覆。为实现威胁建模的自动化，ASTRIDE结合了微调的视觉语言模型（VLMs）联盟与OpenAI-gpt-oss推理LLM，直接从视觉化智能体架构图（如数据流图DFDs）进行端到端分析。LLM智能体通过协调VLM联盟与推理LLM之间的交互，编排端到端的威胁建模自动化流程。评估结果表明，ASTRIDE为下一代智能系统提供了准确、可扩展且可解释的威胁建模。据我们所知，ASTRIDE是首个既通过AI特定威胁扩展STRIDE，又集成微调VLMs与推理LLM以在基于AI智能体的应用中实现图驱动威胁建模全自动化的框架。

相关内容

关注 7073

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【AAAI2026】TOFA：面向视觉-语言模型的免训练一次性联邦自适应方法

专知会员服务

12+阅读 · 11月23日

面向具身操作的高效视觉–语言–动作模型：系统综述

专知会员服务

21+阅读 · 10月22日

美海军《表征军事领域的新奇性》开发和评估对新事物具有鲁棒性的智能体；DARPA人工智能科学和开放世界新事物学习（SAIL-ON）项目

专知会员服务

31+阅读 · 2023年3月1日

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日