AI agent-based systems are becoming increasingly integral to modern software architectures, enabling autonomous decision-making, dynamic task execution, and multimodal interactions through large language models (LLMs). However, these systems introduce novel and evolving security challenges, including prompt injection attacks, context poisoning, model manipulation, and opaque agent-to-agent communication, that are not effectively captured by traditional threat modeling frameworks. In this paper, we introduce ASTRIDE, an automated threat modeling platform purpose-built for AI agent-based systems. ASTRIDE extends the classical STRIDE framework by introducing a new threat category, A for AI Agent-Specific Attacks, which encompasses emerging vulnerabilities such as prompt injection, unsafe tool invocation, and reasoning subversion, unique to agent-based applications. To automate threat modeling, ASTRIDE combines a consortium of fine-tuned vision-language models (VLMs) with the OpenAI-gpt-oss reasoning LLM to perform end-to-end analysis directly from visual agent architecture diagrams, such as data flow diagrams(DFDs). LLM agents orchestrate the end-to-end threat modeling automation process by coordinating interactions between the VLM consortium and the reasoning LLM. Our evaluations demonstrate that ASTRIDE provides accurate, scalable, and explainable threat modeling for next-generation intelligent systems. To the best of our knowledge, ASTRIDE is the first framework to both extend STRIDE with AI-specific threats and integrate fine-tuned VLMs with a reasoning LLM to fully automate diagram-driven threat modeling in AI agent-based applications.
翻译:基于AI智能体的系统正日益成为现代软件架构的核心组成部分,其通过大型语言模型(LLMs)实现了自主决策、动态任务执行与多模态交互。然而,这些系统引入了新颖且不断演进的安全挑战,包括提示注入攻击、上下文污染、模型操纵以及不透明的智能体间通信,这些挑战无法被传统威胁建模框架有效捕捉。本文提出ASTRIDE,一个专为基于AI智能体的系统设计的自动化威胁建模平台。ASTRIDE通过引入新的威胁类别——A(AI智能体特定攻击),扩展了经典的STRIDE框架。该类别涵盖了智能体应用特有的新兴漏洞,如提示注入、不安全工具调用和推理颠覆。为实现威胁建模的自动化,ASTRIDE结合了微调的视觉语言模型(VLMs)联盟与OpenAI-gpt-oss推理LLM,直接从视觉化智能体架构图(如数据流图DFDs)进行端到端分析。LLM智能体通过协调VLM联盟与推理LLM之间的交互,编排端到端的威胁建模自动化流程。评估结果表明,ASTRIDE为下一代智能系统提供了准确、可扩展且可解释的威胁建模。据我们所知,ASTRIDE是首个既通过AI特定威胁扩展STRIDE,又集成微调VLMs与推理LLM以在基于AI智能体的应用中实现图驱动威胁建模全自动化的框架。