In an era where vast amounts of data are collected and processed from diverse sources, there is a growing demand for sophisticated AI systems capable of intelligently fusing and analyzing this information. To address these challenges, researchers have turned towards integrating tools into LLM-powered agents to enhance the overall information fusion process. However, the conjunction of these technologies and the proposed enhancements in several state-of-the-art works followed a non-unified software architecture, resulting in a lack of modularity and terminological inconsistencies among researchers. To address these issues, we propose a novel LLM-based Agent Unified Modeling Framework (LLM-Agent-UMF) that establishes a clear foundation for agent development from both functional and software architectural perspectives, developed and evaluated using the Architecture Tradeoff and Risk Analysis Framework (ATRAF). Our framework clearly distinguishes between the different components of an LLM-based agent, setting LLMs and tools apart from a new element, the core-agent, which plays the role of central coordinator. This pivotal entity comprises five modules: planning, memory, profile, action, and security -- the latter often neglected in previous works. By classifying core-agents into passive and active types based on their authoritative natures, we propose various multi-core agent architectures that combine unique characteristics of distinctive agents to tackle complex tasks more efficiently. We evaluate our framework by applying it to thirteen state-of-the-art agents, thereby demonstrating its alignment with their functionalities and clarifying overlooked architectural aspects. Moreover, we thoroughly assess five architecture variants of our framework by designing new agent architectures that combine characteristics of state-of-the-art agents to address specific goals. ...
翻译:在一个从多样化来源收集和处理海量数据的时代,对能够智能融合与分析这些信息的复杂人工智能系统的需求日益增长。为应对这些挑战,研究者们开始将工具集成到基于大语言模型(LLM)的智能体中,以增强整体信息融合过程。然而,这些技术的结合以及若干前沿工作中提出的改进遵循了非统一的软件架构,导致模块化程度不足以及研究者间的术语不一致。为解决这些问题,我们提出了一种新颖的基于LLM的智能体统一建模框架(LLM-Agent-UMF),该框架从功能和软件架构两个视角为智能体开发奠定了清晰的基础,并使用架构权衡与风险分析框架(ATRAF)进行开发和评估。我们的框架明确区分了基于LLM的智能体的不同组件,将LLM和工具与一个新元素——核心智能体区分开来,后者扮演着中央协调者的角色。这一关键实体包含五个模块:规划、记忆、档案、行动和安全——其中安全模块在以往工作中常被忽视。通过根据核心智能体的权威性质将其分为被动型和主动型,我们提出了多种多核心智能体架构,这些架构结合了不同智能体的独特特性,以更高效地处理复杂任务。我们通过将该框架应用于十三个前沿智能体来评估其有效性,从而证明其与这些智能体功能的一致性,并阐明了被忽视的架构方面。此外,我们通过设计结合前沿智能体特性以解决特定目标的新智能体架构,对我们的框架的五种架构变体进行了全面评估。