Ensuring the reliability of agent architectures and effectively identifying problematic agents when failures occur are crucial challenges in multi-agent systems (MAS). Advances in large language models (LLMs) have established LLM-based agents as a major branch of MAS, enabling major breakthroughs in complex problem solving and world modeling. However, the reliability implications of this shift remain largely unexplored. i.e., whether substituting traditional agents with LLM-based agents can effectively enhance the reliability of MAS. In this work, we investigate and quantify the reliability of LLM-based agents from the perspective of Byzantine fault tolerance. We observe that LLM-based agents demonstrate stronger skepticism when processing erroneous message flows, a characteristic that enables them to outperform traditional agents across different topological structures. Motivated by the results of the pilot experiment, we design CP-WBFT, a confidence probe-based weighted Byzantine Fault Tolerant consensus mechanism to enhance the stability of MAS with different topologies. It capitalizes on the intrinsic reflective and discriminative capabilities of LLMs by employing a probe-based, weighted information flow transmission method to improve the reliability of LLM-based agents. Extensive experiments demonstrate that CP-WBFT achieves superior performance across diverse network topologies under extreme Byzantine conditions (85.7\% fault rate). Notably, our approach surpasses traditional methods by attaining remarkable accuracy on various topologies and maintaining strong reliability in both mathematical reasoning and safety assessment tasks.
翻译:确保智能体架构的可靠性,并在发生故障时有效识别问题智能体,是多智能体系统(MAS)面临的关键挑战。大型语言模型(LLM)的进展使基于LLM的智能体成为MAS的重要分支,在复杂问题求解与世界建模方面实现了重大突破。然而,这一转变对可靠性的影响尚未得到充分探索,即用基于LLM的智能体替代传统智能体是否能有效提升MAS的可靠性。本研究从拜占庭容错的角度,对基于LLM的智能体的可靠性进行了系统性调查与量化分析。我们观察到,基于LLM的智能体在处理错误信息流时表现出更强的怀疑倾向,这一特性使其在不同拓扑结构中均能超越传统智能体的表现。基于初步实验结果,我们设计了CP-WBFT——一种基于置信度探测的加权拜占庭容错共识机制,以提升不同拓扑结构下MAS的稳定性。该机制通过采用基于探测的加权信息流传输方法,充分利用LLM固有的反思与判别能力,从而增强基于LLM的智能体的可靠性。大量实验表明,在极端拜占庭故障条件(85.7%故障率)下,CP-WBFT能在多样化网络拓扑中实现卓越性能。值得注意的是,该方法在各类拓扑结构上均取得显著准确度,在数学推理与安全评估任务中均保持强可靠性,全面超越了传统方法。