Are Large Language Models (LLMs) a new form of strategic intelligence, able to reason about goals in competitive settings? We present compelling supporting evidence. The Iterated Prisoner's Dilemma (IPD) has long served as a model for studying decision-making. We conduct the first ever series of evolutionary IPD tournaments, pitting canonical strategies (e.g., Tit-for-Tat, Grim Trigger) against agents from the leading frontier AI companies OpenAI, Google, and Anthropic. By varying the termination probability in each tournament (the "shadow of the future"), we introduce complexity and chance, confounding memorisation. Our results show that LLMs are highly competitive, consistently surviving and sometimes even proliferating in these complex ecosystems. Furthermore, they exhibit distinctive and persistent "strategic fingerprints": Google's Gemini models proved strategically ruthless, exploiting cooperative opponents and retaliating against defectors, while OpenAI's models remained highly cooperative, a trait that proved catastrophic in hostile environments. Anthropic's Claude emerged as the most forgiving reciprocator, showing remarkable willingness to restore cooperation even after being exploited or successfully defecting. Analysis of nearly 32,000 prose rationales provided by the models reveals that they actively reason about both the time horizon and their opponent's likely strategy, and we demonstrate that this reasoning is instrumental to their decisions. This work connects classic game theory with machine psychology, offering a rich and granular view of algorithmic decision-making under uncertainty.
翻译:大型语言模型(LLMs)是否构成一种新型的策略智能,能够在竞争性环境中对目标进行推理?我们提出了有力的支持证据。迭代囚徒困境(IPD)长期以来一直是研究决策行为的经典模型。我们首次开展了一系列演化IPD锦标赛,将经典策略(如“以牙还牙”、“冷酷触发”)与来自前沿AI公司OpenAI、Google和Anthropic的智能体进行对抗。通过改变每场锦标赛的终止概率(即“未来的阴影”),我们引入了复杂性与随机性,从而避免了单纯记忆的影响。结果表明,LLMs展现出高度的竞争力,能够持续在这些复杂生态系统中存活,有时甚至实现种群扩张。此外,它们表现出独特且持久的“策略指纹”:Google的Gemini模型在策略上表现出冷酷性,善于利用合作对手并对背叛者实施报复;而OpenAI的模型则保持高度合作性,这一特性在敌对环境中被证明是灾难性的。Anthropic的Claude则成为最具宽容性的互惠者,即使在遭受剥削或成功背叛后,仍表现出显著的合作恢复意愿。通过对模型提供的近32,000条文本推理进行分析,我们发现它们能够主动对时间跨度和对手可能策略进行推理,并证明这种推理对其决策具有关键作用。本研究将经典博弈论与机器心理学相连接,为不确定性下的算法决策提供了丰富而细致的观察视角。