基于拍卖任务分配和行为树的反应式多智能体协作 (Reactive Multi-agent Coordination using Auction-based Task Allocation and Behavior Trees)

This article presents an architecture for multi-agent task allocation and task execution, through the unification of a market-inspired task-auctioning system with Behavior Trees for managing and executing lower level behaviors. We consider the scenario with multi-stage tasks, such as 'pick and place', whose arrival times are not known a priori. In such a scenario, a coordinating architecture is expected to be reactive to newly arrived tasks and the resulting rerouting of agents should be dependent on the stage of completion of their current multi-stage tasks. In the novel architecture proposed in this article, a central auctioning system gathers bids (cost-estimates for completing currently available tasks) from all agents, and solves a combinatorial problem to optimally assign tasks to agents. For every agent, it's participation in the auctioning system and execution of an assigned multi-stage task is managed using behavior trees, which switch among several well-defined behaviors in response to changing scenarios. The auctioning system is run at a fixed rate, allowing for newly added tasks to be incorporated into the auctioning system, which makes the solution reactive and allows for the rerouting of some agents (subject to the states of the behavior trees). We demonstrate that the proposed architecture is especially well-suited for multi-stage tasks, where high costs are incurred when rerouting agents who have completed one or more stages of their current tasks. The scalability analysis of the proposed architecture reveals that it scales well with the number of agents and number of tasks. The proposed framework is experimentally validated in multiple scenarios in a lab environment. A video of a demonstration can be viewed at: https://youtu.be/ZdEkoOOlB2g}.

翻译：本文提出了一种用于多智能体任务分配和任务执行的架构，通过将市场启发式的任务拍卖系统与用于管理和执行低层行为的行为树统一起来。我们考虑到了多阶段任务的情况，例如“取放”，其到达时间不是事先知道的。在这种情况下，预计协调架构应对新到达的任务做出反应，根据它们当前的多阶段任务的完成阶段，其结果将取决于其当前多阶段任务的完成阶段，以重新路由智能体。本文提出的新架构，一个中央的拍卖系统收集所有智能体的出价（完成当前可用任务的成本估计），并解决一个组合问题，以将任务最优地分配给智能体。对于每个智能体，它参与拍卖系统和执行分配的多阶段任务都是使用行为树管理的，行为树在响应于不断变化的情况下在几个定义良好的行为中切换。拍卖系统以固定的速率运行，允许新添加的任务纳入拍卖系统中，从而使解决方案具有反应性，并允许重新路由某些智能体（取决于行为树的状态）。我们证明，所提出的架构特别适用于多阶段任务，其中在完成一个或多个阶段的任务之后重新路由智能体会产生高额成本。该提议的架构的可扩展性分析表明，它随着智能体数量和任务数量的增加而扩展。所提出的框架已在实验室环境中的多个场景中进行了实验验证。演示视频可在以下链接中查看：https://youtu.be/ZdEkoOOlB2g。