Finding near-optimal solutions for dense multi-agent pathfinding (MAPF) problems in real-time remains challenging even for state-of-the-art planners. To this end, we develop a hybrid framework that integrates a learned heuristic derived from MAGAT, a neural MAPF policy with a graph attention scheme, into a leading search-based algorithm, LaCAM. While prior work has explored learning-guided search in MAPF, such methods have historically underperformed. In contrast, our approach, termed LaGAT, outperforms both purely search-based and purely learning-based methods in dense scenarios. This is achieved through an enhanced MAGAT architecture, a pre-train-then-fine-tune strategy on maps of interest, and a deadlock detection scheme to account for imperfect neural guidance. Our results demonstrate that, when carefully designed, hybrid search offers a powerful solution for tightly coupled, challenging multi-agent coordination problems.
翻译:即使在最先进的规划器中,实时求解密集多智能体路径规划(MAPF)问题的近似最优解仍具挑战性。为此,我们开发了一种混合框架,将源自MAGAT(一种采用图注意力机制的神经MAPF策略)的学习启发式,集成到领先的基于搜索的算法LaCAM中。尽管先前工作已探索过MAPF中的学习引导搜索,但此类方法历来表现不佳。相比之下,我们的方法(称为LaGAT)在密集场景中优于纯搜索方法和纯学习方法。这得益于改进的MAGAT架构、在目标地图上采用的预训练-微调策略,以及为应对不完美的神经引导而设计的死锁检测机制。我们的结果表明,经过精心设计后,混合搜索为紧密耦合、具有挑战性的多智能体协调问题提供了一种强大的解决方案。