动态多试剂系统中的资源分配 (Resource allocation in dynamic multiagent systems)

Resource allocation and task prioritisation are key problem domains in the fields of autonomous vehicles, networking, and cloud computing. The challenge in developing efficient and robust algorithms comes from the dynamic nature of these systems, with many components communicating and interacting in complex ways. The multi-group resource allocation optimisation (MG-RAO) algorithm we present uses multiple function approximations of resource demand over time, alongside reinforcement learning techniques, to develop a novel method of optimising resource allocation in these multi-agent systems. This method is applicable where there are competing demands for shared resources, or in task prioritisation problems. Evaluation is carried out in a simulated environment containing multiple competing agents. We compare the new algorithm to an approach where child agents distribute their resources uniformly across all the tasks they can be allocated. We also contrast the performance of the algorithm where resource allocation is modelled separately for groups of agents, as to being modelled jointly over all agents. The MG-RAO algorithm shows a 23 - 28% improvement over fixed resource allocation in the simulated environments. Results also show that, in a volatile system, using the MG-RAO algorithm configured so that child agents model resource allocation for all agents as a whole has 46.5% of the performance of when it is set to model multiple groups of agents. These results demonstrate the ability of the algorithm to solve resource allocation problems in multi-agent systems and to perform well in dynamic environments.

翻译：在自主车辆、网络和云计算领域,资源分配和任务优先排序是关键的问题领域。开发高效和稳健的算法的挑战来自这些系统的动态性质,这些系统有许多组成部分以复杂的方式进行沟通和互动。我们介绍的多组资源分配优化算法(MG-RAO),我们采用的多组资源分配优化算法(MG-RAO),与强化学习技术一道,对资源需求进行多种功能性比对,以制定在这些多试剂系统中优化资源分配的新方法。这种方法适用于对共享资源或任务优先排序问题有相互竞争需求的领域。评价是在模拟环境中进行的,在模拟环境中进行,在包含多个竞争代理人的模拟环境中进行。我们比较新的算法,让儿童代理人在可分配的所有任务中统一分配资源。我们还比较了在资源配置上分别针对各种代理人的多种功能近似比值,即对所有代理人进行模拟环境中的固定资源分配比重23-28%的改进。结果还显示,在一个变化的系统中,使用MG-RAO的算法,对多种竞争代理人进行了模拟的计算,从而将儿童代理人的资源分配能力分配模式和所有代理人的多种代理人的资源分配结果都显示,所有代理人的计算结果都显示整个业绩。