Cellular networks are comprised of software-based entities, with main functions encapsulated as Virtual Network Functions (VNFs) deployed on Commercial-off-the-Shelf (COTS) hardware. As a key enabler of 5G, network slicing offers logically isolated Quality of Service (QoS) for diverse use cases. With the transition to cloud-native infrastructures, optimizing network slice placement across multi-cloud environments remains challenging due to heterogeneous resource capabilities and varying slice-specific demands. This paper presents SlicePilot, a modular framework that enables autonomous and near-optimal VNF placement using a disaggregated Multi-Agent Reinforcement Learning (MARL) approach. SlicePilot collects real-world traffic profiles to estimate resource needs for each slice type. These estimates guide a MARL-based scheduler that minimizes deployment costs while satisfying QoS constraints. We evaluate SlicePilot on a multi-cloud testbed and demonstrate a 19x speed-up over combinatorial optimization methods, while keeping deployment costs within 7.8% of the optimal. Although SlicePilot results in 2.42x more QoS violations under high-load conditions, this trade-off is offset by faster decision-making and reduced computational overhead. Overall, SlicePilot delivers a scalable, cost-efficient solution for network slice placement, making it suitable for real-time deployments where responsiveness and efficiency are critical.
翻译:蜂窝网络由基于软件的实体构成,其主要功能被封装为虚拟网络功能(VNF),部署在商用现成硬件上。作为5G的关键使能技术,网络切片为多样化用例提供逻辑隔离的服务质量保证。随着向云原生基础设施的过渡,由于异构资源能力和不同切片特定需求的差异,在多云环境中优化网络切片部署仍具挑战性。本文提出SlicePilot,一种模块化框架,采用解耦的多智能体强化学习方法实现自主且接近最优的VNF部署。SlicePilot收集实际流量特征以估计每种切片类型的资源需求,这些估计指导基于MARL的调度器在满足QoS约束的同时最小化部署成本。我们在多云测试平台上评估SlicePilot,结果显示其比组合优化方法加速19倍,同时将部署成本控制在最优解的7.8%以内。尽管在高负载条件下SlicePilot会导致QoS违规增加2.42倍,但这一权衡被更快的决策速度和降低的计算开销所抵消。总体而言,SlicePilot为网络切片部署提供了可扩展且成本高效的解决方案,适用于对响应速度和效率有严格要求的实时部署场景。