Edge computing decentralizes computing resources, allowing for novel applications in domains such as the Internet of Things (IoT) in healthcare and agriculture by reducing latency and improving performance. This decentralization is achieved through the implementation of microservice architectures, which require low latencies to meet stringent service level agreements (SLA) such as performance, reliability, and availability metrics. While cloud computing offers the large data storage and computation resources necessary to handle peak demands, a hybrid cloud and edge environment is required to ensure SLA compliance. This is achieved by sophisticated orchestration strategies such as Kubernetes, which help facilitate resource management. The orchestration strategies alone do not guarantee SLA adherence due to the inherent delay of scaling resources. Existing auto-scaling algorithms have been proposed to address these challenges, but they suffer from performance issues and configuration complexity. In this paper, a novel auto-scaling algorithm is proposed for SLA-constrained edge computing applications. This approach combines a Machine Learning (ML) based proactive auto-scaling algorithm, capable of predicting incoming resource requests to forecast demand, with a reactive autoscaler which considers current resource utilization and SLA constraints for immediate adjustments. The algorithm is integrated into Kubernetes as an extension, and its performance is evaluated through extensive experiments in an edge environment with real applications. The results demonstrate that existing solutions have an SLA violation rate of up to 23%, whereas the proposed hybrid solution outperforms the baselines with an SLA violation rate of only 6%, ensuring stable SLA compliance across various applications.
翻译:边缘计算通过分散计算资源,在医疗保健和农业等领域的物联网应用中降低延迟并提升性能,从而支持新型应用场景。这种分布式架构通过微服务架构实现,需要低延迟以满足严格的性能、可靠性和可用性等服务水平协议指标。尽管云计算提供了处理峰值需求所需的大规模数据存储与计算资源,但为确保SLA合规性,仍需构建混合云与边缘环境。这通过Kubernetes等复杂编排策略实现,以协助资源管理。然而,由于资源扩缩固有的延迟,仅靠编排策略无法保证SLA的遵守。现有自动扩缩算法虽被提出以应对这些挑战,但仍存在性能问题和配置复杂性。本文提出一种面向SLA约束边缘计算应用的新型自动扩缩算法。该方法结合了基于机器学习的主动式自动扩缩算法(能够通过预测资源请求来预判需求)与反应式自动扩缩器(考虑当前资源利用率和SLA约束以进行即时调整)。该算法作为扩展集成至Kubernetes中,并通过在边缘环境中使用真实应用进行大量实验评估其性能。结果表明,现有解决方案的SLA违规率高达23%,而所提出的混合方案优于基线方法,SLA违规率仅为6%,确保了跨不同应用的稳定SLA合规性。