Federated Learning (FL) has emerged as a privacy-preserving paradigm for training machine learning models across distributed edge devices in the Internet of Things (IoT). By keeping data local and coordinating model training through a central server, FL effectively addresses privacy concerns and reduces communication overhead. However, the limited computational power, memory, and bandwidth of IoT edge devices pose significant challenges to the efficiency and scalability of FL, especially when training deep neural networks. Various FL frameworks have been proposed to reduce computation and communication overheads through dropout or layer freezing. However, these approaches often sacrifice accuracy or neglect memory constraints. To this end, in this work, we introduce Federated Learning with Ordered Layer Freezing (FedOLF). FedOLF consistently freezes layers in a predefined order before training, significantly mitigating computation and memory requirements. To further reduce communication and energy costs, we incorporate Tensor Operation Approximation (TOA), a lightweight alternative to conventional quantization that better preserves model accuracy. Experimental results demonstrate that over non-iid data, FedOLF achieves at least 0.3%, 6.4%, 5.81%, 4.4%, 6.27% and 1.29% higher accuracy than existing works respectively on EMNIST (with CNN), CIFAR-10 (with AlexNet), CIFAR-100 (with ResNet20 and ResNet44), and CINIC-10 (with ResNet20 and ResNet44), along with higher energy efficiency and lower memory footprint.
翻译:联邦学习作为一种隐私保护范式,已在物联网分布式边缘设备上的机器学习模型训练中得到广泛应用。通过将数据保留在本地并通过中央服务器协调模型训练,联邦学习有效解决了隐私问题并降低了通信开销。然而,物联网边缘设备有限的计算能力、内存和带宽对联邦学习的效率和可扩展性构成了重大挑战,尤其是在训练深度神经网络时。已有多种联邦学习框架通过丢弃或层冻结技术来降低计算和通信开销,但这些方法往往以牺牲精度为代价或忽略了内存限制。为此,本研究提出了基于有序层冻结的联邦学习方法。该方法在训练前按照预定顺序持续冻结网络层,显著降低了计算和内存需求。为进一步减少通信和能耗,我们引入了张量操作近似技术——这是一种轻量化的传统量化替代方案,能更好地保持模型精度。实验结果表明,在非独立同分布数据上,相较于现有方法,FedOLF在EMNIST数据集(使用CNN)、CIFAR-10数据集(使用AlexNet)、CIFAR-100数据集(使用ResNet20和ResNet44)以及CINIC-10数据集(使用ResNet20和ResNet44)上分别实现了至少0.3%、6.4%、5.81%、4.4%、6.27%和1.29%的精度提升,同时具有更高的能效和更低的内存占用。