Federated Fine-Tuning (FFT) has attracted growing interest as it leverages both server- and client-side data to enhance global model generalization while preserving privacy, and significantly reduces the computational burden on edge devices by avoiding training from scratch. Despite these advantages, FFT performance is often degraded by unreliable server-client connections and heterogeneous client data distributions. Most existing methods assume homogeneous network conditions or require prior knowledge of connection failures. However, these assumptions are impractical in real-world networks characterized by diverse communication standards (e.g., wired, Wi-Fi, 4G, and 5G) and heterogeneous failure patterns. To address these limitations, we propose FedAuto, a novel FFT framework that mitigates the combined effects of connection failures and data heterogeneity via adaptive aggregation. FedAuto operates without prior knowledge of network conditions or modifications to existing infrastructure, enabling seamless plug-and-play deployment. Moreover, we establish a rigorous convergence guarantee for FedAuto. By adopting a novel per-round aggregation perspective, our analysis removes the need for assumptions on connection failures probabilities or client selection strategies commonly imposed in prior work, and guarantees convergence of FedAuto for each individual realization, providing a stronger theoretical assurance. Extensive experiments demonstrate that FedAuto consistently outperforms state-of-the-art baselines under diverse connection failure scenarios for both full-parameter and partial-parameter fine-tuning (e.g., LoRA), and even surpasses strategies that rely on complex communication resource optimization.
翻译:联邦微调(FFT)通过协同利用服务器端与客户端数据来增强全局模型泛化能力,同时保护数据隐私,并避免从零训练从而显著降低边缘设备计算负担,因此受到日益广泛的关注。尽管具备这些优势,FFT的性能常因服务器-客户端连接不可靠及客户端数据分布异构而下降。现有方法大多假设网络条件同构或需要连接故障的先验知识,然而这些假设在现实网络中并不适用,因为实际网络通常具有多样化的通信标准(如有线、Wi-Fi、4G和5G)与异构的故障模式。为克服这些局限,本文提出FedAuto——一种通过自适应聚合来缓解连接故障与数据异构性联合影响的新型FFT框架。FedAuto无需网络条件先验知识,也无需修改现有基础设施,可实现即插即用的无缝部署。此外,我们为FedAuto建立了严格的收敛性保证。通过采用新颖的每轮聚合视角,我们的分析无需先前工作中常设的连接故障概率或客户端选择策略假设,并保证FedAuto在每次独立实现中均收敛,提供了更强的理论保证。大量实验表明,在全参数微调与部分参数微调(如LoRA)中,FedAuto在多种连接故障场景下始终优于最先进的基线方法,甚至超越了依赖复杂通信资源优化的策略。