Federated Learning (FL), despite its impressive ability to train multiple models in a decentralized manner, has been shown to produce a final model that is not necessarily well-suited to each client's needs. While extensive work has studied how to create tailored personalized models, a direction known as Personalized Federated Learning (PFL), less attention has been given to personalization via fine-tuning of foundation models with multi-task and multi-modal properties. Moreover, the literature lacks an understanding of how to fine-tune and personalize such models in settings that are heterogeneous across clients not only in data, but also in tasks and modalities. To address this gap, we propose TAP (Two-Stage Adaptive Personalization), which (i) leverages the mismatched model architectures between clients and the server to selectively perform replacement operations when doing so benefits a client's local tasks, and (ii) engages in post-FL knowledge distillation to capture beneficial general knowledge without compromising personalization. We also provide the first convergence analysis of the server model under its modality-task pair architecture, and demonstrate that as the number of modality-task pairs increases, its ability to cater to all tasks degrades. Through extensive experiments across a variety of datasets and tasks, we demonstrate the effectiveness of our proposed algorithm in comparison to a multitude of baselines. Implementation code is publicly available at https://github.com/lee3296/TAP.