Federated learning (FL) is an appealing approach for training Neural Networks (NN) in a distributed fashion while keeping data private. With the industrialization of the FL framework, we identify several problems hampering its successful deployment, such as the presence of non-i.i.d. data, disjoint classes, and signal multi-modality across datasets. In this work, we address these problems by proposing a novel method that not only (1) aggregates generic model parameters (e.g., a common set of task-generic NN layers) on the server, as in traditional FL, but also (2) keeps a set of parameters (e.g., a set of task-specific NN layers) specific to each client. We validate our method on commonly used public benchmarks (e.g., Femnist) as well as on a proprietary dataset we collected (i.e., traffic classification). Results show the benefit of our method, with a significant advantage in extreme cases.
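To make the generic/specific split concrete, the following is a minimal sketch (not the authors' code) of the aggregation scheme described above: parameters whose names start with "base." stand in for the task-generic layers averaged on the server, while "head." parameters stand in for the task-specific layers kept on each client. All names (base/head, aggregate_round, apply_global) are illustrative assumptions.

```python
import numpy as np

def aggregate_round(client_states):
    """Average only the shared 'base.*' parameters across clients (FedAvg-style)."""
    shared_keys = [k for k in client_states[0] if k.startswith("base.")]
    return {
        k: np.mean([state[k] for state in client_states], axis=0)
        for k in shared_keys
    }

def apply_global(client_state, global_base):
    """Overwrite a client's generic layers; its 'head.*' layers stay local."""
    client_state.update(global_base)
    return client_state

# Toy usage with two clients holding a shared base weight and private heads.
clients = [
    {"base.w": np.ones((2, 2)), "head.w": np.full((2, 1), 0.1)},
    {"base.w": np.zeros((2, 2)), "head.w": np.full((2, 1), 0.9)},
]
global_base = aggregate_round(clients)                      # server step
clients = [apply_global(c, global_base) for c in clients]   # broadcast step
```

In this sketch, only the server-side averaging differs from plain FedAvg: the client-specific head parameters never leave the client, which is one way to accommodate disjoint classes or differing signal modalities across clients.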