差别化私人(大)期望最大化与统计保障 (Differentially Private (Gradient) Expectation Maximization Algorithm with Statistical Guarantees) - 专知论文

会员服务 ·

0

统计量 · 期望极大算法 · MoDELS · 估计/估计量 · 高斯混合（模型） ·

2021 年 7 月 22 日

Differentially Private (Gradient) Expectation Maximization Algorithm with Statistical Guarantees

翻译：差别化私人(大)期望最大化与统计保障

Di Wang,Jiahao Ding,Lijie Hu,Zejun Xie,Miao Pan,Jinhui Xu

from arxiv, Submiited. arXiv admin note: text overlap with arXiv:2010.09576

(Gradient) Expectation Maximization (EM) is a widely used algorithm for estimating the maximum likelihood of mixture models or incomplete data problems. A major challenge facing this popular technique is how to effectively preserve the privacy of sensitive data. Previous research on this problem has already lead to the discovery of some Differentially Private (DP) algorithms for (Gradient) EM. However, unlike in the non-private case, existing techniques are not yet able to provide finite sample statistical guarantees. To address this issue, we propose in this paper the first DP version of (Gradient) EM algorithm with statistical guarantees. Moreover, we apply our general framework to three canonical models: Gaussian Mixture Model (GMM), Mixture of Regressions Model (MRM) and Linear Regression with Missing Covariates (RMC). Specifically, for GMM in the DP model, our estimation error is near optimal in some cases. For the other two models, we provide the first finite sample statistical guarantees. Our theory is supported by thorough numerical experiments.

翻译：期望最大化(EM)是一种广泛使用的算法,用于估计混合模型的最大可能性或不完整的数据问题。这一流行技术所面临的一项主要挑战是如何有效保护敏感数据的隐私。以前对这一问题的研究已经导致发现了某些(显著)EM的差别私人算法。然而,与非私人案例不同,现有技术尚不能提供有限的抽样统计保证。为解决这一问题,我们在本文件中提议了第一个带有统计保障的(显著)EM算法的DP版本。此外,我们把我们的一般框架应用到三个卡通模型:高斯混合混合模型(GMM)、倒退模型(MRM)和与失踪共变体(RMC)的线性回归模型(RMC)。具体地说,对于DP模型中的GMM,我们的估计错误在某些案例中几乎是最佳的。对于其他两种模型,我们提供了第一个有统计保障的(显著)EM的样本。我们的理论得到了彻底的数字实验的支持。

0

相关内容

统计量

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【PNAS】深度神经网络中的理论议题，麻省理工Tomaso Poggio撰写

专知会员服务

20+阅读 · 2021年1月23日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

59+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

专知会员服务

134+阅读 · 2020年4月14日

《应用随机微分方程》(Applied Stochastic Differential Equations)324页pdf新书分享

《应用随机微分方程》(Applied Stochastic Differential Equations)324页pdf新书分享

专知会员服务

44+阅读 · 2019年10月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

最佳实践：深度学习用于自然语言处理（三）

最佳实践：深度学习用于自然语言处理（三）

待字闺中

3+阅读 · 2017年8月20日

已删除

将门创投

8+阅读 · 2017年7月21日

Improved Convergence Guarantees for Learning Gaussian Mixture Models by EM and Gradient EM

Arxiv

0+阅读 · 2021年9月23日

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Arxiv

0+阅读 · 2021年9月23日

TODG: Distributed Task Offloading with Delay Guarantees for Edge Computing

Arxiv

0+阅读 · 2021年9月23日

Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Arxiv

0+阅读 · 2021年9月23日

A unified interpretation of the Gaussian mechanism for differential privacy through the sensitivity index

Arxiv

0+阅读 · 2021年9月22日

Simpson's Paradox: A Singularity of Statistical and Inductive Inference

Arxiv

0+阅读 · 2021年9月22日

Sharp global convergence guarantees for iterative nonconvex optimization: A Gaussian process perspective

Arxiv

0+阅读 · 2021年9月20日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Arxiv

5+阅读 · 2020年7月31日

Differentiable Dynamic Programming for Structured Prediction and Attention

Arxiv

56+阅读 · 2018年2月20日

VIP会员

文章信息

相关主题

期望极大算法

估计/估计量

高斯混合（模型）

相关VIP内容

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【PNAS】深度神经网络中的理论议题，麻省理工Tomaso Poggio撰写

专知会员服务

20+阅读 · 2021年1月23日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

59+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

专知会员服务

134+阅读 · 2020年4月14日

《应用随机微分方程》(Applied Stochastic Differential Equations)324页pdf新书分享

《应用随机微分方程》(Applied Stochastic Differential Equations)324页pdf新书分享

专知会员服务

44+阅读 · 2019年10月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向可扩展深度神经网络的预测编码：理论与实践

如何快速获取数百万架无人机？

EMNLP 2025 | RTQA：递归思想求解复杂的时间知识图谱问答

组合式零样本学习综述

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

最佳实践：深度学习用于自然语言处理（三）

最佳实践：深度学习用于自然语言处理（三）

待字闺中

3+阅读 · 2017年8月20日

已删除

将门创投

8+阅读 · 2017年7月21日

相关论文

Improved Convergence Guarantees for Learning Gaussian Mixture Models by EM and Gradient EM

Arxiv

0+阅读 · 2021年9月23日

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Arxiv

0+阅读 · 2021年9月23日

TODG: Distributed Task Offloading with Delay Guarantees for Edge Computing

Arxiv

0+阅读 · 2021年9月23日

Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Arxiv

0+阅读 · 2021年9月23日

A unified interpretation of the Gaussian mechanism for differential privacy through the sensitivity index

Arxiv

0+阅读 · 2021年9月22日

Simpson's Paradox: A Singularity of Statistical and Inductive Inference

Arxiv

0+阅读 · 2021年9月22日

Sharp global convergence guarantees for iterative nonconvex optimization: A Gaussian process perspective

Arxiv

0+阅读 · 2021年9月20日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Arxiv

5+阅读 · 2020年7月31日

Differentiable Dynamic Programming for Structured Prediction and Attention

Arxiv

56+阅读 · 2018年2月20日

微信扫码咨询专知VIP会员