We introduce a simple, general framework that takes any differentially private estimator of any arbitrary quantity as a black box, and from it constructs a differentially private nonparametric confidence interval of that quantity. Our approach repeatedly subsamples the data, applies the private estimator to each subsample, and then post-processes the resulting empirical CDF to a confidence interval. Our analysis uses the randomness from the subsampling to achieve privacy amplification. Under mild assumptions, the empirical CDF we obtain approaches the CDF of the private statistic as the sample size grows. We use this to show that the confidence intervals we estimate are asymptotically valid, tight, and equivalent to their non-private counterparts. We provide empirical evidence that our method performs well compared with the (less-general) state-of-the-art algorithms.
翻译:我们提出了一种简单通用的框架,该框架将任意数量的差分隐私估计器作为黑盒输入,并由此构建该数量的差分隐私非参数置信区间。我们的方法通过重复子采样数据,对每个子样本应用隐私估计器,然后将得到的经验累积分布函数后处理为置信区间。分析利用子采样的随机性实现隐私放大。在温和假设下,随着样本量增加,我们获得的经验累积分布函数趋近于隐私统计量的累积分布函数。基于此,我们证明所估计的置信区间具有渐近有效性、紧致性,且与非隐私对应区间等价。我们提供的实证证据表明,与(通用性较低的)最先进算法相比,本方法表现优异。