We study the problem of consistently recovering the sparsity pattern of a regression parameter vector from correlated observations governed by deterministic missing data patterns using Lasso. We consider the case in which the observed dataset is censored by a deterministic, non-uniform filter. Recovering the sparsity pattern in datasets with deterministic missing structure can be arguably more challenging than recovering in a uniformly-at-random scenario. In this paper, we propose an efficient algorithm for missing value imputation by utilizing the topological property of the censorship filter. We then provide novel theoretical results for exact recovery of the sparsity pattern using the proposed imputation strategy. Our analysis shows that, under certain statistical and topological conditions, the hidden sparsity pattern can be recovered consistently with high probability in polynomial time and logarithmic sample complexity.
翻译:我们研究从使用Lasso的确定性缺失数据模式管辖的相关观测中不断从相关观测中恢复回归参数矢量的广度模式的问题。我们考虑了观察到的数据集被一个确定性非统一过滤器检查的情况。恢复确定性缺失结构数据集中的宽度模式可能比在统一随机假设中恢复更具挑战性。在本文中,我们提出一种有效的算法,利用审查过滤器的表层属性来计算缺失的值估算。然后我们提供新的理论结果,以便利用拟议的估算战略准确恢复宽度模式。我们的分析表明,在某些统计和地形条件下,隐藏的宽度模式可以在多元时间和对数样本复杂度的高度概率下持续恢复。