We consider the problem of estimating curvature where the data can be viewed as a noisy sample from an underlying manifold. For manifolds of dimension greater than one there are multiple definitions of local curvature, each suggesting a different estimation process for a given data set. Recently, there has been progress in proving that estimates of ``local point cloud curvature" converge to the related smooth notion of local curvature as the density of the point cloud approaches infinity. Herein we investigate practical limitations of such convergence theorems and discuss the significant impact of bias in such estimates as reported in recent literature. We provide theoretical arguments for the fact that bias increases drastically in higher dimensions, so much so that in high dimensions, the probability that a naive curvature estimate lies in a small interval near the true curvature could be near zero. We present a probabilistic framework that enables the construction of more accurate estimators of curvature for arbitrary noise models. The efficacy of our technique is supported with experiments on spheres of dimension as large as twelve.
翻译:本文研究从噪声采样数据估计曲率的问题,其中数据可视为来自底层流形的含噪样本。对于维度大于一的流形,存在多种局部曲率的定义,每种定义对应不同的数据集估计方法。近期研究表明,随着点云密度趋近无穷大,“局部点云曲率”的估计值会收敛于光滑流形的局部曲率概念。本文探讨此类收敛定理的实际局限性,并分析近期文献中报道的估计偏差的显著影响。我们从理论上论证了偏差在高维情况下急剧增加,以至于在高维空间中,朴素曲率估计值落入真实曲率附近小区间的概率可能接近零。我们提出一个概率框架,能够为任意噪声模型构建更精确的曲率估计器。通过在维度高达十二的球面上进行实验,验证了该方法的有效性。