离散轨道回收模型的可能性景观和最大可能性估计 (Likelihood landscape and maximum likelihood estimation for the discrete orbit recovery model)

We study the non-convex optimization landscape for maximum likelihood estimation in the discrete orbit recovery model with Gaussian noise. This model is motivated by applications in molecular microscopy and image processing, where each measurement of an unknown object is subject to an independent random rotation from a rotational group. Equivalently, it is a Gaussian mixture model where the mixture centers belong to a group orbit. We show that fundamental properties of the likelihood landscape depend on the signal-to-noise ratio and the group structure. At low noise, this landscape is "benign" for any discrete group, possessing no spurious local optima and only strict saddle points. At high noise, this landscape may develop spurious local optima, depending on the specific group. We discuss several positive and negative examples, and provide a general condition that ensures a globally benign landscape. For cyclic permutations of coordinates on $\mathbb{R}^d$ (multi-reference alignment), there may be spurious local optima when $d \geq 6$, and we establish a correspondence between these local optima and those of a surrogate function of the phase variables in the Fourier domain. We show that the Fisher information matrix transitions from resembling that of a single Gaussian in low noise to having a graded eigenvalue structure in high noise, which is determined by the graded algebra of invariant polynomials under the group action. In a local neighborhood of the true object, the likelihood landscape is strongly convex in a reparametrized system of variables given by a transcendence basis of this polynomial algebra. We discuss implications for optimization algorithms, including slow convergence of expectation-maximization, and possible advantages of momentum-based acceleration and variable reparametrization for first- and second-order descent methods.

翻译：在离散轨道回收模型中,我们用高斯噪音来研究非cavex优化优化景观,以进行最大可能性估计离散轨道回收模型中。这个模型的动机是分子显微镜和图像处理中的应用。分子显微镜和图像处理中,每个未知对象的测量都由一个旋转组进行独立的随机旋转。等量而言, 这是一个高斯混合模型, 混合物中心属于一个分组轨道。我们显示, 概率景观的基本特性取决于信号对声音的比例和组结构。在低噪音中, 任何离散组的景观“ 黄金”, 没有虚假的本地opima, 并且只有严格的挂载点。在高噪音组中, 这种景观可能会产生虚假的本地opifima, 取决于特定组。我们讨论一些正和负的例子, 提供了一个确保全球友好环境的景观。对于 $\ mathbb{R ⁇ d==drobision 的坐标偏差值, 当值变差值中,我们建立这些本地的视野之间的通信, 将显示在高位变变变变变变的系统中, 将显示高的变变变变变变的系统。