From Symmetry to Geometry: Tractable Nonconvex Problems

As science and engineering have become increasingly data-driven, the role of optimization has expanded to touch almost every stage of the data analysis pipeline, from signal and data acquisition to modeling and prediction. The optimization problems encountered in practice are often nonconvex. While the challenges vary from problem to problem, one common source of nonconvexity is nonlinearity in the data or measurement model. Nonlinear models often exhibit symmetries, which create complicated, nonconvex objective landscapes with multiple equivalent solutions. Nevertheless, simple methods (e.g., gradient descent) often perform surprisingly well in practice. The goal of this survey is to highlight a class of tractable nonconvex problems that can be understood through the lens of symmetries. These problems exhibit a characteristic geometric structure: local minimizers are symmetric copies of a single "ground truth" solution, while other critical points occur at balanced superpositions of symmetric copies of the ground truth and exhibit negative curvature in directions that break the symmetry. This structure enables efficient methods to obtain global minimizers. We discuss examples of this phenomenon arising from a wide range of problems in imaging, signal processing, and data analysis. We highlight the key role of symmetry in shaping the objective landscape and discuss the different roles of rotational and discrete symmetries. This area is rich with observed phenomena and open problems; we close by highlighting directions for future research.
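The geometric structure described above can be checked numerically on a toy problem. The sketch below is an illustration only, and the objective is an assumption chosen for simplicity rather than an example taken from the survey: f(x) = (1/4)‖xxᵀ − zzᵀ‖_F², which is unchanged under the sign flip x → −x. Its critical points are the two symmetric copies ±z of the ground truth and their balanced superposition x = 0. The names `grad`, `hessian`, and `gradient_descent`, the dimension, the step size, and the iteration count are all hypothetical choices.

```python
import numpy as np


def grad(x, z):
    """Gradient of f(x) = 0.25 * ||x x^T - z z^T||_F^2, i.e. (x x^T - z z^T) x."""
    return np.dot(x, x) * x - np.dot(z, x) * z


def hessian(x, z):
    """Hessian of the same objective: 2 x x^T + ||x||^2 I - z z^T."""
    n = x.size
    return 2.0 * np.outer(x, x) + np.dot(x, x) * np.eye(n) - np.outer(z, z)


def gradient_descent(x0, z, step=0.1, iters=500):
    """Plain gradient descent with a fixed step size (illustrative settings)."""
    x = x0.copy()
    for _ in range(iters):
        x = x - step * grad(x, z)
    return x


rng = np.random.default_rng(0)
n = 10
z = rng.standard_normal(n)
z /= np.linalg.norm(z)  # unit-norm "ground truth" (assumed for this toy problem)

# x = 0 is the midpoint of the symmetric copies +z and -z. Its Hessian is
# -z z^T, so the smallest eigenvalue is negative and the direction of
# negative curvature is z itself, the direction that breaks the symmetry.
saddle_min_eig = np.linalg.eigvalsh(hessian(np.zeros(n), z)).min()

# At the symmetric copy x = z the Hessian is I + z z^T, which is positive definite.
minimizer_min_eig = np.linalg.eigvalsh(hessian(z, z)).min()

# Gradient descent from a small random start should reach one of the two copies.
x0 = 0.1 * rng.standard_normal(n)
x_hat = gradient_descent(x0, z)
dist_to_truth = min(np.linalg.norm(x_hat - z), np.linalg.norm(x_hat + z))

print("smallest Hessian eigenvalue at x = 0:", saddle_min_eig)
print("smallest Hessian eigenvalue at x = z:", minimizer_min_eig)
print("distance from GD output to nearest of +z, -z:", dist_to_truth)
```

Which of the two copies gradient descent reaches depends on the sign of the initial correlation zᵀx₀; both are global minimizers, so either outcome recovers the ground truth up to the symmetry.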
