Simulation-free training frameworks have been at the forefront of the generative modelling revolution in continuous spaces, leading to large-scale diffusion and flow matching models. However, such modern generative models suffer from expensive inference, inhibiting their use in numerous scientific applications like Boltzmann Generators (BGs) for molecular conformations that require fast likelihood evaluation. In this paper, we revisit classical normalizing flows in the context of BGs that offer efficient sampling and likelihoods, but whose training via maximum likelihood is often unstable and computationally challenging. We propose Regression Training of Normalizing Flows (RegFlow), a novel and scalable regression-based training objective that bypasses the numerical instability and computational challenge of conventional maximum likelihood training in favour of a simple $\ell_2$-regression objective. Specifically, RegFlow maps prior samples under our flow to targets computed using optimal transport couplings or a pre-trained continuous normalizing flow (CNF). To enhance numerical stability, RegFlow employs effective regularization strategies such as a new forward-backward self-consistency loss that enjoys painless implementation. Empirically, we demonstrate that RegFlow unlocks a broader class of architectures that were previously intractable to train for BGs with maximum likelihood. We also show RegFlow exceeds the performance, computational cost, and stability of maximum likelihood training in equilibrium sampling in Cartesian coordinates of alanine dipeptide, tripeptide, and tetrapeptide, showcasing its potential in molecular systems.
翻译:无仿真训练框架已成为连续空间生成模型革命的前沿,催生了大规模扩散模型与流匹配模型。然而,此类现代生成模型存在推理成本高昂的问题,限制了其在众多科学应用中的使用,例如需要快速似然评估的分子构象玻尔兹曼生成器。本文在玻尔兹曼生成器的背景下重新审视经典正则化流方法——该方法虽能提供高效采样与似然计算,但通过最大似然进行的训练常存在不稳定性和计算挑战。我们提出正则化流回归训练法,这是一种新颖且可扩展的基于回归的训练目标,通过简单的ℓ₂回归目标规避了传统最大似然训练中的数值不稳定性和计算难题。具体而言,RegFlow将先验样本通过我们的流映射至基于最优传输耦合或预训练连续正则化流计算得到的目标值。为增强数值稳定性,RegFlow采用了有效的正则化策略,例如新型前向-后向自洽损失函数,其实现过程简洁高效。实验表明,RegFlow能够解锁更广泛的模型架构类别,这些架构此前因最大似然训练困难而无法应用于玻尔兹曼生成器。我们进一步证明,在丙氨酸二肽、三肽及四肽的笛卡尔坐标平衡采样任务中,RegFlow在性能、计算成本与稳定性方面均超越最大似然训练,展现了其在分子系统中的应用潜力。