As a powerful technique in generative modeling, Flow Matching (FM) aims to learn velocity fields from noise to data, which is often explained and implemented as solving Optimal Transport (OT) problems. In this study, we bridge FM and the recent theory of Optimal Acceleration Transport (OAT), developing an improved FM method called OAT-FM and exploring its benefits in both theory and practice. In particular, we demonstrate that the straightening objective hidden in existing OT-based FM methods is mathematically equivalent to minimizing the physical action associated with acceleration defined by OAT. Accordingly, instead of enforcing constant velocity, OAT-FM optimizes the acceleration transport in the product space of sample and velocity, whose objective corresponds to a necessary and sufficient condition of flow straightness. An efficient algorithm is designed to achieve OAT-FM with low complexity. OAT-FM motivates a new two-phase FM paradigm: Given a generative model trained by an arbitrary FM method, whose velocity information has been relatively reliable, we can fine-tune and improve it via OAT-FM. This paradigm eliminates the risk of data distribution drift and the need to generate a large number of noise data pairs, which consistently improves model performance in various generative tasks. Code is available at: https://github.com/AngxiaoYue/OAT-FM
翻译:作为一种强大的生成建模技术,流匹配(Flow Matching, FM)旨在学习从噪声到数据的速度场,其通常被解释和实现为求解最优传输(Optimal Transport, OT)问题。在本研究中,我们将FM与近期的最优加速传输(Optimal Acceleration Transport, OAT)理论联系起来,发展了一种改进的FM方法,称为OAT-FM,并从理论和实践两方面探讨了其优势。具体而言,我们证明了现有基于OT的FM方法中隐含的“拉直”目标,在数学上等价于最小化由OAT定义的加速度所关联的物理作用量。因此,OAT-FM不再强制恒定速度,而是在样本与速度的乘积空间中优化加速传输,其目标对应于流直线性的一个充分必要条件。我们设计了一种高效算法以低复杂度实现OAT-FM。OAT-FM启发了一种新的两阶段FM范式:给定一个由任意FM方法训练得到的生成模型(其速度信息已相对可靠),我们可以通过OAT-FM对其进行微调和改进。该范式消除了数据分布漂移的风险以及生成大量噪声数据对的需求,并在多种生成任务中持续提升了模型性能。代码发布于:https://github.com/AngxiaoYue/OAT-FM