限制通过连续时间系统优化非消费-非消费-非消费 Minimax (Limiting Behaviors of Nonconvex-Nonconcave Minimax Optimization via Continuous-Time Systems) - 专知论文

会员服务 ·

0

优化器 · GAN · 非凸 · 正则化项 · 最优化 ·

2021 年 3 月 4 日

Limiting Behaviors of Nonconvex-Nonconcave Minimax Optimization via Continuous-Time Systems

翻译：限制通过连续时间系统优化非消费-非消费-非消费 Minimax

Benjamin Grimmer,Haihao Lu,Pratik Worah,Vahab Mirrokni

Unlike nonconvex optimization, where gradient descent is guaranteed to converge to a local optimizer, algorithms for nonconvex-nonconcave minimax optimization can have topologically different solution paths: sometimes converging to a solution, sometimes never converging and instead following a limit cycle, and sometimes diverging. In this paper, we study the limiting behaviors of three classic minimax algorithms: gradient descent ascent (GDA), alternating gradient descent ascent (AGDA), and the extragradient method (EGM). Numerically, we observe that all of these limiting behaviors can arise in Generative Adversarial Networks (GAN) training and are easily demonstrated for a range of GAN problems. To explain these different behaviors, we study the high-order resolution continuous-time dynamics that correspond to each algorithm, which results in the sufficient (and almost necessary) conditions for the local convergence by each method. Moreover, this ODE perspective allows us to characterize the phase transition between these different limiting behaviors caused by introducing regularization as Hopf Bifurcations.

翻译：与非 Convex 优化不同, 梯度下降可以保证与本地优化相融合, 而非colve- nonconcolve minimax优化的算法则则在结构学上可能具有不同的解决办法: 有时会与一个解决方案相融合, 有时从不相融合, 代之以一个极限周期, 有时会有所不同。在本文中, 我们研究三种经典迷你运算法的局限性行为: 梯度下降率( GDA ), 交替梯度下降率( AGDA ), 和异常方法( EGM ) 。从数字上看, 我们观察到所有这些限制行为都可以在 General Aversarial 网络( GAN) 培训中出现, 并且很容易地展示出一系列 GAN 问题。为了解释这些不同的行为, 我们研究与每种算法相对应的高阶梯度分辨率连续时间的动态, 导致每种方法对本地趋同的足够( 几乎必要的) 条件。此外, 通过ODE 观点可以让我们描述这些不同的限制行为之间的阶段过渡, 。

0

相关内容

优化器

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

随波逐流：Similarity-Adaptive and Discrete Optimization

随波逐流：Similarity-Adaptive and Discrete Optimization

我爱读PAMI

5+阅读 · 2018年2月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Self-Bounding Majority Vote Learning Algorithms by the Direct Minimization of a Tight PAC-Bayesian C-Bound

Arxiv

0+阅读 · 2021年4月28日

A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints

Arxiv

0+阅读 · 2021年4月27日

Optimal controller synthesis for timed systems

Arxiv

0+阅读 · 2021年4月26日

A Control-Theoretic Perspective on Optimal High-Order Optimization

Arxiv

0+阅读 · 2021年4月24日

Optimal Dynamic Regret in Exp-Concave Online Learning

Arxiv

0+阅读 · 2021年4月23日

Exact priors of finite neural networks

Arxiv

0+阅读 · 2021年4月23日

Generating Continuous Motion and Force Plans in Real-Time for Legged Mobile Manipulation

Generating Continuous Motion and Force Plans in Real-Time for Legged Mobile Manipulation

Arxiv

0+阅读 · 2021年4月23日

Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

Arxiv

0+阅读 · 2021年4月23日

Optimal Cost Design for Model Predictive Control

Arxiv

0+阅读 · 2021年4月23日

Decentralized Multi-Agents by Imitation of a Centralized Controller

Arxiv

0+阅读 · 2021年4月22日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

《美军条令文件：频谱管理操作技术》2025最新100页

反制小型无人机：一项重大挑战

《AI作战：将人机协作集成至实时、虚拟与建构环境（LVC）的建模与仿真》

相关资讯

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

随波逐流：Similarity-Adaptive and Discrete Optimization

随波逐流：Similarity-Adaptive and Discrete Optimization

我爱读PAMI

5+阅读 · 2018年2月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Self-Bounding Majority Vote Learning Algorithms by the Direct Minimization of a Tight PAC-Bayesian C-Bound

Arxiv

0+阅读 · 2021年4月28日

A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints

Arxiv

0+阅读 · 2021年4月27日

Optimal controller synthesis for timed systems

Arxiv

0+阅读 · 2021年4月26日

A Control-Theoretic Perspective on Optimal High-Order Optimization

Arxiv

0+阅读 · 2021年4月24日

Optimal Dynamic Regret in Exp-Concave Online Learning

Arxiv

0+阅读 · 2021年4月23日

Exact priors of finite neural networks

Arxiv

0+阅读 · 2021年4月23日

Generating Continuous Motion and Force Plans in Real-Time for Legged Mobile Manipulation

Generating Continuous Motion and Force Plans in Real-Time for Legged Mobile Manipulation

Arxiv

0+阅读 · 2021年4月23日

Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

Arxiv

0+阅读 · 2021年4月23日

Optimal Cost Design for Model Predictive Control

Arxiv

0+阅读 · 2021年4月23日

Decentralized Multi-Agents by Imitation of a Centralized Controller

Arxiv

0+阅读 · 2021年4月22日

微信扫码咨询专知VIP会员