Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization - 专知 (Zhuanzhi) Papers

Gradient descent · Random initialization · Regularization term · Nonconvex · Symmetric matrix

February 8, 2023

Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization

Daesung Kim, Hye Won Chung

The nonconvex formulation of the matrix completion problem has received significant attention in recent years due to its affordable complexity compared to the convex formulation. Gradient descent (GD) is the simplest yet efficient baseline algorithm for solving nonconvex optimization problems. The success of GD has been witnessed in many different problems in both theory and practice when it is combined with random initialization. However, previous works on matrix completion require either careful initialization or regularizers to prove the convergence of GD. In this work, we study rank-1 symmetric matrix completion and prove that GD converges to the ground truth when small random initialization is used. We show that within a logarithmic number of iterations, the trajectory enters the region where local convergence occurs. We provide an upper bound on the initialization size that is sufficient to guarantee convergence and show that a larger initialization can be used when more samples are available. We observe that the implicit regularization effect of GD plays a critical role in the analysis, and for the entire trajectory, it prevents each entry from becoming much larger than the others.
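The setting described in the abstract can be illustrated with a small numerical sketch. The abstract does not state the objective, so the code below assumes a common formulation: the ground truth is M = x* x*^T, each entry on or above the diagonal is observed independently with probability p (and mirrored), and plain GD is run on f(x) = (1/(4p)) Σ_{(i,j)∈Ω} (x_i x_j − M_ij)^2, whose gradient is (1/p) P_Ω(x x^T − M) x. The function name `rank1_completion_gd` and all parameter values (`n`, `p`, `alpha`, `eta`, `iters`) are hypothetical choices for illustration, not the authors' experimental setup, and no regularizer or spectral initialization is used.

```python
import math
import random


def rank1_completion_gd(n=40, p=0.6, alpha=1e-3, eta=0.1, iters=2000, seed=0):
    """Plain GD for rank-1 symmetric matrix completion from a small random start.

    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)

    # Assumed ground truth: unit-norm vector whose entries have comparable size.
    raw = [rng.choice((-1.0, 1.0)) * rng.uniform(0.5, 1.5) for _ in range(n)]
    norm = math.sqrt(sum(v * v for v in raw))
    x_star = [v / norm for v in raw]

    # Symmetric observation set: sample each pair (i, j) with i <= j, then mirror.
    observed = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            if rng.random() < p:
                observed[i].append(j)
                if j != i:
                    observed[j].append(i)

    # Small random initialization: Gaussian vector of norm about alpha.
    x = [alpha * rng.gauss(0.0, 1.0) / math.sqrt(n) for _ in range(n)]

    for _ in range(iters):
        # Gradient of the assumed loss: (1/p) * P_Omega(x x^T - M) x.
        grad = [
            sum((x[i] * x[j] - x_star[i] * x_star[j]) * x[j] for j in observed[i]) / p
            for i in range(n)
        ]
        x = [xi - eta * gi for xi, gi in zip(x, grad)]

    # x* is identifiable only up to a global sign, so report the smaller error.
    err_plus = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, x_star)))
    err_minus = math.sqrt(sum((a + b) ** 2 for a, b in zip(x, x_star)))
    return x, x_star, min(err_plus, err_minus)
```

Calling `rank1_completion_gd()` returns the final iterate, the ground-truth vector, and the distance to the ground truth up to a global sign. Varying `alpha` and `p` in this sketch is one way to explore the abstract's claim that a larger initialization can be tolerated when more samples are available.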
