Robust Differentiable SVD - 专知 (Zhuanzhi) Papers

Singular value decomposition · Robustness · Image classification · Integration · Model evaluation

April 8, 2021

Robust Differentiable SVD

Wei Wang, Zheng Dang, Yinlin Hu, Pascal Fua, Mathieu Salzmann

From arXiv; IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), preprint, 2021

Eigendecomposition of symmetric matrices is at the heart of many computer vision algorithms. However, the derivatives of the eigenvectors tend to be numerically unstable, whether using the SVD to compute them analytically or using the Power Iteration (PI) method to approximate them. This instability arises in the presence of eigenvalues that are close to each other. This makes integrating eigendecomposition into deep networks difficult and often results in poor convergence, particularly when dealing with large matrices. While this can be mitigated by partitioning the data into small arbitrary groups, doing so has no theoretical basis and makes it impossible to exploit the full power of eigendecomposition. In previous work, we mitigated this using SVD during the forward pass and PI to compute the gradients during the backward pass. However, the iterative deflation procedure required to compute multiple eigenvectors using PI tends to accumulate errors and yield inaccurate gradients. Here, we show that the Taylor expansion of the SVD gradient is theoretically equivalent to the gradient obtained using PI without relying in practice on an iterative process and thus yields more accurate gradients. We demonstrate the benefits of this increased accuracy for image classification and style transfer.
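The instability described in the abstract can be illustrated numerically. The sketch below assumes the standard form of the analytical eigenvector gradient, in which a factor 1/(λ_i − λ_j) appears for each pair of eigenvalues; the abstract does not spell this out, so treat it as an assumption. It compares that exact factor with a truncated geometric (Taylor) series in λ_j/λ_i, which stays bounded even when the two eigenvalues nearly coincide. The function names and the truncation degree are illustrative and are not taken from the paper.

```python
import numpy as np


def exact_factor(lam_i, lam_j):
    """Exact pairwise factor 1 / (lam_i - lam_j); blows up as lam_j -> lam_i."""
    return 1.0 / (lam_i - lam_j)


def taylor_factor(lam_i, lam_j, degree=9):
    """Truncated series for the same factor, assuming lam_i > lam_j >= 0.

    Uses 1/(lam_i - lam_j) = (1/lam_i) * 1/(1 - r) with r = lam_j / lam_i,
    and 1/(1 - r) ~= 1 + r + r**2 + ... + r**degree.
    Because 0 <= r < 1, the result never exceeds (degree + 1) / lam_i.
    """
    r = lam_j / lam_i
    powers = r ** np.arange(degree + 1)
    return powers.sum() / lam_i


# Hypothetical eigenvalue pairs chosen only for illustration.
well_separated = (2.0, 0.5)
nearly_equal = (2.0, 2.0 - 1e-12)

for name, (li, lj) in [("well separated", well_separated),
                       ("nearly equal", nearly_equal)]:
    print(name, exact_factor(li, lj), taylor_factor(li, lj))
```

For well-separated eigenvalues the two factors agree closely. For nearly equal eigenvalues the exact factor becomes enormous, while the truncated series is capped by (degree + 1) / λ_i. The cap trades some accuracy for stability, so the degree is a tuning choice rather than a fixed constant.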

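Related topic: singular value decomposition

Singular value decomposition (SVD) is an important matrix factorization in linear algebra that generalizes eigendecomposition to arbitrary matrices. It has important applications in signal processing, statistics, and other fields.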
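As a small illustration of how SVD relates to eigendecomposition, the sketch below factors a random rectangular matrix with NumPy and checks two well-known identities: the factors reconstruct the matrix, and the squared singular values equal the eigenvalues of the symmetric matrix AᵀA. The matrix shape and random seed are arbitrary choices made for this example.

```python
import numpy as np

# Arbitrary 5x3 matrix; eigendecomposition alone does not apply to it
# because it is not square.
rng = np.random.default_rng(0)
A = rng.standard_normal((5, 3))

# Thin SVD: A = U @ diag(s) @ Vt, with singular values in descending order.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
reconstruction = U @ np.diag(s) @ Vt

# Eigenvalues of the symmetric matrix A^T A, reversed into descending order
# (eigvalsh returns them in ascending order).
eigvals = np.linalg.eigvalsh(A.T @ A)[::-1]

print(np.allclose(reconstruction, A))
print(np.allclose(s ** 2, eigvals))
```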
