Residual networks · Batch normalization · ResNet · Computation graph · Initialization

March 24, 2023

Poincaré ResNet

Max van Spengler, Erwin Berkhout, Pascal Mettes

This paper introduces an end-to-end residual network that operates entirely on the Poincaré ball model of hyperbolic space. Hyperbolic learning has recently shown great potential for visual understanding, but is currently only performed in the penultimate layer(s) of deep networks. All visual representations are still learned through standard Euclidean networks. In this paper, we investigate how to learn hyperbolic representations of visual data directly from the pixel level. We propose Poincaré ResNet, a hyperbolic counterpart of the celebrated residual network, starting from Poincaré 2D convolutions up to Poincaré residual connections. We identify three roadblocks for training convolutional networks entirely in hyperbolic space and propose a solution for each: (i) Current hyperbolic network initializations collapse to the origin, limiting their applicability in deeper networks. We provide an identity-based initialization that preserves norms over many layers. (ii) Residual networks rely heavily on batch normalization, which comes with expensive Fréchet mean calculations in hyperbolic space. We introduce Poincaré midpoint batch normalization as a faster and equally effective alternative. (iii) Due to the many intermediate operations in Poincaré layers, we lastly find that the computation graphs of deep learning libraries blow up, limiting our ability to train deep hyperbolic networks. We provide manual backward derivations of core hyperbolic operations to maintain manageable computation graphs.
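To make point (ii) more concrete, the following is a minimal NumPy sketch of a closed-form midpoint on the Poincaré ball, which avoids the iterative optimization that a Fréchet mean typically requires. It is not the authors' implementation: the abstract does not give the formula, so the sketch assumes the commonly used equal-weight gyromidpoint on a ball of curvature -c, and the function names (`mobius_add`, `mobius_scalar_mul`, `poincare_distance`, `poincare_midpoint`) are illustrative choices rather than names from the paper.

```python
import numpy as np


def mobius_add(x, y, c=1.0):
    """Mobius addition of x and y on the Poincare ball of curvature -c."""
    xy = np.dot(x, y)
    x2 = np.dot(x, x)
    y2 = np.dot(y, y)
    num = (1 + 2 * c * xy + c * y2) * x + (1 - c * x2) * y
    den = 1 + 2 * c * xy + c ** 2 * x2 * y2
    return num / den


def mobius_scalar_mul(r, x, c=1.0):
    """Mobius scalar multiplication: rescales the hyperbolic distance to the origin by r."""
    norm = np.linalg.norm(x)
    if norm == 0.0:
        return x  # the origin is a fixed point
    sqrt_c = np.sqrt(c)
    return np.tanh(r * np.arctanh(sqrt_c * norm)) * x / (sqrt_c * norm)


def poincare_distance(x, y, c=1.0):
    """Geodesic distance between x and y on the Poincare ball."""
    sqrt_c = np.sqrt(c)
    diff = mobius_add(-x, y, c)
    return 2.0 / sqrt_c * np.arctanh(sqrt_c * np.linalg.norm(diff))


def poincare_midpoint(points, c=1.0):
    """Equal-weight gyromidpoint of a batch of points (assumed formula, see lead-in).

    points: array of shape (n, d), each row strictly inside the ball.
    """
    points = np.asarray(points, dtype=float)
    # Conformal factor lambda_i = 2 / (1 - c * ||x_i||^2) for each point.
    lam = 2.0 / (1.0 - c * np.sum(points ** 2, axis=1))
    weighted = (lam[:, None] * points).sum(axis=0) / (lam - 1.0).sum()
    return mobius_scalar_mul(0.5, weighted, c)


if __name__ == "__main__":
    batch = np.array([[0.5, 0.0], [0.0, 0.0]])
    print(poincare_midpoint(batch))
```

With c = 1, the midpoint of the origin and (0.5, 0) lies on the x-axis at 2 - √3 ≈ 0.268, which is the geodesic midpoint of the two points. A batch normalization layer built on this idea would use such a midpoint in place of the Fréchet mean when centering activations; how the paper handles variance and rescaling is not stated in the abstract and is left out here.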
