GMFlow:通过全球配对学习光学流动 (GMFlow: Learning Optical Flow via Global Matching) - 专知论文

会员服务 ·

0

Learning · Raft算法 · 相关系数 · 层 · 可辨认的 ·

2022 年 6 月 10 日

GMFlow: Learning Optical Flow via Global Matching

翻译：GMFlow:通过全球配对学习光学流动

Haofei Xu,Jing Zhang,Jianfei Cai,Hamid Rezatofighi,Dacheng Tao

from arxiv, CVPR 2022, Oral

Learning-based optical flow estimation has been dominated with the pipeline of cost volume with convolutions for flow regression, which is inherently limited to local correlations and thus is hard to address the long-standing challenge of large displacements. To alleviate this, the state-of-the-art framework RAFT gradually improves its prediction quality by using a large number of iterative refinements, achieving remarkable performance but introducing linearly increasing inference time. To enable both high accuracy and efficiency, we completely revamp the dominant flow regression pipeline by reformulating optical flow as a global matching problem, which identifies the correspondences by directly comparing feature similarities. Specifically, we propose a GMFlow framework, which consists of three main components: a customized Transformer for feature enhancement, a correlation and softmax layer for global feature matching, and a self-attention layer for flow propagation. We further introduce a refinement step that reuses GMFlow at higher feature resolution for residual flow prediction. Our new framework outperforms 31-refinements RAFT on the challenging Sintel benchmark, while using only one refinement and running faster, suggesting a new paradigm for accurate and efficient optical flow estimation. Code is available at https://github.com/haofeixu/gmflow.

翻译：以学习为基础的光学流估计一直以成本量与流量回归的演变过程相联而为主,这必然限于当地的相关性,因此难以应对长期存在的大规模迁移的挑战。为缓解这种情况,先进框架RAFT通过大量迭接改进逐步提高其预测质量,取得显著的性能,但引入了线性增加的推论时间。为了能够提高准确性和效率,我们完全改造了主导流量回归管道,将光流作为全球匹配问题,通过直接比较特征相似点来查明对应点。具体地说,我们提议了一个GMFlow框架,由三个主要部分组成:一个定制的功能增强变异器,一个用于全球特征匹配的关联和软模层,以及一个流动传播的自我注意层。我们进一步引入了一个改进步骤,将GMFlow再利用更高的特征分辨率进行剩余流量预测。我们的新框架在挑战性的Sintel基准上比31项更精确,而RAFT要精确地精炼,同时仅使用一个改进和运行得更快,为精确和高效的光学流估计提供了新的范例。

0

相关内容

Learning

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

紧区间上保向微分同胚的光滑嵌入流

国家自然科学基金

0+阅读 · 2015年12月31日

EAST高功率低杂波与边界等离子体非线性相互作用的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

EAST装置上离子回旋新型长天线的设计与耦合研究

国家自然科学基金

0+阅读 · 2012年12月31日

BRCA1蛋白出核的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

中国淡水异极藻科（ Gomphonemaceae）植物的分类学研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

海蜇补体系统的结构、功能及其激活途径

国家自然科学基金

0+阅读 · 2012年12月31日

Snail1调控STOML2的表达在糖尿病肾病EMT发生中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于点约束的工业机器人在线自标定方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于pQDs-CD133mAb的双模态探针对胶质瘤CD133+干细胞靶向成像的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

Event-guided Deblurring of Unknown Exposure Time Videos

Arxiv

0+阅读 · 2022年7月26日

Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

Arxiv

0+阅读 · 2022年7月26日

Multi-Scale RAFT: Combining Hierarchical Concepts for Learning-based Optical FLow Estimation

Arxiv

0+阅读 · 2022年7月25日

MeshLoc: Mesh-Based Visual Localization

Arxiv

0+阅读 · 2022年7月25日

RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos

Arxiv

0+阅读 · 2022年7月22日

Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Arxiv

10+阅读 · 2021年10月4日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

Matching Networks for One Shot Learning

Arxiv

10+阅读 · 2017年12月29日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《实现多层防御多轮交战机制的扩展型随机齐射模型》2025年最新83页

《美军条令：小部队指挥官山地作战指南》最新238页

如何避免生成式人工智能在作战中失控失效

《俄乌战争中俄罗斯两栖作战能力：黑海舰队战力与作战失利研究》2025年最新111页

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Event-guided Deblurring of Unknown Exposure Time Videos

Arxiv

0+阅读 · 2022年7月26日

Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

Arxiv

0+阅读 · 2022年7月26日

Multi-Scale RAFT: Combining Hierarchical Concepts for Learning-based Optical FLow Estimation

Arxiv

0+阅读 · 2022年7月25日

MeshLoc: Mesh-Based Visual Localization

Arxiv

0+阅读 · 2022年7月25日

RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos

Arxiv

0+阅读 · 2022年7月22日

Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Arxiv

10+阅读 · 2021年10月4日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Arxiv

11+阅读 · 2018年3月28日

Matching Networks for One Shot Learning

Arxiv

10+阅读 · 2017年12月29日

相关基金

紧区间上保向微分同胚的光滑嵌入流

国家自然科学基金

0+阅读 · 2015年12月31日

EAST高功率低杂波与边界等离子体非线性相互作用的机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

EAST装置上离子回旋新型长天线的设计与耦合研究

国家自然科学基金

0+阅读 · 2012年12月31日

BRCA1蛋白出核的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

中国淡水异极藻科（ Gomphonemaceae）植物的分类学研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

海蜇补体系统的结构、功能及其激活途径

国家自然科学基金

0+阅读 · 2012年12月31日

Snail1调控STOML2的表达在糖尿病肾病EMT发生中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于点约束的工业机器人在线自标定方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于pQDs-CD133mAb的双模态探针对胶质瘤CD133+干细胞靶向成像的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员