转换标题： (ProContEXT: Exploring Progressive Context Transformer for Tracking) - 专知论文

会员服务 ·

0

上下文 · 跟踪器 · 上下文转换器 · 上下文建模 · 视觉目标跟踪 ·

2023 年 3 月 27 日

ProContEXT: Exploring Progressive Context Transformer for Tracking

翻译：转换标题：

Jin-Peng Lan,Zhi-Qi Cheng,Jun-Yan He,Chenyang Li,Bin Luo,Xu Bao,Wangmeng Xiang,Yifeng Geng,Xuansong Xie

from arxiv, Accepted at ICASSP 2023, source code is at https://github.com/zhiqic/ProContEXT

Existing Visual Object Tracking (VOT) only takes the target area in the first frame as a template. This causes tracking to inevitably fail in fast-changing and crowded scenes, as it cannot account for changes in object appearance between frames. To this end, we revamped the tracking framework with Progressive Context Encoding Transformer Tracker (ProContEXT), which coherently exploits spatial and temporal contexts to predict object motion trajectories. Specifically, ProContEXT leverages a context-aware self-attention module to encode the spatial and temporal context, refining and updating the multi-scale static and dynamic templates to progressively perform accurate tracking. It explores the complementary between spatial and temporal context, raising a new pathway to multi-context modeling for transformer-based trackers. In addition, ProContEXT revised the token pruning technique to reduce computational complexity. Extensive experiments on popular benchmark datasets such as GOT-10k and TrackingNet demonstrate that the proposed ProContEXT achieves state-of-the-art performance.

翻译：渐进式上下文转换器（ProContEXT）用于跟踪翻译摘要：现有的视觉目标跟踪（VOT）只将第一帧中的目标区域作为模板。这会导致在快速变化和拥挤的场景中跟踪必然失败，因为它无法考虑帧之间目标外观的变化。为此，我们使用渐进式上下文编码变换跟踪器（ProContEXT）重新设计了跟踪框架，以协同利用空间和时间上下文来预测对象运动轨迹。具体来说，ProContEXT利用上下文感知的自我注意力模块来编码空间和时间上下文，逐步改进和更新多尺度静态和动态模板，逐步执行精确跟踪。它探索了空间和时间上下文之间的互补性，为基于变压器的跟踪器提供了一条新的多上下文建模路径。此外，ProContEXT修改了令牌修剪技术以减少计算复杂性。对流行的基准数据集（如GOT-10k和TrackingNet）进行的广泛实验表明，所提出的ProContEXT实现了最先进的性能。

0

相关内容

上下文

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

专知会员服务

17+阅读 · 2022年3月19日

【CVPR 2022】基于Tracklet查询和建议的高效视频实例分割，Efficient Video Instance Segmentation via Tracklet Query and Proposal

【CVPR 2022】基于Tracklet查询和建议的高效视频实例分割，Efficient Video Instance Segmentation via Tracklet Query and Proposal

专知会员服务

16+阅读 · 2022年3月3日

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

专知会员服务

28+阅读 · 2022年3月3日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

近期必读的9篇 CVPR 2019【视觉目标跟踪】相关论文和代码

近期必读的9篇 CVPR 2019【视觉目标跟踪】相关论文和代码

专知会员服务

33+阅读 · 2020年1月10日

必读的7篇IJCAI 2019【图神经网络（GNN）】相关论文-Part2

必读的7篇IJCAI 2019【图神经网络（GNN）】相关论文-Part2

专知会员服务

62+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

CVPR 2022 | 从自注意力中学习语义Affinity，用于端到端弱监督语义分割

CVPR 2022 | 从自注意力中学习语义Affinity，用于端到端弱监督语义分割

PaperWeekly

0+阅读 · 2022年6月18日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

CVPR2019丨首个siamese网络中训练GCNs的视觉追踪方法《Graph Convolutional Tracking》

CVPR2019丨首个siamese网络中训练GCNs的视觉追踪方法《Graph Convolutional Tracking》

极市平台

17+阅读 · 2019年10月6日

CVPR 2019 | 重磅！34篇 CVPR2019 论文实现代码

CVPR 2019 | 重磅！34篇 CVPR2019 论文实现代码

AI研习社

11+阅读 · 2019年6月21日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇视觉问答相关论文—深度嵌入学习、句子表征学习、深度特征聚合、3D匹配、细粒度文本摘要

【论文推荐】最新六篇视觉问答相关论文—深度嵌入学习、句子表征学习、深度特征聚合、3D匹配、细粒度文本摘要

专知

12+阅读 · 2018年6月9日

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

泡泡机器人SLAM

10+阅读 · 2018年5月26日

LDPE/MWCNTs复合材料低能电子辐致力学损伤效应与机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

信号稀疏表示与重构的神经网络算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

微米液滴冲击微纳米结构表面的流动与传热机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

软骨寡聚基质蛋白通过BMPs/BMPRII通路介导肺动脉平滑肌细胞表型转化

国家自然科学基金

0+阅读 · 2013年12月31日

纳米复合热电材料温差发电过程热-电传输的微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

DNA序列氧化损伤的电致化学发光定量检测

国家自然科学基金

0+阅读 · 2013年12月31日

高能重离子辐照高压条件下地质材料的相变研究

国家自然科学基金

0+阅读 · 2013年12月31日

嵌段共聚物逐层自组装制备三维有序CO2纳米通道

国家自然科学基金

0+阅读 · 2012年12月31日

超精度视频内容三维重建

国家自然科学基金

0+阅读 · 2011年12月31日

S$^3$Track: Self-supervised Tracking with Soft Assignment Flow

Arxiv

0+阅读 · 2023年5月17日

Ray-Patch: An Efficient Decoder for Light Field Transformers

Arxiv

0+阅读 · 2023年5月16日

Pre-Training to Learn in Context

Arxiv

0+阅读 · 2023年5月16日

LoViT: Long Video Transformer for Surgical Phase Recognition

Arxiv

0+阅读 · 2023年5月15日

MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset

Arxiv

0+阅读 · 2023年5月15日

Expertise-based Weighting for Regression Models with Noisy Labels

Arxiv

0+阅读 · 2023年5月12日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks

Arxiv

11+阅读 · 2019年9月8日

Multimodal Sentiment Analysis using Hierarchical Fusion with Context Modeling

Arxiv

11+阅读 · 2018年6月16日

VIP会员

文章信息

相关主题

上下文转换器

上下文建模

视觉目标跟踪

相关VIP内容

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

【CVPR 2022】基于灵活模态Transformer的人脸防伪 FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

专知会员服务

17+阅读 · 2022年3月19日

【CVPR 2022】基于Tracklet查询和建议的高效视频实例分割，Efficient Video Instance Segmentation via Tracklet Query and Proposal

【CVPR 2022】基于Tracklet查询和建议的高效视频实例分割，Efficient Video Instance Segmentation via Tracklet Query and Proposal

专知会员服务

16+阅读 · 2022年3月3日

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

【CVPR 2022】使用多模态Transformer的端到端视频对象分割，End-to-End Referring Video Object Segmentation with Multimodal Transformer

专知会员服务

28+阅读 · 2022年3月3日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

近期必读的9篇 CVPR 2019【视觉目标跟踪】相关论文和代码

近期必读的9篇 CVPR 2019【视觉目标跟踪】相关论文和代码

专知会员服务

33+阅读 · 2020年1月10日

必读的7篇IJCAI 2019【图神经网络（GNN）】相关论文-Part2

必读的7篇IJCAI 2019【图神经网络（GNN）】相关论文-Part2

专知会员服务

62+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

CVPR 2022 | 从自注意力中学习语义Affinity，用于端到端弱监督语义分割

CVPR 2022 | 从自注意力中学习语义Affinity，用于端到端弱监督语义分割

PaperWeekly

0+阅读 · 2022年6月18日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

CVPR2019丨首个siamese网络中训练GCNs的视觉追踪方法《Graph Convolutional Tracking》

CVPR2019丨首个siamese网络中训练GCNs的视觉追踪方法《Graph Convolutional Tracking》

极市平台

17+阅读 · 2019年10月6日

CVPR 2019 | 重磅！34篇 CVPR2019 论文实现代码

CVPR 2019 | 重磅！34篇 CVPR2019 论文实现代码

AI研习社

11+阅读 · 2019年6月21日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇视觉问答相关论文—深度嵌入学习、句子表征学习、深度特征聚合、3D匹配、细粒度文本摘要

【论文推荐】最新六篇视觉问答相关论文—深度嵌入学习、句子表征学习、深度特征聚合、3D匹配、细粒度文本摘要

专知

12+阅读 · 2018年6月9日

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

泡泡机器人SLAM

10+阅读 · 2018年5月26日

相关论文

S$^3$Track: Self-supervised Tracking with Soft Assignment Flow

Arxiv

0+阅读 · 2023年5月17日

Ray-Patch: An Efficient Decoder for Light Field Transformers

Arxiv

0+阅读 · 2023年5月16日

Pre-Training to Learn in Context

Arxiv

0+阅读 · 2023年5月16日

LoViT: Long Video Transformer for Surgical Phase Recognition

Arxiv

0+阅读 · 2023年5月15日

MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset

Arxiv

0+阅读 · 2023年5月15日

Expertise-based Weighting for Regression Models with Noisy Labels

Arxiv

0+阅读 · 2023年5月12日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks

Arxiv

11+阅读 · 2019年9月8日

Multimodal Sentiment Analysis using Hierarchical Fusion with Context Modeling

Arxiv

11+阅读 · 2018年6月16日

相关基金

LDPE/MWCNTs复合材料低能电子辐致力学损伤效应与机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

信号稀疏表示与重构的神经网络算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

微米液滴冲击微纳米结构表面的流动与传热机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

软骨寡聚基质蛋白通过BMPs/BMPRII通路介导肺动脉平滑肌细胞表型转化

国家自然科学基金

0+阅读 · 2013年12月31日

纳米复合热电材料温差发电过程热-电传输的微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

DNA序列氧化损伤的电致化学发光定量检测

国家自然科学基金

0+阅读 · 2013年12月31日

高能重离子辐照高压条件下地质材料的相变研究

国家自然科学基金

0+阅读 · 2013年12月31日

嵌段共聚物逐层自组装制备三维有序CO2纳米通道

国家自然科学基金

0+阅读 · 2012年12月31日

超精度视频内容三维重建

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员