In this paper, we propose Gated-ViGAT, an efficient approach for video event recognition that utilizes bottom-up (object) information, a new frame sampling policy, and a gating mechanism. Specifically, the frame sampling policy uses weighted in-degrees (WiDs), derived from the adjacency matrices of graph attention networks (GATs), together with a dissimilarity measure to select the most salient and, at the same time, diverse frames representing the event in the video. Additionally, the proposed gating mechanism fetches the selected frames sequentially and exits early once a sufficiently confident decision is reached. In this way, only a few frames are processed by the computationally expensive branch of our network, which is responsible for bottom-up information extraction. Experimental evaluation on two large, publicly available video datasets (MiniKinetics, ActivityNet) demonstrates that Gated-ViGAT achieves a large reduction in computational complexity compared to our previous approach (ViGAT), while maintaining excellent event recognition and explainability performance. The Gated-ViGAT source code is publicly available at https://github.com/bmezaris/Gated-ViGAT
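To make the two mechanisms described above concrete, the following is a minimal sketch, not the authors' implementation (see the repository above for that). It assumes a frame-level GAT attention/adjacency matrix `attn` of shape (T, T) and per-frame feature vectors `feats` of shape (T, D); the function names, the cosine dissimilarity measure, the running-mean aggregation, and the thresholds are all illustrative placeholders.

```python
import numpy as np

def select_frames(attn, feats, k, dissim_thresh=0.2):
    """WiD-based sampling sketch: rank frames by weighted in-degree, then keep
    only frames sufficiently dissimilar (here: cosine) to those already kept."""
    # Weighted in-degree of each frame node; the axis depends on whether
    # attn[j, i] or attn[i, j] encodes the edge j -> i.
    wids = attn.sum(axis=0)
    order = np.argsort(-wids)  # most salient frames first
    normed = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    selected = []
    for i in order:
        if selected:
            # Dissimilarity to the closest already-selected frame.
            sim = normed[selected] @ normed[i]
            if 1.0 - sim.max() < dissim_thresh:
                continue  # too similar to a frame already kept; skip it
        selected.append(int(i))
        if len(selected) == k:
            break
    return selected

def gated_inference(frames, expensive_branch, classifier, conf_thresh=0.9):
    """Early-exit gating sketch: feed the selected frames one at a time to the
    costly bottom-up branch and stop once the prediction is confident enough."""
    pooled = None
    for t, frame in enumerate(frames, start=1):
        obj_feat = expensive_branch(frame)  # bottom-up (object) information
        pooled = obj_feat if pooled is None else pooled + obj_feat
        probs = classifier(pooled / t)      # classify the running mean
        if probs.max() >= conf_thresh:      # confident enough: exit early
            return probs, t                 # t = frames actually processed
    return probs, t
```

Under these assumptions, only the `t` frames consumed before the confidence threshold is crossed ever reach the expensive branch, which is the source of the complexity reduction reported in the abstract.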