MemX: 个性化动作自动抓获引人注意智能眼装系统 (MemX: An Attention-Aware Smart Eyewear System for Personalized Moment Auto-capture)

This work presents MemX: a biologically-inspired attention-aware eyewear system developed with the goal of pursuing the long-awaited vision of a personalized visual Memex. MemX captures human visual attention on the fly, analyzes the salient visual content, and records moments of personal interest in the form of compact video snippets. Accurate attentive scene detection and analysis on resource-constrained platforms is challenging because these tasks are computation and energy intensive. We propose a new temporal visual attention network that unifies human visual attention tracking and salient visual content analysis. Attention tracking focuses computation-intensive video analysis on salient regions, while video analysis makes human attention detection and tracking more accurate. Using the YouTube-VIS dataset and 30 participants, we experimentally show that MemX significantly improves the attention tracking accuracy over the eye-tracking-alone method, while maintaining high system energy efficiency. We have also conducted 11 in-field pilot studies across a range of daily usage scenarios, which demonstrate the feasibility and potential benefits of MemX.

翻译：这份工作展示了MemX:一个生物刺激的注意的眼罩系统,目的是追求期待已久的个人视觉化视觉Memex的视觉影像。MemX捕捉了人类在苍蝇上的视觉关注,分析了突出的视觉内容,记录了个人以紧凑的视频片段形式感兴趣的时刻。对资源紧缺的平台进行仔细的现场探测和分析具有挑战性,因为这些任务是计算和能源密集型的。我们提议建立一个新的时间视觉关注网络,将人类视觉关注跟踪和突出的视觉内容分析统一起来。关注跟踪将计算密集的视频分析集中在显著区域,而视频分析则使人类注意力的探测和跟踪更加精确。我们利用YouTube-VIS数据集和30名参与者实验性地表明,MemX在保持高系统能效的同时,大大提高了对眼睛跟踪单独方法的准确性的注意力跟踪。我们还在一系列日常使用设想中进行了11次实地试点研究,展示了MemX的可行性和潜在效益。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【南洋理工Xavier】图神经网络架构的最新进展，Graph Network Architectures，附80页ppt

专知会员服务

74+阅读 · 2020年11月6日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【论文翻译】NLP注意力机制综述论文翻译，Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

专知会员服务

96+阅读 · 2020年4月18日