NeuriCam: Key-Frame Video Super-Resolution and Colorization for IoT Cameras - 专知论文


Video super-resolution · Video SR · Super-resolution · SR · Key frame

April 13, 2023

NeuriCam: Key-Frame Video Super-Resolution and Colorization for IoT Cameras


Bandhav Veluri, Collin Pernu, Ali Saffari, Joshua Smith, Michael Taylor, Shyamnath Gollakota

From arXiv; MobiCom 2023 camera-ready

We present NeuriCam, a novel deep learning-based system for video capture from low-power dual-mode IoT camera systems. Our idea is to design a dual-mode camera system where the first mode is low-power (1.1 mW) but outputs only greyscale, low-resolution, noisy video, and the second mode consumes much higher power (100 mW) but outputs color, higher-resolution images. To reduce total energy consumption, we heavily duty-cycle the high-power mode so that it outputs an image only once every second. The data from this camera system is then sent wirelessly to a nearby plugged-in gateway, where we run our real-time neural network decoder to reconstruct a higher-resolution color video. To achieve this, we introduce an attention feature filter mechanism that assigns different weights to different features, based on the correlation between the feature map and the contents of the input frame at each spatial location. We design a wireless hardware prototype using off-the-shelf cameras and address practical issues including packet loss and perspective mismatch. Our evaluations show that our dual-camera approach reduces energy consumption by 7x compared to existing systems. Further, our model achieves an average greyscale PSNR gain of 3.7 dB over prior single- and dual-camera video super-resolution methods and a 5.6 dB RGB gain over prior color propagation methods. Open-source code: https://github.com/vb000/NeuriCam.
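The effect of duty-cycling the high-power mode can be illustrated with a little arithmetic. The sketch below is not taken from the paper: it assumes a simple model in which the high-power mode is active for a fixed time in each period and the low-power mode either runs continuously or pauses while the high-power mode is on. The function name and the `high_active_s`, `period_s`, and `low_always_on` parameters are hypothetical, and the abstract does not state how long each high-power capture takes.

```python
def average_power_mw(low_mw, high_mw, high_active_s, period_s=1.0,
                     low_always_on=True):
    """Average power (mW) of a duty-cycled dual-mode camera.

    Assumed model (not from the paper): the high-power mode is active for
    `high_active_s` seconds out of every `period_s` seconds.
    """
    if period_s <= 0:
        raise ValueError("period_s must be positive")
    if not 0.0 <= high_active_s <= period_s:
        raise ValueError("high_active_s must lie within one period")

    # Fraction of each period spent in the high-power mode.
    duty = high_active_s / period_s

    # Assumption: the low-power mode either runs all the time or is
    # switched off while the high-power mode is active.
    low_share = 1.0 if low_always_on else 1.0 - duty

    return low_mw * low_share + high_mw * duty
```

With the 1.1 mW and 100 mW figures from the abstract and a purely hypothetical 50 ms capture time per second, `average_power_mw(1.1, 100.0, 0.05)` gives about 6.1 mW. That number only illustrates how the duty cycle dominates the total; it is not a result reported by the authors.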

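The abstract reports its quality gains in PSNR (3.7 dB greyscale, 5.6 dB RGB). As a reminder of what those numbers mean, here is a minimal sketch of the standard PSNR definition in plain Python. The helper names `psnr` and `mse_ratio_for_gain` are illustrative only, and the 8-bit peak value of 255 is an assumption; the authors' evaluation code may differ.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """PSNR in dB between two equal-length sequences of pixel values."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")

    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals

    return 10.0 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain (same peak value)."""
    return 10.0 ** (gain_db / 10.0)
```

Because PSNR is logarithmic in the mean squared error, a 3.7 dB gain corresponds to roughly 2.3x lower MSE and a 5.6 dB gain to roughly 3.6x lower MSE, which can be checked with `mse_ratio_for_gain(3.7)` and `mse_ratio_for_gain(5.6)`.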

