F-CAD: 用于编码器Avotar解码的探索硬件加速器框架 (F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding)

Creating virtual avatars with realistic rendering is one of the most essential and challenging tasks to provide highly immersive virtual reality (VR) experiences. It requires not only sophisticated deep neural network (DNN) based codec avatar decoders to ensure high visual quality and precise motion expression, but also efficient hardware accelerators to guarantee smooth real-time rendering using lightweight edge devices, like untethered VR headsets. Existing hardware accelerators, however, fail to deliver sufficient performance and efficiency targeting such decoders which consist of multi-branch DNNs and require demanding compute and memory resources. To address these problems, we propose an automation framework, called F-CAD (Facebook Codec avatar Accelerator Design), to explore and deliver optimized hardware accelerators for codec avatar decoding. Novel technologies include 1) a new accelerator architecture to efficiently handle multi-branch DNNs; 2) a multi-branch dynamic design space to enable fine-grained architecture configurations; and 3) an efficient architecture search for picking the optimized hardware design based on both application-specific demands and hardware resource constraints. To the best of our knowledge, F-CAD is the first automation tool that supports the whole design flow of hardware acceleration of codec avatar decoders, allowing joint optimization on decoder designs in popular machine learning frameworks and corresponding customized accelerator design with cycle-accurate evaluation. Results show that the accelerators generated by F-CAD can deliver up to 122.1 frames per second (FPS) and 91.6% hardware efficiency when running the latest codec avatar decoder. Compared to the state-of-the-art designs, F-CAD achieves 4.0X and 2.8X higher throughput, 62.5% and 21.2% higher efficiency than DNNBuilder and HybridDNN by targeting the same hardware device.

翻译：创建具有现实效果的虚拟变异器是最重要的和最具挑战性的任务之一。它不仅需要精密的深层神经网络(DNN)基于codc avatar 解码器以确保高视觉质量和精确运动表达式, 还需要高效的硬件加速器来保证使用轻度边缘设备(如未节奏的VR头饰)进行平稳实时转换。但是,现有的硬件加速器无法提供足够高的性能和效率, 以这些解码器为目标, 这些解码器由多分支 DNNP组成, 需要高要求的编译和记忆资源。为了解决这些问题, 我们提议了一个自动化框架, 叫做FC( Facebook Codeder avader acational), 探索并提供最优化的硬件加速器, 像不动的Vatartreator Dalder de daddaddoration。诺尔技术包括1) 一个新的加速器结构, 以高效的解码器加速式结构设计空间, 以精细的配置为基础, 节制的硬化的硬化的硬件设计框架, 运行的FC daldealdeal deadd dad dead dad dad dadd dadd dadd dadd dadd dal dede dead dede dede dede deaddal dede dede dede dede dede dead dad dal deal deal dede dede dede dex a 。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【Google】利用AUTOML实现加速感知神经网络设计

专知会员服务

30+阅读 · 2020年3月5日

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日