Inspired by the recent development of deep network-based methods in semantic image segmentation, we introduce an end-to-end trainable model for face mask extraction in video sequences. Compared to landmark-based sparse face shape representations, our method can produce segmentation masks of individual facial components, which better reflect their detailed shape variations. By integrating the Convolutional LSTM (ConvLSTM) algorithm with Fully Convolutional Networks (FCN), our ConvLSTM-FCN model works on a per-sequence basis and takes advantage of the temporal correlation within video clips. In addition, we propose a novel loss function, called Segmentation Loss, to directly optimise the Intersection over Union (IoU) performance. In practice, to further increase segmentation accuracy, one primary model and two additional models are trained to focus on the face, eyes, and mouth regions, respectively. Our experiments show that the proposed method achieves a 16.99% relative improvement (from 54.50% to 63.76% mean IoU) over the baseline FCN model on the 300 Videos in the Wild (300VW) dataset.
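As a rough illustration of the recurrent building block referenced above, the sketch below shows a generic ConvLSTM cell, in which the LSTM gates are computed with 2-D convolutions so that the spatial layout of the FCN feature maps is preserved across time steps. This is a minimal sketch of the standard ConvLSTM formulation, not the exact architecture of the proposed ConvLSTM-FCN model; the class name, parameter names, and the use of PyTorch are illustrative assumptions.

```python
import torch
import torch.nn as nn


class ConvLSTMCell(nn.Module):
    """Minimal ConvLSTM cell (sketch): LSTM gates computed with 2-D
    convolutions over feature maps instead of fully connected layers."""

    def __init__(self, in_channels, hidden_channels, kernel_size=3):
        super().__init__()
        self.hidden_channels = hidden_channels
        # One convolution produces all four gates (input, forget, output, candidate).
        self.gates = nn.Conv2d(in_channels + hidden_channels,
                               4 * hidden_channels,
                               kernel_size,
                               padding=kernel_size // 2)

    def forward(self, x, state):
        # x: (N, C_in, H, W) feature map for the current frame
        # state: tuple (h, c), each of shape (N, C_hidden, H, W)
        h, c = state
        i, f, o, g = torch.chunk(self.gates(torch.cat([x, h], dim=1)), 4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        g = torch.tanh(g)
        c = f * c + i * g          # update cell state
        h = o * torch.tanh(c)      # new hidden state, same spatial size as input
        return h, c
```

In a ConvLSTM-FCN arrangement such a cell would typically be placed on top of the FCN's per-frame feature maps, so that the segmentation head sees features aggregated over preceding frames of the sequence.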
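The abstract does not give the exact form of the proposed Segmentation Loss; a common differentiable surrogate that directly targets IoU is the soft-IoU loss sketched below. The function name and the PyTorch framework are illustrative assumptions, not the paper's implementation.

```python
import torch


def soft_iou_loss(pred, target, eps=1e-6):
    """Differentiable soft-IoU loss (sketch).

    pred:   (N, C, H, W) per-class probabilities (e.g. after softmax)
    target: (N, C, H, W) one-hot ground-truth masks
    """
    dims = (0, 2, 3)  # sum over batch and spatial dimensions, keep classes
    intersection = (pred * target).sum(dim=dims)
    union = (pred + target - pred * target).sum(dim=dims)
    iou = (intersection + eps) / (union + eps)
    # Minimising (1 - mean IoU) pushes the predicted masks toward higher overlap.
    return 1.0 - iou.mean()
```

Because the loss is computed from soft probabilities rather than hard labels, gradients flow through every pixel, which is what makes it possible to optimise an IoU-style criterion directly during training.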