Visible-infrared cross modality person re-identification (VI-ReId) is an important task for video surveillance in poorly illuminated or dark environments. Despite many recent studies on person re-identification in visible domain (ReId), there are few studies dealing with VI-ReId. Besides challenges that are common for both ReId and VI-ReId such as pose/illumination variations, background clutter and occlusion, VI-ReId has additional challenges as color information is not available in infrared images. As a result, the performance of VI-ReId systems is typically lower than ReId systems. In this work, we propose a 4-stream framework to improve VI-ReId performance. We train a separate deep convolutional neural network in each stream using different representations of input images. We expect that different and complementary features can be learned from each stream. In our framework, grayscale and infrared input images are used to train the ResNet in the first stream. In the second stream, RGB and 3-channel infrared images (created by repeating infrared channel) are used. In the remaining two streams, we use local pattern maps as input images. These maps are generated utilizing local Zernike moments transformation. Local pattern maps are obtained from grayscale and infrared images in the 3rd stream and from RGB and 3-channel infrared images in the last stream. We improve the performance of the proposed framework by employing a re-ranking algorithm for post processing. Our results indicate that the proposed framework outperforms current state-of-the-art on SYSU-MM01 dataset with large margin by improving Rank-1/mAP by 34.2%/37.9% and 37.4%/34.8% under all-search and indoor-search modes, respectively.
翻译:可见红外红外线的人重新定位(VI-ReId)是光度差或黑暗环境中视频监控的一项重要任务。尽管最近对可见域(ReId)的人重新定位进行了许多研究,但关于VI-ReId的研究很少。除了ReId和VI-ReId常见的挑战外,ReId和VI-ReId都存在一些共同的挑战,例如成像/发光变异、背景模糊和隐蔽,VI-ReId在红红外图像中无法提供彩色信息,因此,VI-ReId在视频中是一项重要任务。因此,VI-ReId系统的表现通常低于ReId系统。在这项工作中,我们提议了一个四流框架来改进VI-ReId的重新定位。我们建议了一个四流框架。RIdrial-lad Sliver 4的性能。我们用不同的表达式图像来在每条流里程中,我们用灰度和红外图像来在第一个流中培训ResNet。在第二流中,RGB 3 和3 红外图像(由不断红红外的) 格式中,我们用最新版本的图像的 Rmar-ral 3 的图像在最后的图像中,我们使用两个方向上,我们使用。