以视觉和运动因子探测跟踪物体 (Object Tracking by Detection with Visual and Motion Cues)

Self-driving cars and other autonomous vehicles need to detect and track objects in camera images. We present a simple online tracking algorithm that is based on a constant velocity motion model with a Kalman filter, and an assignment heuristic. The assignment heuristic relies on four metrics: An embedding vector that describes the appearance of objects and can be used to re-identify them, a displacement vector that describes the object movement between two consecutive video frames, the Mahalanobis distance between the Kalman filter states and the new detections, and a class distance. These metrics are combined with a linear SVM, and then the assignment problem is solved by the Hungarian algorithm. We also propose an efficient CNN architecture that estimates these metrics. Our multi-frame model accepts two consecutive video frames which are processed individually in the backbone, and then optical flow is estimated on the resulting feature maps. This allows the network heads to estimate the displacement vectors. We evaluate our approach on the challenging BDD100K tracking dataset. Our multi-frame model achieves a good MOTA value of 39.1% with low localization error of 0.206 in MOTP. Our fast single-frame model achieves an even lower localization error of 0.202 in MOTP, and a MOTA value of 36.8%.

翻译：自驾驶汽车和其他自主飞行器需要用摄像图像探测和跟踪天体。我们展示了一个简单的在线跟踪算法, 以使用卡尔曼过滤器的恒定速度运动模型为基础, 以及一个任务超常。任务超常依赖四度 : 嵌入矢量, 描述天体的外观, 并可用于重新识别天体, 描述两个连续的视频框之间的天体移动, 马哈拉诺比斯在卡尔曼过滤器状态和新检测之间的距离, 以及一个阶级距离。这些度量与线性 SVM 相结合, 然后通过匈牙利算法解决任务问题。我们还提出了一个高效的CNN结构, 用于估算这些天体。我们的多框架模型接受两个连续的视频框, 描述天体外物体的外观, 然后根据生成的地貌图来估计光流。我们的网络头可以对具有挑战性的 BDD100K 跟踪数据集的方法进行评估。我们的多框架模型取得了39.1% 的MOTA值是好的, 在MOTP中, 0. 0. 0. 0. 0. 0. 0. 8 和 MTA 的的快速一框架模型, 的的的 10 10 的的 10- 1 的的的模型的的的的的的的的 0. 1 0. 1 0. 1 值的的的 0. 1 0. 1 0. 1 0. 1 0. 1 的的的 0. 1 0. 1 0. 1 0. 1 0. 1 0. 1 0. 1 的 0. 1 0. 1 的 0. 1 0. 1 0. 1 0. 1 的的的的 0. 1 的 0. 1 0. 1 的的的的的的的的的的的 0. 1 的的 0. 1 0. 1 0. 1 的的 0. 1 0. 1 0. 1 的 0. 1 的 0. 1 的的的的的的的的的的 0. 1 0. 1 的的 0. 1 0. 1 0. 1 0. 1 的 0. 1 的