基于 YOLO 探测器的硬性实时自动许可证板识别 (A Robust Real-Time Automatic License Plate Recognition based on the YOLO Detector)

Automatic License Plate Recognition (ALPR) has been a frequent topic of research due to many practical applications. However, many of the current solutions are still not robust in real-world situations, commonly depending on many constraints. This paper presents a robust and efficient ALPR system based on the state-of-the-art YOLO object detection. The Convolutional Neural Networks (CNNs) are trained and fine-tuned for each ALPR stage so that they are robust under different conditions (e.g., variations in camera, lighting, and background). Specially for character segmentation and recognition, we design a two-stage approach employing simple data augmentation tricks such as inverted License Plates (LPs) and flipped characters. The resulting ALPR approach achieved impressive results in two datasets. First, in the SSIG dataset, composed of 2,000 frames from 101 vehicle videos, our system achieved a recognition rate of 93.53% and 47 Frames Per Second (FPS), performing better than both Sighthound and OpenALPR commercial systems (89.80% and 93.03%, respectively) and considerably outperforming previous results (81.80%). Second, targeting a more realistic scenario, we introduce a larger public dataset, called UFPR-ALPR dataset, designed to ALPR. This dataset contains 150 videos and 4,500 frames captured when both camera and vehicles are moving and also contains different types of vehicles (cars, motorcycles, buses and trucks). In our proposed dataset, the trial versions of commercial systems achieved recognition rates below 70%. On the other hand, our system performed better, with recognition rate of 78.33% and 35 FPS.

翻译：由于许多实际应用,自动牌板识别(ALPR)一直是经常研究的主题。然而,许多当前解决方案在现实世界中仍然不够健全,通常取决于许多限制因素。本文件展示了基于最先进的YOLO物体检测的强大有效的ALPR系统。Cultural NealNetwork(CNNs)为每个ALPR阶段进行了培训和微调,以便在不同的条件下(例如相机、照明和背景的变异),这些变异性读取是一个经常的研究主题。特别是,在字符分割和识别方面,我们设计了两阶段的方法,使用简单的数据增强技巧,例如反转的牌牌和翻转的字符。本文介绍了基于最先进的YOLO物体检测的强大和高效的ALPR系统。首先,在由101个车辆视频视频的2 000个框架组成的SSIG数据集中,我们的系统实现了93.53%和47个框架(FPS)的确认率,比Sighound和OprealPR的商业系统(89.80%和93.03 %)要使用简单的数据,在现实化的FLPLVV的版本中,第二个版本的数据(8180)比我们更接近的版本,另一个版本的数据,另一个版本的版本为我们更接近的版本。