End-to-end text-spotting, which aims to integrate detection and recognition in a unified framework, has attracted increasing attention due to its simplicity of the two complimentary tasks. It remains an open problem especially when processing arbitrarily-shaped text instances. Previous methods can be roughly categorized into two groups: character-based and segmentation-based, which often require character-level annotations and/or complex post-processing due to the unstructured output. Here, we tackle end-to-end text spotting by presenting Adaptive Bezier Curve Network v2 (ABCNet v2). Our main contributions are four-fold: 1) For the first time, we adaptively fit arbitrarily-shaped text by a parameterized Bezier curve, which, compared with segmentation-based methods, can not only provide structured output but also controllable representation. 2) We design a novel BezierAlign layer for extracting accurate convolution features of a text instance of arbitrary shapes, significantly improving the precision of recognition over previous methods. 3) Different from previous methods, which often suffer from complex post-processing and sensitive hyper-parameters, our ABCNet v2 maintains a simple pipeline with the only post-processing non-maximum suppression (NMS). 4) As the performance of text recognition closely depends on feature alignment, ABCNet v2 further adopts a simple yet effective coordinate convolution to encode the position of the convolutional filters, which leads to a considerable improvement with negligible computation overhead. Comprehensive experiments conducted on various bilingual (English and Chinese) benchmark datasets demonstrate that ABCNet v2 can achieve state-of-the-art performance while maintaining very high efficiency.
翻译:端对端文本点点,目的是将检测和识别纳入一个统一的框架中,由于这两类辅助任务的简单性而引起越来越多的注意。它仍然是一个开放的问题,特别是在处理任意形状的文本时。以前的方法可以大致分为两类:基于字符的分解和基于分解的文本点,由于产出结构化,常常需要字符级说明和/或复杂的后处理。这里,我们通过展示适应性贝塞尔曲线网络 v2(ABCNet v.2)来处理端对端文本点点。 与以往的方法不同,我们的主要贡献是四倍:1)第一次,我们通过一个参数化的贝塞尔曲线适应了任意型文本。 与基于分解的方法相比,前者不仅可以提供结构化的输出,而且可以控制性表示。 2 我们设计了一个全新的贝齐尔图层图层层图,以提取任意形状的准确的进化特征,大大改进了以往方法的准确度。 3)与以往的方法不同,我们的主要贡献是复杂的后处理和敏感的超常量度计,我们ABC网络的文本通过一个简单的直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直径直的计算。