Digitizing engineering diagrams like Piping and Instrumentation Diagrams (P&IDs) plays a vital role in maintainability and operational efficiency of process and hydraulic systems. Previous methods typically decompose the task into separate steps such as symbol detection and line detection, which can limit their ability to capture the structure in these diagrams. In this work, a transformer-based approach leveraging the Relationformer that addresses this limitation by jointly extracting symbols and their interconnections from P&IDs is introduced. To evaluate our approach and compare it to a modular digitization approach, we present the first publicly accessible benchmark dataset for P&ID digitization, annotated with graph-level ground truth. Experimental results on real-world diagrams show that our method significantly outperforms the modular baseline, achieving over 25% improvement in edge detection accuracy. This research contributes a reproducible evaluation framework and demonstrates the effectiveness of transformer models for structural understanding of complex engineering diagrams. The dataset is available under https://zenodo.org/records/14803338.
翻译:数字化如管道与仪表图(P&IDs)等工程图表对于流程与液压系统的可维护性与运行效率至关重要。以往的方法通常将该任务分解为符号检测与线条检测等独立步骤,这可能限制其捕捉此类图表结构的能力。本研究引入了一种基于Transformer的方法,该方法利用Relationformer通过从P&IDs中联合提取符号及其相互连接来解决此局限性。为评估我们的方法并与模块化数字化方法进行比较,我们提出了首个公开可访问的P&ID数字化基准数据集,该数据集标注有图级真实值。在实际工程图上的实验结果表明,我们的方法显著优于模块化基线,在边检测准确率上实现了超过25%的提升。本研究贡献了一个可复现的评估框架,并证明了Transformer模型在复杂工程图表结构理解方面的有效性。数据集可通过 https://zenodo.org/records/14803338 获取。