The existence of metallic implants in projection images for cone-beam computed tomography (CBCT) introduces undesired artifacts which degrade the quality of reconstructed images. In order to reduce metal artifacts, projection inpainting is an essential step in many metal artifact reduction algorithms. In this work, a hybrid network combining the shift window (Swin) vision transformer (ViT) and a convolutional neural network is proposed as a baseline network for the inpainting task. To incorporate metal information for the Swin ViT-based encoder, metal-conscious self-embedding and neighborhood-embedding methods are investigated. Both methods have improved the performance of the baseline network. Furthermore, by choosing appropriate window size, the model with neighborhood-embedding could achieve the lowest mean absolute error of 0.079 in metal regions and the highest peak signal-to-noise ratio of 42.346 in CBCT projections. At the end, the efficiency of metal-conscious embedding on both simulated and real cadaver CBCT data has been demonstrated, where the inpainting capability of the baseline network has been enhanced.
翻译:为了减少金属制品,投影油漆是许多金属制品减少算法中的一个必要步骤。在这项工作中,提议将移动窗口(Swin)视觉变压器(VIT)和共生神经网络相结合的混合网络作为油漆任务的基准网络。为了将金属信息纳入基于Swin ViT的编码器、注意到金属的自我编造和周边编造方法,对这两种方法都进行了调查,从而改善了基线网络的性能。此外,通过选择适当的窗口大小,集成区模型可以达到金属区域0.079最低的绝对误差,而CBCT预测中最高的信号-噪声比率为42 346。在最后,金属意识嵌入模拟和真实的CBCT数据的效率得到了证明,从而加强了基线网络的油漆能力。