Extracting digital material representations from images is a necessary prerequisite for a quantitative analysis of material properties. Different segmentation approaches have been extensively studied in the past to achieve this task, but were often lacking accuracy or speed. With the advent of machine learning, supervised convolutional neural networks (CNNs) have achieved state-of-the-art performance for different segmentation tasks. However, these models are often trained in a supervised manner, which requires large labeled datasets. Unsupervised approaches do not require ground-truth data for learning, but suffer from long segmentation times and often worse segmentation accuracy. Hidden Markov Random Fields (HMRF) are an unsupervised segmentation approach that incorporates concepts of neighborhood and class distributions. We present a method that integrates HMRF theory and CNN segmentation, leveraging the advantages of both areas: unsupervised learning and fast segmentation times. We investigate the contribution of different neighborhood terms and components for the unsupervised HMRF loss. We demonstrate that the HMRF-UNet enables high segmentation accuracy without ground truth on a Micro-Computed Tomography ($μ$CT) image dataset of Polyurethane (PU) foam structures. Finally, we propose and demonstrate a pre-training strategy that considerably reduces the required amount of ground-truth data when training a segmentation model.
翻译:从图像中提取数字材料表征是定量分析材料性能的必要前提。为实现这一目标,过去已广泛研究了多种分割方法,但常存在精度或速度不足的问题。随着机器学习的发展,监督式卷积神经网络(CNN)已在不同分割任务中达到最先进的性能。然而,这些模型通常以监督方式训练,需要大量标注数据集。无监督方法无需真实标注数据即可学习,但存在分割时间长且分割精度通常较差的问题。隐马尔可夫随机场(HMRF)是一种结合邻域与类别分布概念的无监督分割方法。本文提出一种整合HMRF理论与CNN分割的方法,充分发挥两领域的优势:无监督学习与快速分割。我们研究了不同邻域项及组件对无监督HMRF损失的贡献。实验证明,在聚氨酯(PU)泡沫结构的微计算机断层扫描(μCT)图像数据集上,HMRF-UNet无需真实标注即可实现高精度分割。最后,我们提出并验证了一种预训练策略,可显著降低训练分割模型时所需的真实标注数据量。