基于深度学习的树木横截面图像髓心自动检测 (Automatic Pith Detection in Tree Cross-Section Images Using Deep Learning)

Pith detection in tree cross-sections is essential for forestry and wood quality analysis but remains a manual, error-prone task. This study evaluates deep learning models -- YOLOv9, U-Net, Swin Transformer, DeepLabV3, and Mask R-CNN -- to automate the process efficiently. A dataset of 582 labeled images was dynamically augmented to improve generalization. Swin Transformer achieved the highest accuracy (0.94), excelling in fine segmentation. YOLOv9 performed well for bounding box detection but struggled with boundary precision. U-Net was effective for structured patterns, while DeepLabV3 captured multi-scale features with slight boundary imprecision. Mask R-CNN initially underperformed due to overlapping detections, but applying Non-Maximum Suppression (NMS) improved its IoU from 0.45 to 0.80. Generalizability was next tested using an oak dataset of 11 images from Oregon State University's Tree Ring Lab. Additionally, for exploratory analysis purposes, an additional dataset of 64 labeled tree cross-sections was used to train the worst-performing model to see if this would improve its performance generalizing to the unseen oak dataset. Key challenges included tensor mismatches and boundary inconsistencies, addressed through hyperparameter tuning and augmentation. Our results highlight deep learning's potential for tree cross-section pith detection, with model choice depending on dataset characteristics and application needs.

翻译：树木横截面髓心检测对于林业和木材质量分析至关重要，但目前仍依赖人工操作且易出错。本研究评估了YOLOv9、U-Net、Swin Transformer、DeepLabV3和Mask R-CNN等深度学习模型，以实现高效自动化检测。通过动态增强582张标注图像的数据集以提升泛化能力。Swin Transformer取得了最高精度（0.94），在精细分割任务中表现突出；YOLOv9在边界框检测中表现良好，但边界精度存在不足；U-Net对结构化纹理效果显著，而DeepLabV3能捕捉多尺度特征但存在轻微边界偏差。Mask R-CNN因重叠检测初始性能不佳，应用非极大值抑制（NMS）后其交并比从0.45提升至0.80。进一步使用俄勒冈州立大学树木年轮实验室提供的11张橡木数据集测试模型泛化性。此外，为探索性分析，采用64张标注树木横截面额外数据集训练性能最差的模型，以验证其对未见橡木数据集的泛化改进效果。研究通过超参数调优和数据增强解决了张量维度不匹配与边界不一致等关键挑战。结果表明深度学习在树木横截面髓心检测中具有显著潜力，模型选择需依据数据集特征与应用需求而定。

相关内容

数据集

关注 0

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

【CVPR 2022】基于双噪声标签的可见光-红外人再识别学习，Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification

专知会员服务

14+阅读 · 2022年3月28日

【ICLR 2020】基于组合的多关系图卷积网络 Composition-Based Multi-Relational Graph Convolutional Networks

专知会员服务

108+阅读 · 2020年3月29日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日