Early detection of lung cancer is critical for improvement of patient survival. To address the clinical need for efficacious treatments, genetically engineered mouse models (GEMM) have become integral in identifying and evaluating the molecular underpinnings of this complex disease that may be exploited as therapeutic targets. Assessment of GEMM tumor burden on histopathological sections performed by manual inspection is both time consuming and prone to subjective bias. Therefore, an interplay of needs and challenges exists for computer-aided diagnostic tools, for accurate and efficient analysis of these histopathology images. In this paper, we propose a simple machine learning approach called the graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E). Our method comprises four steps: 1) cascaded graph-based sparse PCA, 2) PCA binary hashing, 3) block-wise histograms, and 4) support vector machine (SVM) classification. In our proposed architecture, graph-based sparse PCA is employed to learn the filter banks of the multiple stages of a convolutional network. This is followed by PCA hashing and block histograms for indexing and pooling. The meaningful features extracted from this GS-PCA are then fed to an SVM classifier. We evaluate the performance of the proposed algorithm on H&E slides obtained from an inducible K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC) and show that our algorithm is efficient and provides improved detection accuracy compared to existing algorithms.
翻译:肺癌的早期检测对于改善患者生存至关重要。 为了满足对有效治疗的临床需求,基因工程鼠标模型(GEMM)已经成为确定和评估这一复杂疾病的分子根基的不可或缺的组成部分,可用作治疗目标。 人工检查对基因病理部分的GEM肿瘤负担的评估既耗时,又容易产生主观偏向。 因此,计算机辅助诊断工具存在需求和挑战的相互作用,以便准确和有效地分析这些组织病理学图像。 在本文中,我们提议了一种简单的机器学习方法,称为基于图形的零散主要成分分析(GS-PCA)网络,用于自动检测可能作为治疗目标的这种复杂疾病的分子根基。 通过手动检查对基因病理病理病理病理科部分的肿瘤负担进行评估,这四个步骤是:(1) 基于图表的分散病理病理病理学,2) 五氯苯二进量,3个基于块的直线图和4) 支持病媒机(SVMM) 的分类。 在我们拟议的结构中,基于图表的稀释甲醚精度精度分析库中, 将使用SAU-A值的精度的精度和直观网络的精度 展示,然后用SALM 的精度网络的精度显示,然后用SAA- s- 的精度显示的精度显示的精度显示的精度显示的精度进行。