Automated retinal disease diagnosis is vital given the rising prevalence of conditions such as diabetic retinopathy and macular degeneration. Conventional deep learning approaches require large annotated datasets, which are costly and often imbalanced across disease categories, limiting their reliability in practice. Few-shot learning (FSL) addresses this challenge by enabling models to generalize from only a few labeled samples per class. In this study,we propose a balanced few-shot episodic learning framework tailored to the Retinal Fundus Multi-Disease Image Dataset (RFMiD). Focusing on the ten most represented classes, which still show substantial imbalance between majority diseases (e.g., Diabetic Retinopathy, Macular Hole) and minority ones (e.g., Optic Disc Edema, Branch Retinal Vein Occlusion), our method integrates three key components: (i) balanced episodic sampling, ensuring equal participation of all classes in each 5-way 5-shot episode; (ii) targeted augmentation, including Contrast Limited Adaptive Histogram Equalization (CLAHE) and color/geometry transformations, to improve minority-class di- versity; and (iii) a ResNet-50 encoder pretrained on ImageNet, selected for its superior ability to capture fine-grained retinal features. Prototypes are computed in the embedding space and classification is performed with cosine similarity for improved stability. Trained on 100 episodes and evaluated on 1,000 test episodes, our framework achieves substantial accuracy gains and reduces bias toward majority classes, with notable improvements for underrepresented diseases. These results demonstrate that dataset-aware few-shot pipelines, combined with balanced sampling and CLAHE-enhanced preprocessing, can deliver more robust and clinically fair retinal disease diagnosis under data-constrained conditions.
翻译:鉴于糖尿病视网膜病变和黄斑变性等疾病的患病率不断上升,自动化视网膜疾病诊断至关重要。传统的深度学习方法需要大量标注数据集,这些数据集成本高昂且通常在疾病类别间存在不平衡,限制了其在实际应用中的可靠性。少样本学习(FSL)通过使模型仅从每个类别的少量标记样本中泛化,解决了这一挑战。在本研究中,我们提出了一种针对视网膜眼底多疾病图像数据集(RFMiD)定制的平衡少样本情景学习框架。聚焦于十个最具代表性的类别(这些类别在多数疾病(如糖尿病视网膜病变、黄斑裂孔)与少数疾病(如视盘水肿、分支视网膜静脉阻塞)之间仍存在显著不平衡),我们的方法整合了三个关键组件:(i)平衡情景采样,确保所有类别在每个5-way 5-shot情景中平等参与;(ii)针对性增强,包括对比度受限自适应直方图均衡化(CLAHE)和颜色/几何变换,以提高少数类别的多样性;以及(iii)在ImageNet上预训练的ResNet-50编码器,因其在捕捉细粒度视网膜特征方面的卓越能力而被选用。原型在嵌入空间中计算,分类使用余弦相似度进行以提高稳定性。在100个情景上训练并在1,000个测试情景上评估,我们的框架实现了显著的准确率提升,并减少了对多数类别的偏差,对代表性不足的疾病有显著改善。这些结果表明,结合平衡采样和CLAHE增强预处理的数据集感知少样本流程,可以在数据受限条件下提供更稳健且临床公平的视网膜疾病诊断。