正在学习不完全标签多例多重标签学习 (Active Learning in Incomplete Label Multiple Instance Multiple Label Learning)

In multiple instance multiple label learning, each sample, a bag, consists of multiple instances. To alleviate labeling complexity, each sample is associated with a set of bag-level labels leaving instances within the bag unlabeled. This setting is more convenient and natural for representing complicated objects, which have multiple semantic meanings. Compared to single instance labeling, this approach allows for labeling larger datasets at an equivalent labeling cost. However, for sufficiently large datasets, labeling all bags may become prohibitively costly. Active learning uses an iterative labeling and retraining approach aiming to provide reasonable classification performance using a small number of labeled samples. To our knowledge, only a few works in the area of active learning in the MIML setting are available. These approaches can provide practical solutions to reduce labeling cost but their efficacy remains unclear. In this paper, we propose a novel bag-class pair based approach for active learning in the MIML setting. Due to the partial availability of bag-level labels, we focus on the incomplete-label MIML setting for the proposed active learning approach. Our approach is based on a discriminative graphical model with efficient and exact inference. For the query process, we adapt active learning criteria to the novel bag-class pair selection strategy. Additionally, we introduce an online stochastic gradient descent algorithm to provide an efficient model update after each query. Numerical experiments on benchmark datasets illustrate the robustness of the proposed approach.

翻译：在多个实例的多标签学习中,每个样本,一个包包,由多个实例组成。为了减轻标签的复杂性,每个样本都与一组包级标签挂钩,在包内没有标签的包级标签中留下一些实例。这个设置更方便,更自然,代表复杂的对象,具有多种语义含义。与单例标签相比,这种方法允许以同等标签成本标出更大的数据集。然而,如果足够大的数据集,标记所有袋,标签可能变得过于昂贵。积极学习使用迭代级标签和再培训方法,目的是利用少量标签样本提供合理的分类性工作。据我们了解,只有少数在MIML设置的积极学习领域开展工作。这些方法可以提供实用的解决方案,降低标签成本,但效果仍然不清楚。在本文中,我们建议采用基于新包级配对配对方法,以便在MIML设置的设置中积极学习。由于部分存在袋级标签,我们侧重于拟议的积极学习方法的不完整的标签MIML设置。我们的方法基于一个带有高效和精确的动态排序的图表模型模型模型模型模型,我们提供了一种在线选择。

相关内容

主动学习

关注 240

主动学习是机器学习（更普遍的说是人工智能）的一个子领域，在统计学领域也叫查询学习、最优实验设计。“学习模块”和“选择策略”是主动学习算法的2个基本且重要的模块。主动学习是“一种学习方法，在这种方法中，学生会主动或体验性地参与学习过程，并且根据学生的参与程度，有不同程度的主动学习。” （Bonwell＆Eison 1991）Bonwell＆Eison（1991）指出：“学生除了被动地听课以外，还从事其他活动。” 在高等教育研究协会（ASHE）的一份报告中，作者讨论了各种促进主动学习的方法。他们引用了一些文献，这些文献表明学生不仅要做听，还必须做更多的事情才能学习。他们必须阅读，写作，讨论并参与解决问题。此过程涉及三个学习领域，即知识，技能和态度（KSA）。这种学习行为分类法可以被认为是“学习过程的目标”。特别是，学生必须从事诸如分析，综合和评估之类的高级思维任务。