Preferential Bayesian optimization allows optimization of objectives that are either expensive or difficult to measure directly, by relying on a minimal number of comparative evaluations done by a human expert. Generating candidate solutions for evaluation is also often expensive, but this cost is ignored by existing methods. We generalize preference-based optimization to explicitly account for production and evaluation costs with Consecutive Preferential Bayesian Optimization, reducing production cost by constraining comparisons to involve previously generated candidates. We also account for the perceptual ambiguity of the oracle providing the feedback by incorporating a Just-Noticeable Difference threshold into a probabilistic preference model to capture indifference to small utility differences. We adapt an information-theoretic acquisition strategy to this setting, selecting new configurations that are most informative about the unknown optimum under a preference model accounting for the perceptual ambiguity. We empirically demonstrate a notable increase in accuracy in setups with high production costs or with indifference feedback.
翻译:偏好贝叶斯优化通过依赖人类专家进行最小数量的比较评估,实现了对成本高昂或难以直接测量的目标函数的优化。生成用于评估的候选解通常也代价不菲,但现有方法忽略了这一成本。我们通过连续偏好贝叶斯优化,将基于偏好的优化推广至显式考虑生成与评估成本,通过将比较约束于先前生成的候选解来降低生成成本。同时,我们通过将恰可察觉差异阈值纳入概率偏好模型以捕捉对微小效用差异的无差异状态,从而考虑提供反馈的预言机的感知模糊性。我们针对该场景调整了一种信息论获取策略,在考虑感知模糊性的偏好模型下,选择对未知最优解最具信息量的新配置。实验结果表明,在高生成成本或存在无差异反馈的设置中,该方法显著提升了优化精度。