The Rashomon effect describes the phenomenon where multiple models trained on the same data produce identical predictions while differing in which features they rely on internally. This effect has been studied extensively in classification tasks, but not in sequential decision-making, where an agent learns a policy to achieve an objective by taking actions in an environment. In this paper, we translate the Rashomon effect to sequential decision-making. We define it as multiple policies that exhibit identical behavior, visiting the same states and selecting the same actions, while differing in their internal structure, such as feature attributions. Verifying identical behavior in sequential decision-making differs from classification. In classification, predictions can be directly compared to ground-truth labels. In sequential decision-making with stochastic transitions, the same policy may succeed or fail on any single trajectory due to randomness. We address this using formal verification methods that construct and compare the complete probabilistic behavior of each policy in the environment. Our experiments demonstrate that the Rashomon effect exists in sequential decision-making. We further show that ensembles constructed from the Rashomon set exhibit greater robustness to distribution shifts than individual policies. Additionally, permissive policies derived from the Rashomon set reduce computational requirements for verification while maintaining optimal performance.
翻译:罗生门效应描述了这样一种现象:在相同数据上训练的多个模型虽然内部依赖的特征不同,却产生完全相同的预测结果。该效应已在分类任务中得到广泛研究,但尚未在序列决策领域进行探讨——在序列决策中,智能体通过与环境交互学习达成目标的策略。本文将罗生门效应引入序列决策领域,将其定义为:多个策略在访问相同状态并选择相同动作、表现出完全一致行为的同时,其内部结构(如特征归因)却存在差异。验证序列决策中的行为一致性不同于分类任务:在分类中,预测结果可直接与真实标签比较;而在具有随机转移特性的序列决策中,由于随机性影响,同一策略在单条轨迹上可能成功也可能失败。我们通过形式化验证方法解决该问题,该方法构建并比较各策略在环境中的完整概率行为。实验证明罗生门效应确实存在于序列决策中。我们进一步发现,基于罗生门集合构建的集成策略相比单一策略对分布偏移具有更强的鲁棒性。此外,从罗生门集合衍生的宽松策略能在保持最优性能的同时,显著降低验证过程的计算需求。