The proliferation of synthetic facial imagery has intensified the need for robust Open-World DeepFake Attribution (OW-DFA), which aims to attribute both known and unknown forgeries using labeled data for known types and unlabeled data containing a mixture of known and novel types. However, existing OW-DFA methods face two critical limitations: 1) A confidence skew that leads to unreliable pseudo-labels for novel forgeries, resulting in biased training. 2) An unrealistic assumption that the number of unknown forgery types is known *a priori*. To address these challenges, we propose a Confidence-Aware Asymmetric Learning (CAL) framework, which adaptively balances model confidence across known and novel forgery types. CAL mainly consists of two components: Confidence-Aware Consistency Regularization (CCR) and Asymmetric Confidence Reinforcement (ACR). CCR mitigates pseudo-label bias by dynamically scaling sample losses based on normalized confidence, gradually shifting the training focus from high- to low-confidence samples. ACR complements this by separately calibrating confidence for known and novel classes through selective learning on high-confidence samples, guided by their confidence gap. Together, CCR and ACR form a mutually reinforcing loop that significantly improves the model's OW-DFA performance. Moreover, we introduce a Dynamic Prototype Pruning (DPP) strategy that automatically estimates the number of novel forgery types in a coarse-to-fine manner, removing the need for unrealistic prior assumptions and enhancing the scalability of our methods to real-world OW-DFA scenarios. Extensive experiments on the standard OW-DFA benchmark and a newly extended benchmark incorporating advanced manipulations demonstrate that CAL consistently outperforms previous methods, achieving new state-of-the-art performance on both known and novel forgery attribution.
翻译:合成人脸图像的激增强化了对鲁棒的开放世界深度伪造溯源的需求,该任务旨在利用已知伪造类型的标注数据以及包含已知与新型伪造类型的未标注数据,对已知和未知伪造进行溯源。然而,现有开放世界深度伪造溯源方法面临两个关键局限:1)置信度偏斜导致对新型伪造的伪标签不可靠,从而引发训练偏差;2)不切实际地假设未知伪造类型的数量是预先已知的。为应对这些挑战,我们提出了一种置信度感知非对称学习框架,该框架自适应地平衡模型对已知与新型伪造类型的置信度。该框架主要由两个组件构成:置信度感知一致性正则化与非对称置信度增强。置信度感知一致性正则化通过基于归一化置信度动态缩放样本损失,逐步将训练焦点从高置信度样本转移至低置信度样本,从而缓解伪标签偏差。非对称置信度增强则通过在高置信度样本上进行选择性学习,并依据其置信度差距分别校准已知类与新型类的置信度,以此作为补充。两者共同形成一个相互增强的循环,显著提升了模型的开放世界深度伪造溯源性能。此外,我们提出了一种动态原型剪枝策略,该策略以从粗到细的方式自动估计新型伪造类型的数量,消除了不切实际的先验假设需求,并增强了我们方法在现实世界开放世界深度伪造溯源场景中的可扩展性。在标准开放世界深度伪造溯源基准以及新扩展的包含高级操纵技术的基准上进行的大量实验表明,该框架持续优于先前方法,在已知与新型伪造溯源上均取得了新的最先进性能。