Evaluating the quality of synthetic data remains a persistent challenge in the Android malware domain due to instability and the lack of standardization among existing metrics. This work integrates into MalDataGen a Super-Metric that aggregates eight metrics across four fidelity dimensions, producing a single weighted score. Experiments involving ten generative models and five balanced datasets demonstrate that the Super-Metric is more stable and consistent than traditional metrics, exhibiting stronger correlations with the actual performance of classifiers.
翻译:在Android恶意软件领域,由于现有度量的不稳定性及缺乏标准化,评估合成数据的质量仍是一个持续存在的挑战。本研究在MalDataGen中集成了一种超度量,该超度量聚合了四个保真度维度上的八种度量指标,生成一个加权综合分数。涉及十种生成模型和五个平衡数据集的实验表明,超度量比传统度量更稳定、更一致,并且与分类器的实际性能表现出更强的相关性。