CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing

Deep learning-based code processing models have shown good performance on tasks such as method name prediction, program summarization, and comment generation. Despite this progress, deep learning models are often vulnerable to adversarial attacks, which can significantly threaten their robustness and generalizability by causing them to misclassify unexpected inputs. Many deep learning testing approaches have been proposed to address this issue; however, they mainly target applications in domains such as image, audio, and text analysis, and cannot be applied directly to neural models for code because of the unique properties of programs. In this paper, we propose CoCoFuzzing, a coverage-based fuzzing framework for testing deep learning-based code processing models. We first propose ten mutation operators that automatically generate valid, semantics-preserving source code examples as tests; we then propose a neuron coverage-based approach to guide test generation. We evaluate CoCoFuzzing on three state-of-the-art neural code models: NeuralCodeSum, CODE2SEQ, and CODE2VEC. Our experimental results demonstrate that CoCoFuzzing can generate valid, semantics-preserving source code examples for testing the robustness and generalizability of these models, and that it improves neuron coverage. Moreover, these tests can be used to improve the performance of the target neural code models through adversarial retraining.
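
The general idea of coverage-guided test generation described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the two mutation operators are hypothetical stand-ins (not the ten operators proposed in the paper), `toy_activations` is a hash-based placeholder for a real model's activated neurons, and the seed is assumed to be a straight-line Java-like method body whose only `return` is its last statement.

```python
import random
import zlib
from typing import Callable, List, Set, Tuple

Program = List[str]  # a method body as a list of Java-like statements


def insert_unused_variable(stmts: Program, rng: random.Random) -> Program:
    """Hypothetical operator: declare a local variable that is never read."""
    pos = rng.randrange(len(stmts))  # always before an existing statement
    decl = f"int unused_{len(stmts)} = {rng.randrange(100)};"
    return stmts[:pos] + [decl] + stmts[pos:]


def insert_dead_branch(stmts: Program, rng: random.Random) -> Program:
    """Hypothetical operator: add a branch whose body can never execute."""
    pos = rng.randrange(len(stmts))
    dead = f"if (false) {{ int dead_{len(stmts)} = 0; }}"
    return stmts[:pos] + [dead] + stmts[pos:]


def toy_activations(stmts: Program, n_neurons: int = 64) -> Set[int]:
    """Placeholder for a model: map each token to a 'neuron' id via CRC32."""
    tokens = " ".join(stmts).split()
    return {zlib.crc32(t.encode()) % n_neurons for t in tokens}


def fuzz(
    seed: Program,
    activations: Callable[[Program], Set[int]],
    operators: List[Callable[[Program, random.Random], Program]],
    max_iters: int,
    rng: random.Random,
) -> Tuple[List[Program], Set[int]]:
    """Keep a mutant only if it activates neurons not covered so far."""
    covered = set(activations(seed))
    current = list(seed)
    kept: List[Program] = []
    for _ in range(max_iters):
        candidate = rng.choice(operators)(current, rng)
        newly_covered = activations(candidate) - covered
        if newly_covered:
            covered |= newly_covered
            kept.append(candidate)
            current = candidate  # continue mutating from the kept test
    return kept, covered


OPERATORS = [insert_unused_variable, insert_dead_branch]

if __name__ == "__main__":
    seed_body = ["int s = a + b;", "return s;"]
    tests, coverage = fuzz(seed_body, toy_activations, OPERATORS, 30, random.Random(0))
    print(f"kept {len(tests)} mutants, {len(coverage)} toy neurons covered")
```

In a real setting, `toy_activations` would be replaced by a hook that records which neurons of the model under test exceed an activation threshold, and each kept mutant would be checked for whether the model's prediction changes relative to the seed.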
