代码变换:通过自我监督的深层学习和高性能计算机破除硅代码的语言 (CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing)

Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for understanding source code language to ease the software engineering process are under-researched. Simultaneously, the transformer model, especially its combination with transfer learning, has been proven to be a powerful technique for natural language processing tasks. These breakthroughs point out a promising direction for process source code and crack software engineering tasks. This paper describes CodeTrans - an encoder-decoder transformer model for tasks in the software engineering domain, that explores the effectiveness of encoder-decoder transformer models for six software engineering tasks, including thirteen sub-tasks. Moreover, we have investigated the effect of different training strategies, including single-task learning, transfer learning, multi-task learning, and multi-task learning with fine-tuning. CodeTrans outperforms the state-of-the-art models on all the tasks. To expedite future works in the software engineering domain, we have published our pre-trained models of CodeTrans. https://github.com/agemagician/CodeTrans

翻译：目前,越来越多的成熟的自然语言处理应用程序使得人们的生活更加方便。这些应用程序是由源代码――软件工程中的语言。然而,用于理解源代码语言以方便软件工程过程的应用程序研究不足。与此同时,变压器模型,特别是它与转移学习相结合,已证明是自然语言处理任务的有力技术。这些突破指出了处理源代码和破碎软件工程任务的有希望的方向。本文件描述了代码Trans――软件工程领域任务的一个编码-解码变压器变压器模型,它探索了包括13个子任务在内的6个软件工程任务的编码变压器模型的有效性。此外,我们研究了不同培训战略的影响,包括单任务学习、转移学习、多任务学习和通过微调进行多任务学习。代码超越了所有任务方面的最新技术模型。为了加快软件工程领域的未来工程,我们出版了我们事先培训过的代码Transer模型。 https://github.com/agegigicopician https://transycrodecoprigician/dogrationian。

相关内容

Engineering

关注 6

《工程》是中国工程院（CAE）于2015年推出的国际开放存取期刊。其目的是提供一个高水平的平台，传播和分享工程研发的前沿进展、当前主要研究成果和关键成果；报告工程科学的进展，讨论工程发展的热点、兴趣领域、挑战和前景，在工程中考虑人与环境的福祉和伦理道德，鼓励具有深远经济和社会意义的工程突破和创新，使之达到国际先进水平，成为新的生产力，从而改变世界，造福人类，创造新的未来。期刊链接：https://www.sciencedirect.com/journal/engineering

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

129+阅读 · 2021年6月16日