面向存储受限应用的GPU性能预测模型和通信优化关键技术研究

项目名称： 面向存储受限应用的GPU性能预测模型和通信优化关键技术研究

项目编号： No.61502509

项目类型： 青年科学基金项目

立项/批准年度： 2016

项目学科： 自动化技术、计算机技术

项目作者： 苏华友

作者单位： 中国人民解放军国防科技大学

项目金额： 21万元

中文摘要： GPU通用计算技术给高性能计算领域带来了前所未有的机遇和挑战。目前GPU已经在生物信息学、计算金融、机器学习、国防和医学成像等领域得到广泛的应用。基于GPU的异构计算是未来高性能计算发展的主流方向之一。然而，随着GPU的计算能力和峰值带宽之间的差距越来越大，程序的性能更多的由GPU各个存储层次之间的数据移动能力决定。然而，要开发高效的GPU程序还面临着诸多问题，其中以GPU程序的性能优化空间巨大、工作繁杂以及基于GPU的大规模并行计算可扩展性差两个方面最为突出。本课题提出面向存储受限应用的GPU性能分析预测模型和通信优化关键技术研究，从数据传输的角度建立GPU程序的性能预测模拟，通过模型驱动的方式简化GPU程序的优化，结合多流多线程并行机制、负载均衡、计算与通信重叠等技术隐藏节点之间通信开销，提高GPU之间的数据传输效率和大规模并行程序的可扩展性。

中文关键词： 图形处理器；存储受限应用；性能预测模型；通信优化

英文摘要： General purpose GPU computing technology has greatly promoted the development of the field of high performance computing. At the same time, it also poses some challenges to this domain. Currently, GPU has been widely used in bioinformatics, computational finance, machine learning, defense, medical imaging and other areas. GPU-based heterogeneous computing is one of the major development trends of the future high-performance computing. However, with the gap between the GPU computational power and peak bandwidth is growing, the GPU program's performance is mainly determined by the ability of moving data between memory hierarchies of GPU. In order to develop efficient GPU program, programmers should address several problems. First, for single GPU, the performance optimization space is very huge. It is very difficult to locate the performance bottleneck. Second, for GPU-based massively parallel computing, the program’s scalability is not well good due to low efficient communication method. This project aims to perform key techniques research on GPU performance model and communication optimization targeted at memory bound applications. Our purpose is to establish a GPU performance analysis model from the angle of data traffic for memory bound applications. Through model-driven approach, it can simplify the process of GPU program optimization. In addition, by using multi-stream and multi-threaded parallel mechanisms, combining with hybrid programming technologies, we are aiming to implement the overlap between computing and communication, which can improve the scalability of GPU-based massively computation.

英文关键词： GPU ;Memory bound applications;Performance model; Communication optimization

成为VIP会员查看完整内容