Accel-Sim is a widely used computer architecture simulator that models the behavior of modern NVIDIA GPUs in great detail. However, although Accel-Sim and the underlying GPGPU-Sim model many of the features of real GPUs, thus far it has not been able to track statistics separately per stream. Instead, Accel-Sim combines statistics (e.g., cycles and cache hits/misses) across all simultaneously running streams. This can prevent users from properly identifying the behavior of specific kernels and streams and potentially lead to incorrect conclusions. Thus, in this work we extend Accel-Sim's and GPGPU-Sim's statistic tracking support to track per-stream statistics. To validate this support, we designed a series of multi-stream microbenchmarks and checked their reported per-kernel, per-stream counts.
翻译:----
研究论文摘要:Accel-Sim是广泛使用的计算机架构模拟器,以很高的精度模拟了现代NVIDIA GPU的行为。然而,虽然Accel-Sim和基础的GPGPU-Sim模拟了实际GPU的许多特性,但迄今为止还未能单独跟踪每个流的统计信息。相反,Accel-Sim将所有同时运行的流的统计数据(例如,循环计数和缓存命中/失误数)合并在一起。这可能会阻碍用户正确识别特定核函数和流的行为,有可能导致错误的结论。因此,在本研究中,我们扩展了Accel-Sim和GPGPU-Sim的统计跟踪的支持,以跟踪每个流的统计数据。为了验证支持性能,我们设计了一系列多流微基准测试,并检查它们报告的每个核函数、每个流的计数。