Neural schedulers based on deep reinforcement learning (DRL) have shown considerable potential for solving real-world resource allocation problems, as they have demonstrated significant performance gain in the domain of cluster computing. In this paper, we investigate the feasibility of neural schedulers for the domain of System-on-Chip (SoC) resource allocation through extensive experiments and comparison with non-neural, heuristic schedulers. The key finding is three-fold. First, neural schedulers designed for cluster computing domain do not work well for SoC due to i) heterogeneity of SoC computing resources and ii) variable action set caused by randomness in incoming jobs. Second, our novel neural scheduler technique, Eclectic Interaction Matching (EIM), overcomes the above challenges, thus significantly improving the existing neural schedulers. Specifically, we rationalize the underlying reasons behind the performance gain by the EIM-based neural scheduler. Third, we discover that the ratio of the average processing elements (PE) switching delay and the average PE computation time significantly impacts the performance of neural SoC schedulers even with EIM. Consequently, future neural SoC scheduler design must consider this metric as well as its implementation overhead for practical utility.
翻译:基于深度强化学习(DRL)的神经调度仪表显示,在解决实际世界资源分配问题方面具有相当大的潜力,因为这些仪表显示,集集计算领域取得了显著的绩效收益。在本文件中,我们通过广泛的实验和与非神经、脉冲调度仪表的比较,调查了系统-芯片(SoC)领域资源配置神经调度仪的可行性。关键结论是三重。首先,为集束计算领域设计的神经调度仪表对于SoC来说效果不佳,因为i) SoC计算资源的异质性化和ii)由即将到来的工作随机性造成的可变动作。第二,我们新的神经调度仪技术(ECIM)克服了上述挑战,从而大大改进了现有的神经调度仪。具体地说,我们把基于EIM的神经调度仪的性能收益背后的根本原因合理化。第三,我们发现平均处理要素(PE)的转换延迟率和平均PEE计算时间对神经调度员的性能影响很大,即使与EIM的随机性工作也是如此。因此,未来的神经调度仪表必须考虑实际设计,作为实际的顶级设计。