通过深入自我转让加强学习实现 " 节省 " 的动态速率分配 (Cache-Enabled Dynamic Rate Allocation via Deep Self-Transfer Reinforcement Learning)

Caching and rate allocation are two promising approaches to support video streaming over wireless network. However, existing rate allocation designs do not fully exploit the advantages of the two approaches. This paper investigates the problem of cache-enabled QoE-driven video rate allocation problem. We establish a mathematical model for this problem, and point out that it is difficult to solve the problem with traditional dynamic programming. Then we propose a deep reinforcement learning approaches to solve it. First, we model the problem as a Markov decision problem. Then we present a deep Q-learning algorithm with a special knowledge transfer process to find out effective allocation policy. Finally, numerical results are given to demonstrate that the proposed solution can effectively maintain high-quality user experience of mobile user moving among small cells. We also investigate the impact of configuration of critical parameters on the performance of our algorithm.

翻译：缓存和费率分配是支持无线网络视频流的两种有希望的办法。然而,现有的费率分配设计并没有充分利用这两种办法的优势。本文调查了缓存驱动的QoE驱动的视频率分配问题。我们为此问题建立了一个数学模型,并指出很难用传统的动态编程来解决问题。然后我们提出一种深入强化的学习方法来解决这个问题。首先,我们把这个问题作为Markov决定问题来模拟。然后,我们提出了一个深层次的Q-学习算法,并有一个特别的知识转移程序来找出有效的分配政策。最后,提供了数字结果,以证明拟议的解决办法能够有效地保持小细胞移动用户的高质量用户经验。我们还调查关键参数的配置对我们算法绩效的影响。