Mobile edge computing (MEC) is regarded as a promising wireless access architecture for alleviating the intensive computation burden on resource-limited mobile terminals (MTs). Allowing MTs to offload part of their tasks to MEC servers can significantly reduce task processing delay. In this study, to minimize the processing delay of a multi-user MEC system, we jointly optimize the local content splitting ratio, the transmission/computation power allocation, and the MEC server selection in a dynamic environment with time-varying task arrivals and wireless channels. Reinforcement learning (RL) is employed to address the considered problem. Two deep RL strategies, namely the deep Q-learning network (DQN) and the deep deterministic policy gradient (DDPG), are proposed to efficiently learn the offloading policies in an adaptive manner. The proposed DQN strategy treats the MEC server selection as its only action and obtains the remaining variables via a convex optimization approach, whereas the DDPG strategy takes all dynamic variables as actions. Numerical results demonstrate that both proposed strategies outperform existing schemes, and that the DDPG strategy is superior to the DQN strategy because it learns all variables online, albeit at relatively higher computational complexity.
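To make the contrast between the two action spaces concrete, the following is a minimal sketch, not the authors' implementation: all names, bounds, dimensions, and the convex_subsolver hook are illustrative assumptions. It shows how a DQN-style agent would learn only the discrete server index and delegate the continuous variables to a convex subproblem, while a DDPG-style actor would emit the full action vector directly.

```python
# Illustrative sketch of the two action structures described in the abstract.
# NUM_SERVERS, P_TX_MAX, and P_CPU_MAX are assumed placeholder constants.
from dataclasses import dataclass
import numpy as np

NUM_SERVERS = 3      # assumed number of candidate MEC servers
P_TX_MAX = 1.0       # assumed transmit-power budget
P_CPU_MAX = 1.0      # assumed local computation-power budget

@dataclass
class OffloadDecision:
    split_ratio: float   # fraction of the task kept for local computing, in [0, 1]
    p_tx: float          # transmit power for offloading, in [0, P_TX_MAX]
    p_cpu: float         # local computation power, in [0, P_CPU_MAX]
    server: int          # index of the selected MEC server

def dqn_style_decision(q_values: np.ndarray, convex_subsolver) -> OffloadDecision:
    """DQN strategy: only the discrete server selection is a learned action;
    the continuous variables come from a convex subproblem solver (hypothetical
    hook passed in by the caller)."""
    server = int(np.argmax(q_values))               # greedy over learned Q-values
    split_ratio, p_tx, p_cpu = convex_subsolver(server)
    return OffloadDecision(split_ratio, p_tx, p_cpu, server)

def ddpg_style_decision(actor_output: np.ndarray) -> OffloadDecision:
    """DDPG strategy: the actor outputs the full action vector; continuous
    outputs are squashed into their feasible ranges, and the server choice
    is recovered by discretizing one output dimension."""
    a = 1.0 / (1.0 + np.exp(-actor_output))         # squash raw outputs to (0, 1)
    return OffloadDecision(
        split_ratio=float(a[0]),
        p_tx=float(a[1] * P_TX_MAX),
        p_cpu=float(a[2] * P_CPU_MAX),
        server=min(int(a[3] * NUM_SERVERS), NUM_SERVERS - 1),
    )

# Example usage with arbitrary raw actor outputs:
decision = ddpg_style_decision(np.array([0.3, -0.5, 1.2, 0.8]))
print(decision)
```

This also reflects the complexity trade-off noted in the abstract: the DDPG-style agent must learn the entire joint action online, whereas the DQN-style agent only searches over the discrete server set and offloads the continuous optimization to a per-step convex solve.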