智能学习的网络安全智能代理的统一仿真训练环境 (Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents) - 专知论文

会员服务 ·

0

智能代理 · 高保真 · 全集成 · 脱机 · 网络安全 ·

2023 年 4 月 3 日

Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents

翻译：智能学习的网络安全智能代理的统一仿真训练环境

Li Li,Jean-Pierre S. El Rami,Adrian Taylor,James Hailing Rao,Thomas Kunz

from arxiv, To be published in the Proceedings of the 5th International Conference on Machine Learning for Networking (MLN'2022)

Autonomous cyber agents may be developed by applying reinforcement and deep reinforcement learning (RL/DRL), where agents are trained in a representative environment. The training environment must simulate with high-fidelity the network Cyber Operations (CyOp) that the agent aims to explore. Given the complexity of net-work CyOps, a good simulator is difficult to achieve. This work presents a systematic solution to automatically generate a high-fidelity simulator in the Cyber Gym for Intelligent Learning (CyGIL). Through representation learning and continuous learning, CyGIL provides a unified CyOp training environment where an emulated CyGIL-E automatically generates a simulated CyGIL-S. The simulator generation is integrated with the agent training process to further reduce the required agent training time. The agent trained in CyGIL-S is transferrable directly to CyGIL-E showing full transferability to the emulated "real" network. Experimental results are presented to demonstrate the CyGIL training performance. Enabling offline RL, the CyGIL solution presents a promising direction towards sim-to-real for leveraging RL agents in real-world cyber networks.

翻译：智能安全代理可以通过应用强化学习和深度强化学习（RL/DRL）开发，其中代理在代表性环境中进行训练。训练环境必须具有高保真度来模拟代理试图探索的网络Cyber Operations （CyOp）。由于网络CyOps的复杂性，很难实现一个好的仿真器。本文提出了一种系统性的解决方案，在智能学习的Cyber Gym中自动生成高保真度的仿真器。通过表示学习和连续学习，CyGIL提供了一个统一的CyOp训练环境，在模拟的CyGIL-S的基础上自动生成了模拟的CyGIL-E。仿真器生成与代理训练过程集成，进一步降低了所需的代理训练时间。在CyGIL-S中训练的代理直接可转移到CyGIL-E，完全集成实际网络的转移性。实验结果展示了CyGIL的训练性能。通过支持脱机RL，CyGIL解决方案向实现利用RL代理来管理实际网络提供了有前途的方向。

0

相关内容

智能代理

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

专知会员服务

79+阅读 · 2022年12月11日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

232+阅读 · 2022年4月10日

【干货书】创建和部署深度学习应用，Programming PyTorch for Deep Learning Creating and Deploying Deep Learning Applications

【干货书】创建和部署深度学习应用，Programming PyTorch for Deep Learning Creating and Deploying Deep Learning Applications

专知会员服务

133+阅读 · 2022年3月17日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

54+阅读 · 2021年4月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

专知

16+阅读 · 2020年12月9日

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

干货｜深度强化学习在面向任务的对话管理中的应用

干货｜深度强化学习在面向任务的对话管理中的应用

全球人工智能

13+阅读 · 2017年9月14日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

41+阅读 · 2015年12月31日

云计算环境信任链系统安全性理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

云计算环境下移动Agent系统信任安全关键技术研究

国家自然科学基金

2+阅读 · 2014年12月31日

无线传感器网络恶劣环境下可持续性通信的研究

国家自然科学基金

3+阅读 · 2013年12月31日

褐煤O2/CO2燃烧Cr的氧化机理及模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

情景驱动的机会发现关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于交互式动态影响图的未知对手模型学习

国家自然科学基金

3+阅读 · 2012年12月31日

可信工作流管理系统的软件机理与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于信息表示与传导机制的异质agent计算金融模型

国家自然科学基金

0+阅读 · 2011年12月31日

多体量子系统的有限时间解纠缠及其对量子信息过程的影响

国家自然科学基金

0+阅读 · 2009年12月31日

Recent Advancements in Deep Learning Applications and Methods for Autonomous Navigation: A Comprehensive Review

Arxiv

0+阅读 · 2023年5月23日

From Model-Based to Data-Driven Simulation: Challenges and Trends in Autonomous Driving

Arxiv

0+阅读 · 2023年5月23日

FEDORA: Flying Event Dataset fOr Reactive behAvior

Arxiv

0+阅读 · 2023年5月22日

Phased data augmentation for training PixelCNNs with VQ-VAE-2 and limited data

Arxiv

0+阅读 · 2023年5月22日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

101+阅读 · 2022年5月11日

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Arxiv

62+阅读 · 2021年10月25日

Building Intelligent Autonomous Navigation Agents

Arxiv

24+阅读 · 2021年6月25日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

VIP会员

文章信息

相关主题

相关VIP内容

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

【硬核书】深度强化学习实践手册：应用现代RL方法，包括深度Q网络、值迭代、策略梯度、TRPO、AlphaGo等，547页pdf

专知会员服务

79+阅读 · 2022年12月11日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

232+阅读 · 2022年4月10日

【干货书】创建和部署深度学习应用，Programming PyTorch for Deep Learning Creating and Deploying Deep Learning Applications

【干货书】创建和部署深度学习应用，Programming PyTorch for Deep Learning Creating and Deploying Deep Learning Applications

专知会员服务

133+阅读 · 2022年3月17日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

54+阅读 · 2021年4月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

最新，DeepSeek-R1论文登上Nature封面，附83页补充材料

人工智能与未来战争

自动驾驶中的轨迹预测大型基础模型：全面综述

万字长文《对抗雷达系统的电子战综述》

相关资讯

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

专知

16+阅读 · 2020年12月9日

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

干货｜深度强化学习在面向任务的对话管理中的应用

干货｜深度强化学习在面向任务的对话管理中的应用

全球人工智能

13+阅读 · 2017年9月14日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Recent Advancements in Deep Learning Applications and Methods for Autonomous Navigation: A Comprehensive Review

Arxiv

0+阅读 · 2023年5月23日

From Model-Based to Data-Driven Simulation: Challenges and Trends in Autonomous Driving

Arxiv

0+阅读 · 2023年5月23日

FEDORA: Flying Event Dataset fOr Reactive behAvior

Arxiv

0+阅读 · 2023年5月22日

Phased data augmentation for training PixelCNNs with VQ-VAE-2 and limited data

Arxiv

0+阅读 · 2023年5月22日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

101+阅读 · 2022年5月11日

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Arxiv

62+阅读 · 2021年10月25日

Building Intelligent Autonomous Navigation Agents

Arxiv

24+阅读 · 2021年6月25日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

相关基金

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

41+阅读 · 2015年12月31日

云计算环境信任链系统安全性理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

云计算环境下移动Agent系统信任安全关键技术研究

国家自然科学基金

2+阅读 · 2014年12月31日

无线传感器网络恶劣环境下可持续性通信的研究

国家自然科学基金

3+阅读 · 2013年12月31日

褐煤O2/CO2燃烧Cr的氧化机理及模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

情景驱动的机会发现关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于交互式动态影响图的未知对手模型学习

国家自然科学基金

3+阅读 · 2012年12月31日

可信工作流管理系统的软件机理与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于信息表示与传导机制的异质agent计算金融模型

国家自然科学基金

0+阅读 · 2011年12月31日

多体量子系统的有限时间解纠缠及其对量子信息过程的影响

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员