网络AI健身房：自主网络安全代理的实现 (Enabling A Network AI Gym for Autonomous Cyber Agents) - 专知论文

会员服务 ·

0

网络仿真 · 脱机 · 网络安全 · 高保真 · AI ·

2023 年 4 月 3 日

Enabling A Network AI Gym for Autonomous Cyber Agents

翻译：网络AI健身房：自主网络安全代理的实现

Li Li,Jean-Pierre S. El Rami,Adrian Taylor,James Hailing Rao,Thomas Kunz

from arxiv, To appear in Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence

This work aims to enable autonomous agents for network cyber operations (CyOps) by applying reinforcement and deep reinforcement learning (RL/DRL). The required RL training environment is particularly challenging, as it must balance the need for high-fidelity, best achieved through real network emulation, with the need for running large numbers of training episodes, best achieved using simulation. A unified training environment, namely the Cyber Gym for Intelligent Learning (CyGIL) is developed where an emulated CyGIL-E automatically generates a simulated CyGIL-S. From preliminary experimental results, CyGIL-S is capable to train agents in minutes compared with the days required in CyGIL-E. The agents trained in CyGIL-S are transferrable directly to CyGIL-E showing full decision proficiency in the emulated "real" network. Enabling offline RL, the CyGIL solution presents a promising direction towards sim-to-real for leveraging RL agents in real-world cyber networks.

翻译：本文旨在通过应用强化和深度强化学习（RL/DRL）为网络的CyOps操作实现自主代理。所需的RL训练环境具有特殊挑战，因为它必须平衡需要高保真度的需求，最好通过真实网络仿真实现，以及需要运行大量训练剧集的需求，最好使用模拟实现。开发了统一的训练环境，即智能学习的网络Cyber Gym（CyGIL），其中仿真的CyGIL-E自动生成了模拟的CyGIL-S。从初步的实验结果来看，CyGIL-S能够在几分钟内训练代理，而在CyGIL-E中需要数天的训练时间。在CyGIL-S中训练的代理可以直接转移到CyGIL-E中，展示在仿真的“真实”网络中的完全决策能力。通过实现脱机RL，CyGIL解决方案向利用现实世界网络中的RL代理提供了一个有前途的方向。

0

相关内容

网络仿真

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

系列教程GNN-algorithms之七：《图同构网络—GIN》

系列教程GNN-algorithms之七：《图同构网络—GIN》

专知会员服务

48+阅读 · 2020年8月9日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

谷歌开发者

0+阅读 · 2022年11月1日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

极市平台

12+阅读 · 2018年8月24日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

23+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

45+阅读 · 2015年12月31日

铸造高硼高速钢硼碳化物调控及其耐磨性研究

国家自然科学基金

0+阅读 · 2014年12月31日

非自由部署空间中无线传感器网络查询处理技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向ISM频段无线传感器网络的合作共存与优化技术

国家自然科学基金

0+阅读 · 2012年12月31日

基于用户偏好感知的SaaS服务选择优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属茂基聚合型阻燃抑烟剂的制备及其催化交联成炭机理

国家自然科学基金

0+阅读 · 2012年12月31日

社会依赖演化网调控的Agent服务协同自适应

国家自然科学基金

0+阅读 · 2012年12月31日

基于IEEE802.11n的长距离无线mesh网络理论与关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

Variable Grasp Pose and Commitment for Trajectory Optimization

Arxiv

0+阅读 · 2023年5月21日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月20日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer

Arxiv

0+阅读 · 2023年5月19日

Dive into the Power of Neuronal Heterogeneity

Arxiv

0+阅读 · 2023年5月19日

LMEye: An Interactive Perception Network for Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Collective Reasoning for Safe Autonomous Systems

Arxiv

0+阅读 · 2023年5月18日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

VIP会员

文章信息

相关主题

相关VIP内容

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

系列教程GNN-algorithms之七：《图同构网络—GIN》

系列教程GNN-algorithms之七：《图同构网络—GIN》

专知会员服务

48+阅读 · 2020年8月9日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【NTU博士论文】利用强化学习与生成模型推进可靠且可泛化的决策

美海军研发“增强侦察与态势评估系统（ARES）”应用程序以优化作战规划（附研究论文）

【NeurIPS2025】DNA-DetectLLM：基于 DNA 启发的“突变-修复”范式揭示 AI 生成文本

面向深度研究系统的强化学习基础：综述

相关资讯

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

谷歌开发者

0+阅读 · 2022年11月1日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

极市平台

12+阅读 · 2018年8月24日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Variable Grasp Pose and Commitment for Trajectory Optimization

Arxiv

0+阅读 · 2023年5月21日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月20日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer

Arxiv

0+阅读 · 2023年5月19日

Dive into the Power of Neuronal Heterogeneity

Arxiv

0+阅读 · 2023年5月19日

LMEye: An Interactive Perception Network for Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Collective Reasoning for Safe Autonomous Systems

Arxiv

0+阅读 · 2023年5月18日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

相关基金

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

23+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

45+阅读 · 2015年12月31日

铸造高硼高速钢硼碳化物调控及其耐磨性研究

国家自然科学基金

0+阅读 · 2014年12月31日

非自由部署空间中无线传感器网络查询处理技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向ISM频段无线传感器网络的合作共存与优化技术

国家自然科学基金

0+阅读 · 2012年12月31日

基于用户偏好感知的SaaS服务选择优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属茂基聚合型阻燃抑烟剂的制备及其催化交联成炭机理

国家自然科学基金

0+阅读 · 2012年12月31日

社会依赖演化网调控的Agent服务协同自适应

国家自然科学基金

0+阅读 · 2012年12月31日

基于IEEE802.11n的长距离无线mesh网络理论与关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员