自主粒子 (Autonomous particles) - 专知论文

会员服务 ·

0

INFORMS · Learning · 回合 · INTERACT · 不变 ·

2023 年 1 月 24 日

Autonomous particles

翻译：自主粒子

Nikola Andrejic,Vitaly Vanchurin

from arxiv, 15 pages, 3 figures

Consider a reinforcement learning problem where an agent has access to a very large amount of information about the environment, but it can only take very few actions to accomplish its task and to maximize its reward. Evidently, the main problem for the agent is to learn a map from a very high-dimensional space (which represents its environment) to a very low-dimensional space (which represents its actions). The high-to-low dimensional map implies that most of the information about the environment is irrelevant for the actions to be taken, and only a small fraction of information is relevant. In this paper we argue that the relevant information need not be learned by brute force (which is the standard approach), but can be identified from the intrinsic symmetries of the system. We analyze in details a reinforcement learning problem of autonomous driving, where the corresponding symmetry is the Galilean symmetry, and argue that the learning task can be accomplished with very few relevant parameters, or, more precisely, invariants. For a numerical demonstration, we show that the autonomous vehicles (which we call autonomous particles since they describe very primitive vehicles) need only four relevant invariants to learn how to drive very well without colliding with other particles. The simple model can be easily generalized to include different types of particles (e.g. for cars, for pedestrians, for buildings, for road signs, etc.) with different types of relevant invariants describing interactions between them. We also argue that there must exist a field theory description of the learning system where autonomous particles would be described by fermionic degrees of freedom and interactions mediated by the relevant invariants would be described by bosonic degrees of freedom.

翻译：高到低的地图意味着大部分环境信息对于要采取的行动无关紧要, 并且只有一小部分信息是相关的。在本文中, 我们争论说, 相关的信息不需要由粗力( 这是一种标准的方法) 来完成它的任务, 也可以从系统内在的对称中找出。我们分析的是从一个高度空间( 代表它的环境) 学习一张地图到一个非常低的维度空间( 代表它的行动 ) 。高到低的地图意味着, 有关环境的信息大多与要采取的行动无关, 并且只有一小部分信息是相关的。在本文中, 我们争论说, 相关的信息不需要通过粗力( 也就是标准的媒介) 来学习, 而是从系统内在的对称来识别。我们分析一个强化的自主性学习问题, 在那里, 对应的对 Galilean 的对称性进行对比, 并说, 学习任务可以用非常少的相关参数完成, 或者更精确的变异性。对于数字演示来说, 我们指出, 自主的车辆( 我们称之为自主粒子, 因为它们描述非常原始的飞行器), 只需要用四种相关的领域来描述, 和行进模型来解释。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

巯基保护的金纳米团簇的磁性调控的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

CSP-GNPs对宫颈癌的靶向放疗增敏作用及其机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

Ru催化双导向基团参与C-H键活化及官能化反应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向成像差异的高精度强适应SAR景象匹配算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

甲醇、水在金属掺杂的TiO2(110)表面微观尺度下的光化学表征

国家自然科学基金

1+阅读 · 2013年12月31日

Hg2CuTi型全Heusler合金表面与界面的半金属特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

微纳结构表面上润湿和电润湿动力学的跨尺度研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

Renin-Angiotensin System在介导机械通气所致肺微血管内皮细胞功能障碍中的作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

过渡金属及其合金团簇的稳定性和磁性研究

国家自然科学基金

0+阅读 · 2009年12月31日

An Autonomous System for Head-to-Head Race: Design, Implementation and Analysis; Team KAIST at the Indy Autonomous Challenge

Arxiv

0+阅读 · 2023年3月16日

SUAVE: An Exemplar for Self-Adaptive Underwater Vehicles

Arxiv

0+阅读 · 2023年3月16日

Learning When to Use Adaptive Adversarial Image Perturbations against Autonomous Vehicles

Arxiv

0+阅读 · 2023年3月15日

Fully neuromorphic vision and control for autonomous drone flight

Arxiv

0+阅读 · 2023年3月15日

Bayesian Learning for the Robust Verification of Autonomous Robots

Arxiv

0+阅读 · 2023年3月15日

Adaptive Planning and Control with Time-Varying Tire Models for Autonomous Racing Using Extreme Learning Machine

Arxiv

0+阅读 · 2023年3月14日

Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios

Arxiv

0+阅读 · 2023年3月13日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Arxiv

17+阅读 · 2022年5月10日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

VIP会员

文章信息

相关主题

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

An Autonomous System for Head-to-Head Race: Design, Implementation and Analysis; Team KAIST at the Indy Autonomous Challenge

Arxiv

0+阅读 · 2023年3月16日

SUAVE: An Exemplar for Self-Adaptive Underwater Vehicles

Arxiv

0+阅读 · 2023年3月16日

Learning When to Use Adaptive Adversarial Image Perturbations against Autonomous Vehicles

Arxiv

0+阅读 · 2023年3月15日

Fully neuromorphic vision and control for autonomous drone flight

Arxiv

0+阅读 · 2023年3月15日

Bayesian Learning for the Robust Verification of Autonomous Robots

Arxiv

0+阅读 · 2023年3月15日

Adaptive Planning and Control with Time-Varying Tire Models for Autonomous Racing Using Extreme Learning Machine

Arxiv

0+阅读 · 2023年3月14日

Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios

Arxiv

0+阅读 · 2023年3月13日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Arxiv

17+阅读 · 2022年5月10日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

相关基金

巯基保护的金纳米团簇的磁性调控的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

CSP-GNPs对宫颈癌的靶向放疗增敏作用及其机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

Ru催化双导向基团参与C-H键活化及官能化反应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向成像差异的高精度强适应SAR景象匹配算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

甲醇、水在金属掺杂的TiO2(110)表面微观尺度下的光化学表征

国家自然科学基金

1+阅读 · 2013年12月31日

Hg2CuTi型全Heusler合金表面与界面的半金属特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

微纳结构表面上润湿和电润湿动力学的跨尺度研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

Renin-Angiotensin System在介导机械通气所致肺微血管内皮细胞功能障碍中的作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

过渡金属及其合金团簇的稳定性和磁性研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员