Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization - Zhuanzhi Papers

Inference · Constraints · Demonstrations · Inverse Optimization · Robotics Applications

April 6, 2023

Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization

Dimitris Papadimitriou, Jingqi Li

Inferring unknown constraints is a challenging and crucial problem in many robotics applications. When only expert demonstrations are available, it becomes essential to infer the unknown domain constraints to deploy additional agents effectively. In this work, we propose an approach to infer affine constraints in control tasks after observing expert demonstrations. We formulate the constraint inference problem as an inverse optimization problem, and we propose an alternating optimization scheme that infers the unknown constraints by minimizing a KKT residual objective. We demonstrate the effectiveness of our method in a number of simulations, and show that our method can infer less conservative constraints than a recent baseline method while maintaining comparable safety guarantees.
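To make the idea of alternating minimization of a KKT residual concrete, here is a minimal toy sketch. It is not the authors' algorithm. It assumes a hypothetical forward problem in which each expert minimizes 0.5*||x - g||^2 toward a known goal g subject to a single unknown half-space constraint a @ x <= b. All function names, the cost, the data, and the initial guess are illustrative assumptions.

```python
import numpy as np

def project_onto_halfspace(goal, a, b):
    """Assumed expert model: argmin_x 0.5*||x - goal||^2 s.t. a @ x <= b, with ||a|| = 1."""
    violation = max(0.0, a @ goal - b)
    return goal - violation * a

def update_multipliers(a, b, xs, goals):
    """For fixed (a, b), minimize the KKT residual over each multiplier lam_i >= 0."""
    d = xs - goals                       # cost gradient at each demonstration
    slack = xs @ a - b                   # constraint value a @ x_i - b
    lam = -(d @ a) / (a @ a + slack**2)  # unconstrained scalar minimizer
    return np.maximum(lam, 0.0)          # clip to enforce dual feasibility

def update_constraint(lam, xs, goals):
    """For fixed multipliers, minimize the KKT residual over (a, b) by least squares."""
    n, dim = xs.shape
    rows, rhs = [], []
    for i in range(n):
        # Stationarity residual: (x_i - g_i) + lam_i * a
        rows.append(np.hstack([lam[i] * np.eye(dim), np.zeros((dim, 1))]))
        rhs.append(goals[i] - xs[i])
        # Complementary slackness residual: lam_i * (a @ x_i - b)
        rows.append(np.hstack([lam[i] * xs[i], [-lam[i]]])[None, :])
        rhs.append(np.zeros(1))
    theta, *_ = np.linalg.lstsq(np.vstack(rows), np.concatenate(rhs), rcond=None)
    a, b = theta[:dim], theta[dim]
    scale = np.linalg.norm(a)
    if scale < 1e-12:
        raise ValueError("degenerate update: no demonstration activates the constraint")
    # (lam, a, b) is only defined up to scale, so fix ||a|| = 1.
    return a / scale, b / scale

def infer_halfspace(xs, goals, a0, b0, iters=20):
    """Alternate between the multiplier update and the constraint update."""
    a, b = np.asarray(a0, dtype=float), float(b0)
    for _ in range(iters):
        lam = update_multipliers(a, b, xs, goals)
        a, b = update_constraint(lam, xs, goals)
    return a, b

# Synthetic demonstrations generated from a "true" constraint x[0] <= 1.
a_true, b_true = np.array([1.0, 0.0]), 1.0
goals = np.array([[2.0, 0.5], [1.5, -1.0], [3.0, 2.0], [0.2, 0.3]])
xs = np.array([project_onto_halfspace(g, a_true, b_true) for g in goals])

a_hat, b_hat = infer_halfspace(xs, goals, a0=[0.6, 0.8], b0=0.0)
print("inferred a:", a_hat, "inferred b:", b_hat)
```

On this noiseless toy data the iteration should settle on a close to (1, 0) and b close to 1. Only demonstrations where the constraint is active carry information in this residual, and the sketch does not enforce primal feasibility of the demonstrations explicitly, which a fuller method would need to handle.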
