适应人类感知运动系统噪音特征的反向最佳控制 (Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System) - 专知论文

会员服务 ·

0

优化器 · 噪声 · 代价函数 · 控制器 · 部分可观测马尔可夫决策过程 ·

2021 年 10 月 21 日

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

翻译：适应人类感知运动系统噪音特征的反向最佳控制

Matthias Schultheis,Dominik Straub,Constantin A. Rothkopf

from arxiv, 24 pages, 11 figures, to be published at NeurIPS 2021

Computational level explanations based on optimal feedback control with signal-dependent noise have been able to account for a vast array of phenomena in human sensorimotor behavior. However, commonly a cost function needs to be assumed for a task and the optimality of human behavior is evaluated by comparing observed and predicted trajectories. Here, we introduce inverse optimal control with signal-dependent noise, which allows inferring the cost function from observed behavior. To do so, we formalize the problem as a partially observable Markov decision process and distinguish between the agent's and the experimenter's inference problems. Specifically, we derive a probabilistic formulation of the evolution of states and belief states and an approximation to the propagation equation in the linear-quadratic Gaussian problem with signal-dependent noise. We extend the model to the case of partial observability of state variables from the point of view of the experimenter. We show the feasibility of the approach through validation on synthetic data and application to experimental data. Our approach enables recovering the costs and benefits implicit in human sequential sensorimotor behavior, thereby reconciling normative and descriptive approaches in a computational framework.

翻译：根据以信号为根据的噪音进行的最佳反馈控制得出的计算水平解释能够说明人类感官行为中的各种现象。然而,通常需要为一项任务承担成本功能,而人类行为的最佳性则通过比较观察到的和预测的轨迹进行评估。这里,我们采用以信号为根据的噪音进行反最佳控制,从而可以从观察到的行为推断出成本功能。为了做到这一点,我们把问题正式化为一种部分可见的Markov决策程序,并区分代理人和实验者的推断问题。具体地说,我们从国家和信仰状态的演变中得出一种概率性公式,并用信号为根据的噪音对线-赤道高斯问题的传播方程式进行近似。我们将模型扩大到从实验者的角度对部分可视性国家变量的情况。我们通过验证合成数据和应用实验数据来表明这种方法的可行性。我们的方法有助于恢复人类连续感官行为中隐含的成本和惠益,从而在计算框架中协调规范性和描述性方法。

0

相关内容

优化器

NeurIPS 20201接收论文列表发布，2334篇论文都在这了！

NeurIPS 20201接收论文列表发布，2334篇论文都在这了！

专知会员服务

38+阅读 · 2021年11月4日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Low-to-Zero-Overhead IRS Reconfiguration: Decoupling Illumination and Channel Estimation

Arxiv

0+阅读 · 2021年12月17日

Morphisms and minimization of weighted automata

Arxiv

0+阅读 · 2021年12月17日

Engineering and Implementation of SimAEN

Arxiv

0+阅读 · 2021年12月16日

Consistency of the maximum likelihood estimator in hidden Markov models with trends

Arxiv

0+阅读 · 2021年12月16日

Guaranteed Contraction Control in the Presence of Imperfectly Learned Dynamics

Arxiv

0+阅读 · 2021年12月15日

Channel Parameter Estimation in the Presence of Phase Noise Based on Maximum Correntropy Criterion

Arxiv

0+阅读 · 2021年12月15日

Error estimates for a pointwise tracking optimal control problem of a semilinear elliptic equation

Arxiv

0+阅读 · 2021年12月15日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

Human Interaction with Recommendation Systems

Arxiv

6+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

部分可观测马尔可夫决策过程

相关VIP内容

NeurIPS 20201接收论文列表发布，2334篇论文都在这了！

NeurIPS 20201接收论文列表发布，2334篇论文都在这了！

专知会员服务

38+阅读 · 2021年11月4日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【IJCAI2025教程】动态开放环境下的多模态生成式人工智能，90页ppt

美陆军备战网络作战空间：军队AI教育工具、战略网络游戏

【CMU博士论文】校准不确定性量化的方法及其效用解析

科学大语言模型综述：从数据基础到智能体前沿

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Low-to-Zero-Overhead IRS Reconfiguration: Decoupling Illumination and Channel Estimation

Arxiv

0+阅读 · 2021年12月17日

Morphisms and minimization of weighted automata

Arxiv

0+阅读 · 2021年12月17日

Engineering and Implementation of SimAEN

Arxiv

0+阅读 · 2021年12月16日

Consistency of the maximum likelihood estimator in hidden Markov models with trends

Arxiv

0+阅读 · 2021年12月16日

Guaranteed Contraction Control in the Presence of Imperfectly Learned Dynamics

Arxiv

0+阅读 · 2021年12月15日

Channel Parameter Estimation in the Presence of Phase Noise Based on Maximum Correntropy Criterion

Arxiv

0+阅读 · 2021年12月15日

Error estimates for a pointwise tracking optimal control problem of a semilinear elliptic equation

Arxiv

0+阅读 · 2021年12月15日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

Human Interaction with Recommendation Systems

Arxiv

6+阅读 · 2018年3月28日

微信扫码咨询专知VIP会员