A Data-Driven State Aggregation Approach for Dynamic Discrete Choice Models


Q-function · Discrete · Estimation error · Maximum likelihood estimation · Maximum likelihood

April 20, 2023


Sinong Geng,Houssam Nassif,Carlos A. Manzanares

We study dynamic discrete choice models, where a commonly studied problem involves estimating parameters of agent reward functions (also known as "structural" parameters), using agent behavioral data. Maximum likelihood estimation for such models requires dynamic programming, which is limited by the curse of dimensionality. In this work, we present a novel algorithm that provides a data-driven method for selecting and aggregating states, which lowers the computational and sample complexity of estimation. Our method works in two stages. In the first stage, we use a flexible inverse reinforcement learning approach to estimate agent Q-functions. We use these estimated Q-functions, along with a clustering algorithm, to select a subset of states that are the most pivotal for driving changes in Q-functions. In the second stage, with these selected "aggregated" states, we conduct maximum likelihood estimation using a commonly used nested fixed-point algorithm. The proposed two-stage approach mitigates the curse of dimensionality by reducing the problem dimension. Theoretically, we derive finite-sample bounds on the associated estimation error, which also characterize the trade-off of computational complexity, estimation error, and sample complexity. We demonstrate the empirical performance of the algorithm in two classic dynamic discrete choice estimation applications.
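The first stage described above, grouping states by their estimated Q-functions, can be illustrated with a small sketch. The abstract does not say which clustering algorithm or inverse reinforcement learning estimator the authors use, so this is only an assumption-laden illustration: plain k-means (Lloyd's algorithm) stands in for the clustering step, and the function names `aggregate_states` and `squared_distance`, the `q_hat_example` values, and the `observed` data are all hypothetical. It takes already-estimated Q-vectors, one per state, and returns an aggregated-state label for each original state.

```python
from typing import List, Sequence, Tuple


def squared_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Squared Euclidean distance between two Q-vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def aggregate_states(
    q_hat: Sequence[Sequence[float]], k: int, max_iter: int = 100
) -> Tuple[List[int], List[List[float]]]:
    """Group states whose estimated Q-vectors are close, using plain k-means.

    q_hat[s] holds the estimated Q-values of state s, one entry per action.
    k-means is an assumed stand-in; the paper's clustering step may differ.
    """
    n = len(q_hat)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of states")

    # Deterministic initialisation: spread the initial centres evenly over
    # the states sorted by their Q-vectors.
    order = sorted(range(n), key=lambda s: tuple(q_hat[s]))
    if k == 1:
        centers = [list(q_hat[order[0]])]
    else:
        centers = [list(q_hat[order[(i * (n - 1)) // (k - 1)]]) for i in range(k)]

    labels: List[int] = []
    for _ in range(max_iter):
        # Assignment step: each state joins its nearest centre.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(q, centers[c]))
            for q in q_hat
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centre becomes the mean Q-vector of its members.
        for c in range(k):
            members = [q_hat[s] for s in range(n) if labels[s] == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


# Hypothetical estimated Q-values for 6 states and 2 actions.
q_hat_example = [
    [0.0, 1.0], [0.1, 1.1], [0.05, 0.95],
    [5.0, 2.0], [5.2, 2.1], [4.9, 1.9],
]
labels, centers = aggregate_states(q_hat_example, k=2)

# Relabel hypothetical observed (state, action) pairs with aggregated states,
# which is the data a second-stage likelihood would be built on.
observed = [(0, 1), (3, 0), (4, 0), (2, 1)]
aggregated = [(labels[s], a) for s, a in observed]
```

In the two-stage approach, the second stage would then run maximum likelihood estimation with a nested fixed-point algorithm on the smaller aggregated state space. That stage is not sketched here because the abstract gives no detail on the reward parameterisation.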

