Remote state estimation, where many sensors send their measurements of distributed dynamic plants to a remote estimator over shared wireless resources, is essential for mission-critical applications of Industry 4.0. Most of the existing works on remote state estimation assumed orthogonal multiple access and the proposed dynamic radio resource allocation algorithms can only work for very small-scale settings. In this work, we consider a remote estimation system with non-orthogonal multiple access. We formulate a novel dynamic resource allocation problem for achieving the minimum overall long-term average estimation mean-square error. Both the estimation quality state and the channel quality state are taken into account for decision making at each time. The problem has a large hybrid discrete and continuous action space for joint channel assignment and power allocation. We propose a novel action-space compression method and develop an advanced deep reinforcement learning algorithm to solve the problem. Numerical results show that our algorithm solves the resource allocation problem effectively, presents much better scalability than the literature, and provides significant performance gain compared to some benchmarks.
翻译:许多传感器将分布式动态植物的测量结果送到一个共享无线资源远程估计器的远程状态估计,对于工业4.0.0的飞行任务关键应用至关重要。 大部分现有的远程状态估计假设正方形多重访问和拟议的动态无线电资源分配算法只能用于非常小规模的设置。 在这项工作中,我们考虑的是非正方形多重访问的远程估计系统。我们为达到最低总长期平均估计平均平均值平均差错而制定了一个新的动态资源分配问题。每次决策都考虑到估算质量和频道质量状况。在频道联合分配和电力分配方面存在着一个庞大的混合离散和连续行动空间。我们提出了一个新的行动空间压缩方法,并开发了一种先进的深度强化学习算法来解决问题。数字结果显示,我们的算法有效地解决了资源分配问题,比文献的可扩展性要好得多,并提供了与某些基准相比的显著绩效收益。