This work investigates a dynamic variant of Bayesian persuasion, in which a strategic sender seeks to influence a receiver's belief over time through controlling the timing of the information disclosure, under resource constraints. We consider a binary information source (i.e., taking values 0 or 1), where the source's state evolve according to a continuous-time Markov chain (CTMC). In this setting, the receiver aims to estimate the source's state as accurately as possible. In contrast, the sender seeks to persuade the receiver to estimate the state to be 1, regardless of whether this estimate reflects the true state. This misalignment between their objectives naturally leads to a Stackelberg game formulation where the sender, acting as the leader, chooses an information-revelation policy, and the receiver, as the follower, decides whether to follow the sender's messages. As a result, the sender's objective is to maximize the long-term average time that the receiver's estimate equals 1, subject to a total sampling constraint and a constraint for the receiver to follow the sender's messages called incentive compatibility (IC) constraint. We first consider the single-source problem and show that the sender's optimal policy is to allocate a minimal sampling rate to the undesired state 0 (just enough to satisfy the IC constraint) and assign the remaining sampling rate to the desired state 1. Next, we extend the analysis to the multi-source case, where each source has a different minimal sampling rate. Our results show that the sender can leverage the timeliness of the revealed information to influence the receiver, thereby achieving a higher utility.
翻译:本研究探讨了贝叶斯说服的一个动态变体,其中战略发送方在资源约束下,通过控制信息披露的时机,试图随时间影响接收方的信念。我们考虑一个二元信息源(即取值为0或1),其状态依据连续时间马尔可夫链(CTMC)演化。在此设定下,接收方旨在尽可能准确地估计信息源的状态。相反,发送方试图说服接收方将状态估计为1,无论该估计是否反映真实状态。双方目标的不一致自然导致了斯塔克尔伯格博弈的构建:发送方作为领导者选择信息揭示策略,接收方作为跟随者决定是否遵循发送方的消息。因此,发送方的目标是在满足总采样约束以及接收方遵循发送方消息的激励相容性(IC)约束条件下,最大化接收方估计值为1的长期平均时间。我们首先考虑单源问题,证明发送方的最优策略是将最小采样率分配给不期望的状态0(仅满足IC约束所需),并将剩余采样率分配给期望的状态1。接着,我们将分析扩展到多源情形,其中每个源具有不同的最小采样率。结果表明,发送方可利用所揭示信息的时效性来影响接收方,从而实现更高的效用。