We study the aggregate welfare and individual regret guarantees of dynamic \emph{pacing algorithms} in the context of repeated auctions with budgets. Such algorithms are commonly used as bidding agents in Internet advertising platforms, adaptively learning to shade bids by a tunable linear multiplier in order to match a specified budget. We show that when agents simultaneously apply a natural form of gradient-based pacing, the liquid welfare obtained over the course of the learning dynamics is at least half the optimal expected liquid welfare obtainable by any allocation rule. Crucially, this result holds \emph{without requiring convergence of the dynamics}, allowing us to circumvent known complexity-theoretic obstacles of finding equilibria. This result is also robust to the correlation structure between agent valuations and holds for any \emph{core auction}, a broad class of auctions that includes first-price, second-price, and generalized second-price auctions as special cases. For individual guarantees, we further show such pacing algorithms enjoy \emph{dynamic regret} bounds for individual utility- and value-maximization, with respect to the sequence of budget-pacing bids, for any auction satisfying a monotone bang-for-buck property. To complement our theoretical findings, we provide semi-synthetic numerical simulations based on auction data from the Bing Advertising platform.
翻译:本研究探讨了在预算约束下的重复拍卖场景中,动态调控算法的总体福利与个体遗憾保证。此类算法通常作为互联网广告平台中的竞价代理,通过可调线性乘数自适应学习调整出价,以实现指定预算的匹配。我们证明,当多个智能体同时采用基于梯度的自然调控策略时,学习动态过程中获得的流动福利至少能达到任何分配规则所能获得的最优预期流动福利的一半。关键的是,这一结论成立无需动态过程的收敛性要求,从而能够规避寻找均衡时已知的计算复杂性障碍。该结果对智能体估值间的相关性结构具有鲁棒性,且适用于所有核心拍卖——一类包含一价拍卖、二价拍卖和广义二价拍卖作为特例的广泛拍卖类别。在个体保证方面,我们进一步证明此类调控算法在满足单调性价比特性的任意拍卖中,针对预算调控出价序列具有个体效用最大化和价值最大化的动态遗憾界。为补充理论发现,我们基于必应广告平台的拍卖数据进行了半合成数值模拟。