利用深强化学习自动数据库管理案例 (The Case for Automatic Database Administration using Deep Reinforcement Learning)

Like any large software system, a full-fledged DBMS offers an overwhelming amount of configuration knobs. These range from static initialisation parameters like buffer sizes, degree of concurrency, or level of replication to complex runtime decisions like creating a secondary index on a particular column or reorganising the physical layout of the store. To simplify the configuration, industry grade DBMSs are usually shipped with various advisory tools, that provide recommendations for given workloads and machines. However, reality shows that the actual configuration, tuning, and maintenance is usually still done by a human administrator, relying on intuition and experience. Recent work on deep reinforcement learning has shown very promising results in solving problems, that require such a sense of intuition. For instance, it has been applied very successfully in learning how to play complicated games with enormous search spaces. Motivated by these achievements, in this work we explore how deep reinforcement learning can be used to administer a DBMS. First, we will describe how deep reinforcement learning can be used to automatically tune an arbitrary software system like a DBMS by defining a problem environment. Second, we showcase our concept of NoDBA at the concrete example of index selection and evaluate how well it recommends indexes for given workloads.

翻译：与任何大型软件系统一样,一个成熟的DBMS系统提供大量的配置按钮。从静态初始化参数,如缓冲大小、调值程度或复制程度等静态初始化参数到复杂的运行时间决定,如在特定栏目上创建二级索引或重组仓库的物理布局。为了简化配置,行业级DBMS系统通常用各种咨询工具装运,为特定工作量和机器提供建议。然而,现实表明,实际配置、调控和维护通常仍由人类管理员根据直觉和经验进行。最近进行的深层强化学习工作在解决问题方面显示出非常有希望的结果,需要这种直觉感。例如,在学习如何在巨大的搜索空间上玩复杂游戏方面非常成功。受这些成就的驱动,我们探索如何利用深度强化学习来管理 DBMS。首先,我们将描述如何利用深度强化学习来自动调整像DBMS系统这样的任意软件系统,通过界定问题环境来界定问题环境。第二,我们展示了我们关于诺DBA系统的概念,在索引选择工作量的具体例子中,并评估它是如何被推荐的。

相关内容

深度强化学习

关注 156

深度强化学习 (DRL) 是一种使用深度学习技术扩展传统强化学习方法的一种机器学习方法。传统强化学习方法的主要任务是使得主体根据从环境中获得的奖赏能够学习到最大化奖赏的行为。然而，传统无模型强化学习方法需要使用函数逼近技术使得主体能够学习出值函数或者策略。在这种情况下，深度学习强大的函数逼近能力自然成为了替代人工指定特征的最好手段并为性能更好的端到端学习的实现提供了可能。

强化学习的对比无监督表示，CURL: Contrastive Unsupervised Representations for Reinforcement Learning

专知会员服务

41+阅读 · 2020年4月11日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日