利用蒙特卡洛树搜索实现黑盒优化的学习搜索空间分割区 (Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search)

High dimensional black-box optimization has broad applications but remains a challenging problem to solve. Given a set of samples $\{\vx_i, y_i\}$, building a global model (like Bayesian Optimization (BO)) suffers from the curse of dimensionality in the high-dimensional search space, while a greedy search may lead to sub-optimality. By recursively splitting the search space into regions with high/low function values, recent works like LaNAS shows good performance in Neural Architecture Search (NAS), reducing the sample complexity empirically. In this paper, we coin LA-MCTS that extends LaNAS to other domains. Unlike previous approaches, LA-MCTS learns the partition of the search space using a few samples and their function values in an online fashion. While LaNAS uses linear partition and performs uniform sampling in each region, our LA-MCTS adopts a nonlinear decision boundary and learns a local model to pick good candidates. If the nonlinear partition function and the local model fits well with ground-truth black-box function, then good partitions and candidates can be reached with much fewer samples. LA-MCTS serves as a \emph{meta-algorithm} by using existing black-box optimizers (e.g., BO, TuRBO) as its local models, achieving strong performance in general black-box optimization and reinforcement learning benchmarks, in particular for high-dimensional problems.

翻译：高维黑盒优化具有广泛的应用,但仍是一个难以解决的难题。在一组样本中, 以 $vx_i, y_ i_ $ 建立全球模型( 如 Bayesian Optimination (BO) ) 在高维搜索空间中受到维度的诅咒, 而贪婪的搜索可能导致次优化。通过将搜索空间分解到高/低功能值区域, LaNAS 等最近的工作显示神经结构搜索(NAS) 的绩效良好, 从而从经验上降低样本的复杂程度。在本文中, 我们用LA- MCTS 将LA- MCTS 扩展到其它域。与以往的做法不同, LA- MCTS 以在线方式使用少数样本及其函数值来学习搜索空间的分布。虽然 LaNAS 使用线性分割法和在每个区域进行统一的取样, 我们的LA- MCTS 采用非线性决定边界分割功能和本地模型来挑选好的候选人。如果非线性分隔函数和本地模型与地基黑框的强黑框功能功能功能相匹配, 然后是好的分区和候选人, 以普通的升级, 以低级的样本达到特定的学习。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

80+阅读 · 2020年2月27日

深度强化学习策略梯度教程，53页ppt

专知会员服务

176+阅读 · 2020年2月1日

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

93+阅读 · 2019年12月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日