适用于分子设计的实际大规模平行的蒙特卡洛树搜索 (Practical Massively Parallel Monte-Carlo Tree Search Applied to Molecular Design) - 专知论文

会员服务 ·

0

PARCO · 缩放 · state-of-the-art · MoDELS · SimPLe ·

2020 年 10 月 13 日

Practical Massively Parallel Monte-Carlo Tree Search Applied to Molecular Design

翻译：适用于分子设计的实际大规模平行的蒙特卡洛树搜索

Xiufeng Yang,Tanuj Kr Aasawat,Kazuki Yoshizoe

It is common practice to use large computational resources to train neural networks, as is known from many examples, such as reinforcement learning applications. However, while massively parallel computing is often used for training models, it is rarely used for searching solutions for combinatorial optimization problems. In this paper, we propose a novel massively parallel Monte-Carlo Tree Search (MP-MCTS) algorithm that works efficiently for 1,000 worker scale, and apply it to molecular design. This is the first work that applies distributed MCTS to a real-world and non-game problem. Existing work on large-scale parallel MCTS show efficient scalability in terms of the number of rollouts up to 100 workers, but suffer from the degradation in the quality of the solutions. MP-MCTS maintains the search quality at larger scale, and by running MP-MCTS on 256 CPU cores for only 10 minutes, we obtained candidate molecules having similar score to non-parallel MCTS running for 42 hours. Moreover, our results based on parallel MCTS (combined with a simple RNN model) significantly outperforms existing state-of-the-art work. Our method is generic and is expected to speed up other applications of MCTS.

翻译：使用大量计算资源来培训神经网络是常见的做法,这一点从许多例子中可以知道,例如强化学习应用等许多例子。然而,虽然大量平行计算常常用于培训模型,但很少用于寻找组合优化问题的解决办法。在本文中,我们提议采用一个全新的大规模平行的蒙特-卡洛树搜索(MP-MCTS)算法,该算法对1,000名工人有效,并适用于分子设计。这是将MCTS应用于现实世界和非游戏问题的首次工作。大规模平行MCTS的现有工作显示,在向100名工人推出的数量方面是有效的,但因解决方案质量下降而受到影响。MP-MCTS在更大程度上保持了搜索质量,通过在256 CPU核心上只运行10分钟的MP-MCTS,我们获得了与运行42小时的非平行 MCTS相近分数的候选分子。此外,我们基于平行的MCTS(与简单的RNNMTS模型相结合)的现有工作结果大大超过现有状态MTS速度。我们所期望的其他方法是通用的,其他应用速度。

0

相关内容

PARCO

PARCO：Parallel Computing。 Explanation：并行计算。 Publisher：Elsevier。 SIT:http://dblp.uni-trier.de/db/conf/parco/

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

专知会员服务

51+阅读 · 2020年8月25日

【RLChina2020公开课】Lecture-11.pdf【多智能体学习与游戏AI前沿】

【RLChina2020公开课】Lecture-11.pdf【多智能体学习与游戏AI前沿】

专知会员服务

27+阅读 · 2020年8月6日

【WWW2020-UIUC】自动主题分类法构建，Automated Topic Taxonomy Construction

【WWW2020-UIUC】自动主题分类法构建，Automated Topic Taxonomy Construction

专知会员服务

40+阅读 · 2020年3月22日

【Google 大脑】使用上千个优化任务学习超参数搜索策略，Using a thousand optimization tasks to learn hyperparameter search strategies

【Google 大脑】使用上千个优化任务学习超参数搜索策略，Using a thousand optimization tasks to learn hyperparameter search strategies

专知会员服务

18+阅读 · 2020年3月14日

自动结构变分推理，Automatic structured variational inference

自动结构变分推理，Automatic structured variational inference

专知会员服务

40+阅读 · 2020年2月10日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

基于图的word2vec负采样( GNEG:Graph-Based Negative Sampling for word2vec)

基于图的word2vec负采样( GNEG:Graph-Based Negative Sampling for word2vec)

专知会员服务

40+阅读 · 2019年11月23日

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

专知会员服务

17+阅读 · 2019年11月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Piecewise-Stationary Off-Policy Optimization

Arxiv

0+阅读 · 2020年12月1日

Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning

Arxiv

0+阅读 · 2020年12月1日

Learning by Passing Tests, with Application to Neural Architecture Search

Arxiv

0+阅读 · 2020年11月30日

Optimally Supporting IoT with Cell-Free Massive MIMO

Arxiv

0+阅读 · 2020年11月30日

Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach

Arxiv

0+阅读 · 2020年11月30日

Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning

Arxiv

0+阅读 · 2020年11月29日

Scalable Deep-Learning-Accelerated Topology Optimization for Additively Manufactured Materials

Arxiv

0+阅读 · 2020年11月28日

Kernel methods through the roof: handling billions of points efficiently

Arxiv

0+阅读 · 2020年11月26日

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge

Arxiv

0+阅读 · 2020年11月25日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

专知会员服务

51+阅读 · 2020年8月25日

【RLChina2020公开课】Lecture-11.pdf【多智能体学习与游戏AI前沿】

【RLChina2020公开课】Lecture-11.pdf【多智能体学习与游戏AI前沿】

专知会员服务

27+阅读 · 2020年8月6日

【WWW2020-UIUC】自动主题分类法构建，Automated Topic Taxonomy Construction

【WWW2020-UIUC】自动主题分类法构建，Automated Topic Taxonomy Construction

专知会员服务

40+阅读 · 2020年3月22日

【Google 大脑】使用上千个优化任务学习超参数搜索策略，Using a thousand optimization tasks to learn hyperparameter search strategies

【Google 大脑】使用上千个优化任务学习超参数搜索策略，Using a thousand optimization tasks to learn hyperparameter search strategies

专知会员服务

18+阅读 · 2020年3月14日

自动结构变分推理，Automatic structured variational inference

自动结构变分推理，Automatic structured variational inference

专知会员服务

40+阅读 · 2020年2月10日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

基于图的word2vec负采样( GNEG:Graph-Based Negative Sampling for word2vec)

基于图的word2vec负采样( GNEG:Graph-Based Negative Sampling for word2vec)

专知会员服务

40+阅读 · 2019年11月23日

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

专知会员服务

17+阅读 · 2019年11月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

AI CITY发展研究报告：“人工智能+”时代的智慧城市发展范式创新（2025年）

风格迁移：十年综述

【ICCV2025】CL-Splats：结合局部优化的高斯泼洒持续学习方法

【HKUST博士论文】迈向可扩展且具泛化能力的时空预测

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Piecewise-Stationary Off-Policy Optimization

Arxiv

0+阅读 · 2020年12月1日

Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning

Arxiv

0+阅读 · 2020年12月1日

Learning by Passing Tests, with Application to Neural Architecture Search

Arxiv

0+阅读 · 2020年11月30日

Optimally Supporting IoT with Cell-Free Massive MIMO

Arxiv

0+阅读 · 2020年11月30日

Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach

Arxiv

0+阅读 · 2020年11月30日

Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning

Arxiv

0+阅读 · 2020年11月29日

Scalable Deep-Learning-Accelerated Topology Optimization for Additively Manufactured Materials

Arxiv

0+阅读 · 2020年11月28日

Kernel methods through the roof: handling billions of points efficiently

Arxiv

0+阅读 · 2020年11月26日

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge

Arxiv

0+阅读 · 2020年11月25日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

微信扫码咨询专知VIP会员