利用 " 深创基础模型:SARS-COV-2药物目标验证 " 加速产生深创基础模型的隐居器发现 (Accelerating Inhibitor Discovery With A Deep Generative Foundation Model: Validation for SARS-CoV-2 Drug Targets)

Vijil Chenthamarakshan,Samuel C. Hoffman,C. David Owen,Petra Lukacik,Claire Strain-Damerell,Daren Fearon,Tika R. Malla,Anthony Tumber,Christopher J. Schofield,Helen M. E. Duyvesteyn,Wanwisa Dejnirattisai,Loic Carrique,Thomas S. Walter,Gavin R. Screaton,Tetiana Matviiuk,Aleksandra Mojsilovic,Jason Crain,Martin A. Walsh,David I. Stuart,Payel Das

from arxiv, Revised title, abstract, and text; additional figures

The discovery of novel inhibitor molecules for emerging drug-target proteins is widely acknowledged as a challenging inverse design problem: Exhaustive exploration of the vast chemical search space is impractical, especially when the target structure or active molecules are unknown. Here we validate experimentally the broad utility of a deep generative framework trained at-scale on protein sequences, small molecules, and their mutual interactions -- that is unbiased toward any specific target. As demonstrators, we consider two dissimilar and relevant SARS-CoV-2 targets: the main protease and the spike protein (receptor binding domain, RBD). To perform target-aware design of novel inhibitor molecules, a protein sequence-conditioned sampling on the generative foundation model is performed. Despite using only the target sequence information, and without performing any target-specific adaptation of the generative model, micromolar-level inhibition was observed in in vitro experiments for two candidates out of only four synthesized for each target. The most potent spike RBD inhibitor also exhibited activity against several variants in live virus neutralization assays. These results therefore establish that a single, broadly deployable generative foundation model for accelerated hit discovery is effective and efficient, even in the most general case where neither target structure nor binder information is available.

翻译：发现新出现的药物目标蛋白的新抑制分子分子被公认为是一个具有挑战性的反向设计问题:对庞大的化学搜索空间进行彻底探索是不切实际的,特别是当目标结构或活跃分子未知时。在这里,我们实验地验证了在蛋白序列、小分子及其相互作用方面接受过大规模培训的深层基因化框架的广泛效用 -- -- 对任何具体目标都是不带偏见的。作为示威者,我们认为两个不同和相关的SARS-COV-2目标:主要蛋白质和尖刺蛋白(受体约束域,RBD)。为了对新的抑制分子进行有目标的设计,在基因化基础模型上进行蛋白质序列定序取样。尽管我们只使用目标序列信息,而且没有对基因化模型进行任何特定目标调整,但在体外实验中观察到微摩尔级抑制作用,每个目标只有四种合成的两名候选人。最强大的峰化RBD抑制剂还展示了与活性病毒中性变异体(受域,RBD)的活动。因此,这些结果证明一个单一的、最广泛的部署性、最广泛的基质和加速的基质的模型既不是有效的常规的,也是快速的,也是快速的基的。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

40+阅读 · 2022年10月10日

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

专知会员服务

28+阅读 · 2022年2月20日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日