蛋白质中氨基酸模式的可解释机器学习:一种统计集合方法 (Interpretable machine learning of amino acid patterns in proteins: a statistical ensemble approach) - 专知论文

会员服务 ·

0

可解释机器学习 · 机器学习 · 机器学习模型 · 学习模型 · 蛋白质二级结构 ·

2023 年 3 月 27 日

Interpretable machine learning of amino acid patterns in proteins: a statistical ensemble approach

翻译：蛋白质中氨基酸模式的可解释机器学习:一种统计集合方法

Anna Braghetto,Enzo Orlandini,Marco Baiesi

from arxiv, 15 pages, 9 figures

Explainable and interpretable unsupervised machine learning helps understand the underlying structure of data. We introduce an ensemble analysis of machine learning models to consolidate their interpretation. Its application shows that restricted Boltzmann machines compress consistently into a few bits the information stored in a sequence of five amino acids at the start or end of $\alpha$-helices or $\beta$-sheets. The weights learned by the machines reveal unexpected properties of the amino acids and the secondary structure of proteins: (i) His and Thr have a negligible contribution to the amphiphilic pattern of $\alpha$-helices; (ii) there is a class of $\alpha$-helices particularly rich in Ala at their end; (iii) Pro occupies most often slots otherwise occupied by polar or charged amino acids, and its presence at the start of helices is relevant; (iv) Glu and especially Asp on one side, and Val, Leu, Iso, and Phe on the other, display the strongest tendency to mark amphiphilic patterns, i.e., extreme values of an "effective hydrophobicity", though they are not the most powerful (non) hydrophobic amino acids.

翻译：可解释和可解释的无监督机器学习有助于理解数据的潜在结构。我们介绍一种机器学习模型的集合分析，以 conslidate 理解。它的应用表明，受限玻尔兹曼机器经常压缩在 $\alpha$-helices or $\beta$-sheets的开始或结尾的五个氨基酸序列中存储的信息，变成了一个容量更小而精简的信息片段。机器学习模型学习到的权重揭示了氨基酸和蛋白质二级结构的意外特性: (i)His和Thr对$\alpha$-helices中的亲疏性模式的贡献微不足道; (ii)有一类 $\alpha$-helices 在其末尾富含酪氨酸; (iii) Pro最常用于占用极性或电荷氨基酸的位置,它在螺旋的开头的存在很重要; (iv)谷氨酸和尤其是天门冬氨酸在一侧,以及缬氨酸、亮氨酸、异亮氨酸和苯丙氨酸在另一侧,显示出标记亲疏性模式,即“有效疏水性”的极端值，尽管它们不是最强大的（非）疏水性氨基酸。

0

相关内容

可解释机器学习

可解释机器学习

可解释性是指一个人能够持续预测模型结果的程度。机器学习模型的可解释性越高，人们就越容易理解为什么做出某些决定或预测。

基于共进化和机器学习的蛋白质金属结合位点预测新方法

基于共进化和机器学习的蛋白质金属结合位点预测新方法

专知会员服务

7+阅读 · 2023年1月9日

Nat. Biotechnol. | 用机器学习预测多肽质谱库

Nat. Biotechnol. | 用机器学习预测多肽质谱库

专知会员服务

18+阅读 · 2022年9月12日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

关于某些代数曲线K2群的研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于结构域的蛋白质-RNA相互作用预测模型构建

国家自然科学基金

0+阅读 · 2014年12月31日

基于光致电荷转移的蛋白质荧光传感器

国家自然科学基金

0+阅读 · 2014年12月31日

家蚕卵黄原蛋白受体与贮藏蛋白相互作用的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于蛋白质复合物的关键蛋白质预测

国家自然科学基金

1+阅读 · 2013年12月31日

蛋白质紫外共振拉曼光谱的QM/MM多尺度理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

靶向VEGFR-2的II型小分子抑制剂的设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

两个E3连接酶在FERONIA信号通路中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

SUMO/DeSUMO化修饰在抑制性受体膜转运中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

高等植物光系统II捕光色素蛋白复合体腔侧环区影响结构和功能的机理

国家自然科学基金

0+阅读 · 2008年12月31日

Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Arxiv

0+阅读 · 2023年5月17日

PiML Toolbox for Interpretable Machine Learning Model Development and Validation

Arxiv

0+阅读 · 2023年5月16日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

Explainable Reinforcement Learning via a Causal World Model

Arxiv

0+阅读 · 2023年5月15日

Learning with Differentiable Algorithms

Arxiv

11+阅读 · 2022年9月1日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

VIP会员

文章信息

相关主题

可解释机器学习

机器学习模型

蛋白质二级结构

相关VIP内容

基于共进化和机器学习的蛋白质金属结合位点预测新方法

基于共进化和机器学习的蛋白质金属结合位点预测新方法

专知会员服务

7+阅读 · 2023年1月9日

Nat. Biotechnol. | 用机器学习预测多肽质谱库

Nat. Biotechnol. | 用机器学习预测多肽质谱库

专知会员服务

18+阅读 · 2022年9月12日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《步兵小单元山地严寒作战指南》美军最新条令200页

《联合作战概念的发展》最新报告

俄制无人机弹药

《复杂场景下自主着陆的模型预测控制技术》92页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

相关论文

Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Arxiv

0+阅读 · 2023年5月17日

PiML Toolbox for Interpretable Machine Learning Model Development and Validation

Arxiv

0+阅读 · 2023年5月16日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

Explainable Reinforcement Learning via a Causal World Model

Arxiv

0+阅读 · 2023年5月15日

Learning with Differentiable Algorithms

Arxiv

11+阅读 · 2022年9月1日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

相关基金

关于某些代数曲线K2群的研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于结构域的蛋白质-RNA相互作用预测模型构建

国家自然科学基金

0+阅读 · 2014年12月31日

基于光致电荷转移的蛋白质荧光传感器

国家自然科学基金

0+阅读 · 2014年12月31日

家蚕卵黄原蛋白受体与贮藏蛋白相互作用的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于蛋白质复合物的关键蛋白质预测

国家自然科学基金

1+阅读 · 2013年12月31日

蛋白质紫外共振拉曼光谱的QM/MM多尺度理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

靶向VEGFR-2的II型小分子抑制剂的设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

两个E3连接酶在FERONIA信号通路中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

SUMO/DeSUMO化修饰在抑制性受体膜转运中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

高等植物光系统II捕光色素蛋白复合体腔侧环区影响结构和功能的机理

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员