跨组学联合嵌入的对比学习和自注意多组学综合应用于不完整多组学数据 (CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data) - 专知论文

会员服务 ·

0

组学 · 多组学数据 · 组学数据 · 联合嵌入 · 集成 ·

2023 年 4 月 12 日

CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data

翻译：跨组学联合嵌入的对比学习和自注意多组学综合应用于不完整多组学数据

Chen Zhao,Anqi Liu,Xiao Zhang,Xuewei Cao,Zhengming Ding,Qiuying Sha,Hui Shen,Hong-Wen Deng,Weihua Zhou

from arxiv, 21 pages; 5 figures

Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost. Studies may fail if certain aspects of the subjects are missing or incomplete. In this paper, we propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention (CLCLSA). Utilizing complete multi-omics data as supervision, the model employs cross-omics autoencoders to learn the feature representation across different types of biological data. The multi-omics contrastive learning, which is used to maximize the mutual information between different types of omics, is employed before latent feature concatenation. In addition, the feature-level self-attention and omics-level self-attention are employed to dynamically identify the most informative features for multi-omics data integration. Extensive experiments were conducted on four public multi-omics datasets. The experimental results indicated that the proposed CLCLSA outperformed the state-of-the-art approaches for multi-omics data classification using incomplete multi-omics data.

翻译：多组学数据的集成在理解遗传数据中变得越来越重要。每种组学技术仅提供潜在的生物过程的有限视图，同时集成异质性组学层将导致对疾病和表型的更全面和详细的理解。然而，在执行多组学数据集成时面临的障碍之一是存在由于仪器敏感性和成本而产生的不成对多组学数据。如果研究中缺少或不完整地涵盖了受试者的某些方面，则可能会失败。本文提出了一种用于不完整数据的多组学集成的深度学习方法：基于对比学习和自注意机制的跨组学联合嵌入（CLCLSA）。利用完整的多组学数据作为监督，在模型中运用跨组学自编码器学习跨不同类型的生物数据的特征表示。在潜在特征拼接之前应用多组学对比学习来最大化不同组学之间的互信息。此外，过特征级自注意和组学级自注意机制来动态识别多组学数据集成所需的最具信息量的特征。在四个公共多组学数据集上进行了广泛的实验，实验结果表明，所提出的CLCLSA方法在利用不完整多组学数据进行多组学数据分类时优于现有技术。

0

相关内容

UTC: 用于视觉对话的任务间对比学习的统一Transformer

UTC: 用于视觉对话的任务间对比学习的统一Transformer

专知会员服务

14+阅读 · 2022年5月4日

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

专知会员服务

13+阅读 · 2022年3月19日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

近期必读的五篇AAAI 2021【对比学习】相关论文和代码

专知会员服务

54+阅读 · 2021年1月5日

近期必读的七篇NeurIPS 2020【对比学习】相关论文和代码

近期必读的七篇NeurIPS 2020【对比学习】相关论文和代码

专知会员服务

66+阅读 · 2020年10月20日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

基于蛋白质组学和代谢组学的mcr-1基因介导的多粘菌素耐药机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

蛋白质亚线粒体定位及其特征信息和预测算法的挖掘

国家自然科学基金

0+阅读 · 2014年12月31日

基于GWAS遗传信息构建生物分子互作网络鉴定中国人群骨质疏松症易感基因

国家自然科学基金

0+阅读 · 2014年12月31日

真核转译起始因子eIF4B在Abl诱导细胞癌变中的作用及其机制

国家自然科学基金

0+阅读 · 2014年12月31日

食管鳞癌异质性及耐药机制的多组学贯穿分析研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

丙酮丁醇梭菌HtrA蛋白介导的丁醇耐受性机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

注意网络的影像遗传学研究

国家自然科学基金

0+阅读 · 2012年12月31日

真核基因转录后调控过程相关蛋白质及其复合物的结构生物学研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于杂种优势潜在相关基因数据库的基因调控网络的构建与比较分析

国家自然科学基金

0+阅读 · 2009年12月31日

Independent Component Alignment for Multi-Task Learning

Arxiv

0+阅读 · 2023年5月30日

Who Would be Interested in Services? An Entity Graph Learning System for User Targeting

Arxiv

0+阅读 · 2023年5月30日

Active Collaborative Localization in Heterogeneous Robot Teams

Arxiv

0+阅读 · 2023年5月29日

Understanding Breast Cancer Survival: Using Causality and Language Models on Multi-omics Data

Arxiv

0+阅读 · 2023年5月28日

Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment

Arxiv

0+阅读 · 2023年5月26日

Heterogeneous Value Evaluation for Large Language Models

Arxiv

0+阅读 · 2023年5月26日

Model-Contrastive Federated Learning

Arxiv

10+阅读 · 2021年3月30日

Efficiently Embedding Dynamic Knowledge Graphs

Efficiently Embedding Dynamic Knowledge Graphs

Arxiv

14+阅读 · 2019年10月15日

Multi-view Knowledge Graph Embedding for Entity Alignment

Arxiv

36+阅读 · 2019年6月6日

Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly

Arxiv

18+阅读 · 2018年1月15日

VIP会员

文章信息

相关主题

多组学数据

相关VIP内容

UTC: 用于视觉对话的任务间对比学习的统一Transformer

UTC: 用于视觉对话的任务间对比学习的统一Transformer

专知会员服务

14+阅读 · 2022年5月4日

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

专知会员服务

13+阅读 · 2022年3月19日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

近期必读的五篇AAAI 2021【对比学习】相关论文和代码

专知会员服务

54+阅读 · 2021年1月5日

近期必读的七篇NeurIPS 2020【对比学习】相关论文和代码

近期必读的七篇NeurIPS 2020【对比学习】相关论文和代码

专知会员服务

66+阅读 · 2020年10月20日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的检索与结构化增强生成综述

《实现多层防御多轮交战机制的扩展型随机齐射模型》2025年最新83页

【CMU博士论文】交互驱动的人体动作估计与生成

如何避免生成式人工智能在作战中失控失效

相关资讯

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

相关论文

Independent Component Alignment for Multi-Task Learning

Arxiv

0+阅读 · 2023年5月30日

Who Would be Interested in Services? An Entity Graph Learning System for User Targeting

Arxiv

0+阅读 · 2023年5月30日

Active Collaborative Localization in Heterogeneous Robot Teams

Arxiv

0+阅读 · 2023年5月29日

Understanding Breast Cancer Survival: Using Causality and Language Models on Multi-omics Data

Arxiv

0+阅读 · 2023年5月28日

Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment

Arxiv

0+阅读 · 2023年5月26日

Heterogeneous Value Evaluation for Large Language Models

Arxiv

0+阅读 · 2023年5月26日

Model-Contrastive Federated Learning

Arxiv

10+阅读 · 2021年3月30日

Efficiently Embedding Dynamic Knowledge Graphs

Efficiently Embedding Dynamic Knowledge Graphs

Arxiv

14+阅读 · 2019年10月15日

Multi-view Knowledge Graph Embedding for Entity Alignment

Arxiv

36+阅读 · 2019年6月6日

Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly

Arxiv

18+阅读 · 2018年1月15日

相关基金

基于蛋白质组学和代谢组学的mcr-1基因介导的多粘菌素耐药机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

蛋白质亚线粒体定位及其特征信息和预测算法的挖掘

国家自然科学基金

0+阅读 · 2014年12月31日

基于GWAS遗传信息构建生物分子互作网络鉴定中国人群骨质疏松症易感基因

国家自然科学基金

0+阅读 · 2014年12月31日

真核转译起始因子eIF4B在Abl诱导细胞癌变中的作用及其机制

国家自然科学基金

0+阅读 · 2014年12月31日

食管鳞癌异质性及耐药机制的多组学贯穿分析研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

丙酮丁醇梭菌HtrA蛋白介导的丁醇耐受性机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

注意网络的影像遗传学研究

国家自然科学基金

0+阅读 · 2012年12月31日

真核基因转录后调控过程相关蛋白质及其复合物的结构生物学研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于杂种优势潜在相关基因数据库的基因调控网络的构建与比较分析

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员