BioGPT: 用于生物医学文本生成与挖掘的生成预训练Transformer (BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining) - 专知论文

会员服务 ·

0

生物 · 预训练 · 文本生成 · BERT · 语言模型 ·

2023 年 4 月 3 日

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

翻译：BioGPT: 用于生物医学文本生成与挖掘的生成预训练Transformer

Renqian Luo,Liai Sun,Yingce Xia,Tao Qin,Sheng Zhang,Hoifung Poon,Tie-Yan Liu

from arxiv, Published at Briefings in Bioinformatics. Code is available at https://github.com/microsoft/BioGPT

Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain. Among the two main branches of pre-trained language models in the general language domain, i.e., BERT (and its variants) and GPT (and its variants), the first one has been extensively studied in the biomedical domain, such as BioBERT and PubMedBERT. While they have achieved great success on a variety of discriminative downstream biomedical tasks, the lack of generation ability constrains their application scope. In this paper, we propose BioGPT, a domain-specific generative Transformer language model pre-trained on large scale biomedical literature. We evaluate BioGPT on six biomedical NLP tasks and demonstrate that our model outperforms previous models on most tasks. Especially, we get 44.98%, 38.42% and 40.76% F1 score on BC5CDR, KD-DTI and DDI end-to-end relation extraction tasks respectively, and 78.2% accuracy on PubMedQA, creating a new record. Our case study on text generation further demonstrates the advantage of BioGPT on biomedical literature to generate fluent descriptions for biomedical terms. Code is available at https://github.com/microsoft/BioGPT.

翻译：预训练语言模型在自然语言领域取得了巨大的成功，这在生物医学领域引起了越来越多的关注。在自然语言领域中的预训练语言模型的两个主要分支中，即BERT和GPT，BERT（和其变体）已经在生物医学领域得到了广泛的研究，例如BioBERT和PubMedBERT。虽然它们在各种分类的下游生物医学任务上取得了巨大的成功，但缺乏生成能力限制了它们的应用范围。在本文中，我们提出了BioGPT，一种在大规模生物医学文献上预训练的领域特定生成Transformer语言模型。我们在六个生物医学NLP任务上评估BioGPT，并证明我们的模型在大多数任务上优于以前的模型。特别地，在BC5CDR，KD-DTI和DDI端到端关系提取任务中分别获得了44.98％，38.42％和40.76％的F1分数，并在PubMedQA上获得了78.2％的准确度，创造了一个新的记录。我们的文本生成案例研究进一步证明了BioGPT在生物医学文献中的优势，可以为生物医学术语生成流畅的描述。源代码可以在https://github.com/microsoft/BioGPT中找到。

0

相关内容

具有动能的生命体。

PubMed GPT ：用于生物医学文本的特定领域大型语言模型

PubMed GPT ：用于生物医学文本的特定领域大型语言模型

专知会员服务

37+阅读 · 2022年12月19日

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

26+阅读 · 2022年3月3日

UIUC韩家炜：从海量非结构化文本中挖掘结构化知识

UIUC韩家炜：从海量非结构化文本中挖掘结构化知识

专知会员服务

96+阅读 · 2021年12月30日

【KDD2021】TUTA: 通用表格预训练的树结构Transformer

专知会员服务

24+阅读 · 2021年8月22日

预训练模型如何用于文本挖掘？看这份KDD2021-UIUC《预训练文本表示:模型与应用在文本挖掘》教程，附200页Slides

专知会员服务

43+阅读 · 2021年8月18日

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

专知会员服务

97+阅读 · 2020年7月3日

【ACL2020】用于生成深度问题的语义图，Semantic Graphs for Generating Deep Questions

【ACL2020】用于生成深度问题的语义图，Semantic Graphs for Generating Deep Questions

专知会员服务

25+阅读 · 2020年5月5日

【微软亚洲研究院】CodeBERT:用于编程和自然语言的预训练模型，CodeBERT: A Pre-Trained Model for Programming and Natural Languages

【微软亚洲研究院】CodeBERT:用于编程和自然语言的预训练模型，CodeBERT: A Pre-Trained Model for Programming and Natural Languages

专知会员服务

31+阅读 · 2020年2月21日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

53+阅读 · 2020年1月30日

【AAAI2020论文】概念结构化嵌入医疗文本表示（Learning Conceptual-Contextual Embeddings for Medical Text）

【AAAI2020论文】概念结构化嵌入医疗文本表示（Learning Conceptual-Contextual Embeddings for Medical Text）

专知会员服务

48+阅读 · 2019年11月15日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

25+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

27+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

15+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

专知

15+阅读 · 2018年5月15日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

面向跨领域异构数据的患者相似性学习方法及应用

国家自然科学基金

22+阅读 · 2016年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

语音信号声纹信息成分的深层表达

国家自然科学基金

0+阅读 · 2012年12月31日

BRCA1蛋白出核的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥与油菜种子油脂积累消减器(SFAR)对含油量形成的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

南海内波的生成、传播及其三维性

国家自然科学基金

0+阅读 · 2012年12月31日

基于芳香亚磺酸的脱二氧化硫C-C键和C-卤键生成反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

斑马鱼心脏发育

国家自然科学基金

0+阅读 · 2009年12月31日

中性Cu(I)配合物的合成及光电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Active Learning for Natural Language Generation

Arxiv

0+阅读 · 2023年5月24日

Enabling Large Language Models to Generate Text with Citations

Arxiv

0+阅读 · 2023年5月24日

Exploring Train and Test-Time Augmentations for Audio-Language Learning

Arxiv

0+阅读 · 2023年5月23日

Partial Annotation Learning for Biomedical Entity Recognition

Arxiv

0+阅读 · 2023年5月22日

Distilling ChatGPT for Explainable Automated Student Answer Assessment

Arxiv

2+阅读 · 2023年5月22日

STOAT: Structured Data to Analytical Text With Controls

Arxiv

0+阅读 · 2023年5月19日

A One-Class Classifier for the Detection of GAN Manipulated Multi-Spectral Satellite Images

Arxiv

0+阅读 · 2023年5月19日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

A Survey of Knowledge-Enhanced Text Generation

Arxiv

18+阅读 · 2020年10月9日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

VIP会员

文章信息

相关主题

相关VIP内容

PubMed GPT ：用于生物医学文本的特定领域大型语言模型

PubMed GPT ：用于生物医学文本的特定领域大型语言模型

专知会员服务

37+阅读 · 2022年12月19日

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

26+阅读 · 2022年3月3日

UIUC韩家炜：从海量非结构化文本中挖掘结构化知识

UIUC韩家炜：从海量非结构化文本中挖掘结构化知识

专知会员服务

96+阅读 · 2021年12月30日

【KDD2021】TUTA: 通用表格预训练的树结构Transformer

专知会员服务

24+阅读 · 2021年8月22日

预训练模型如何用于文本挖掘？看这份KDD2021-UIUC《预训练文本表示:模型与应用在文本挖掘》教程，附200页Slides

专知会员服务

43+阅读 · 2021年8月18日

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

专知会员服务

97+阅读 · 2020年7月3日

【ACL2020】用于生成深度问题的语义图，Semantic Graphs for Generating Deep Questions

【ACL2020】用于生成深度问题的语义图，Semantic Graphs for Generating Deep Questions

专知会员服务

25+阅读 · 2020年5月5日

【微软亚洲研究院】CodeBERT:用于编程和自然语言的预训练模型，CodeBERT: A Pre-Trained Model for Programming and Natural Languages

【微软亚洲研究院】CodeBERT:用于编程和自然语言的预训练模型，CodeBERT: A Pre-Trained Model for Programming and Natural Languages

专知会员服务

31+阅读 · 2020年2月21日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

53+阅读 · 2020年1月30日

【AAAI2020论文】概念结构化嵌入医疗文本表示（Learning Conceptual-Contextual Embeddings for Medical Text）

【AAAI2020论文】概念结构化嵌入医疗文本表示（Learning Conceptual-Contextual Embeddings for Medical Text）

专知会员服务

48+阅读 · 2019年11月15日

热门VIP内容

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

25+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

27+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

15+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

专知

15+阅读 · 2018年5月15日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

相关论文

Active Learning for Natural Language Generation

Arxiv

0+阅读 · 2023年5月24日

Enabling Large Language Models to Generate Text with Citations

Arxiv

0+阅读 · 2023年5月24日

Exploring Train and Test-Time Augmentations for Audio-Language Learning

Arxiv

0+阅读 · 2023年5月23日

Partial Annotation Learning for Biomedical Entity Recognition

Arxiv

0+阅读 · 2023年5月22日

Distilling ChatGPT for Explainable Automated Student Answer Assessment

Arxiv

2+阅读 · 2023年5月22日

STOAT: Structured Data to Analytical Text With Controls

Arxiv

0+阅读 · 2023年5月19日

A One-Class Classifier for the Detection of GAN Manipulated Multi-Spectral Satellite Images

Arxiv

0+阅读 · 2023年5月19日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

A Survey of Knowledge-Enhanced Text Generation

Arxiv

18+阅读 · 2020年10月9日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

相关基金

面向跨领域异构数据的患者相似性学习方法及应用

国家自然科学基金

22+阅读 · 2016年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

语音信号声纹信息成分的深层表达

国家自然科学基金

0+阅读 · 2012年12月31日

BRCA1蛋白出核的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥与油菜种子油脂积累消减器(SFAR)对含油量形成的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

南海内波的生成、传播及其三维性

国家自然科学基金

0+阅读 · 2012年12月31日

基于芳香亚磺酸的脱二氧化硫C-C键和C-卤键生成反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

斑马鱼心脏发育

国家自然科学基金

0+阅读 · 2009年12月31日

中性Cu(I)配合物的合成及光电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员