Automated Medication Regimen (MR) extraction from medical conversations can not only improve recall and help patients follow through with their care plan, but also reduce the documentation burden for doctors. In this paper, we focus on extracting spans for frequency, route and change, corresponding to medications discussed in the conversation. We first describe a unique dataset of annotated doctor-patient conversations and then present a weakly supervised model architecture that can perform span extraction using noisy classification data. The model utilizes an attention bottleneck inside a classification model to perform the extraction. We experiment with several variants of attention scoring and projection functions and propose a novel transformer-based attention scoring function (TAScore). The proposed combination of TAScore and Fusedmax projection achieves a 10 point increase in Longest Common Substring F1 compared to the baseline of additive scoring plus softmax projection.
翻译:从医疗谈话中提取的自动医疗制度不仅可以改善记忆,帮助病人执行护理计划,而且可以减少医生的文献负担。在本文中,我们侧重于提取频率、路线和变化的间隔,与谈话中讨论的药物相对应。我们首先描述有注释的医生-病人谈话的独特数据集,然后提出一个监督不力的模型结构,利用吵闹的分类数据进行抽取。模型利用分类模式中的注意力瓶颈来进行抽取。我们试验了几个关注评分和投影功能的变异功能,并提出了一个新的以变压器为基础的注意评分功能(TAScore ) 。TAScore和FSDAx投影的拟议组合使最常见的F1子字符比加软式投影的加积分数基准增加了10个百分点。