Reliable in silico molecular toxicity prediction is a cornerstone of modern drug discovery, offering a scalable alternative to experimental screening. However, the black-box nature of state-of-the-art models remains a significant barrier to adoption, as high-stakes safety decisions demand verifiable structural insights alongside predictive performance. To address this, we propose a novel multi-task learning (MTL) framework designed to jointly enhance accuracy and interpretability. Our architecture integrates a shared chemical language model with task-specific attention modules. By imposing an L1 sparsity penalty on these modules, the framework is constrained to focus on a minimal set of salient molecular fragments for each distinct toxicity endpoint. The resulting framework is trained end-to-end and is readily adaptable to various transformer-based backbones. Evaluated on the ClinTox, SIDER, and Tox21 benchmark datasets, our approach consistently outperforms both single-task and standard MTL baselines. Crucially, the sparse attention weights provide chemically intuitive visualizations that reveal the specific fragments influencing predictions, thereby enhancing insight into the model's decision-making process.
翻译:可靠的分子毒性计算机预测是现代药物发现的基石,为实验筛选提供了可扩展的替代方案。然而,当前最先进模型的黑箱特性仍是其应用的主要障碍,因为高风险的安全性决策不仅要求预测性能,还需要可验证的结构性见解。为此,我们提出了一种新颖的多任务学习框架,旨在同时提升准确性与可解释性。该架构将共享的化学语言模型与任务特定的注意力模块相结合。通过对这些模块施加L1稀疏性惩罚,框架被约束为每个不同的毒性终点仅关注一组最少的显著分子片段。所得框架采用端到端训练,并可轻松适配多种基于Transformer的骨干网络。在ClinTox、SIDER和Tox21基准数据集上的评估表明,我们的方法在性能上持续优于单任务及标准多任务学习基线。关键的是,稀疏注意力权重提供了化学直观的可视化结果,揭示了影响预测的具体分子片段,从而增强了对模型决策过程的理解。