Encoder-decoder architectures are prominent building blocks of state-of-the-art solutions for tasks across multiple fields where deep learning (DL) or foundation models play a key role. Although there is a growing community working on the provision of interpretation for DL models as well as considerable work in the neuro-symbolic community seeking to integrate symbolic representations and DL, many open questions remain around the need for better tools for visualization of the inner workings of DL architectures. In particular, encoder-decoder models offer an exciting opportunity for visualization and editing by humans of the knowledge implicitly represented in model weights. In this work, we explore ways to create an abstraction for segments of the network as a two-way graph-based representation. Changes to this graph structure should be reflected directly in the underlying tensor representations. Such two-way graph representation enables new neuro-symbolic systems by leveraging the pattern recognition capabilities of the encoder-decoder along with symbolic reasoning carried out on the graphs. The approach is expected to produce new ways of interacting with DL models but also to improve performance as a result of the combination of learning and reasoning capabilities.
翻译:代码解码器架构是多个领域任务最先进的最新解决方案的基石,在这些领域,深学习或基础模型发挥着关键作用。虽然在为DL模型提供解释方面,社区正在日益壮大,神经-共振界也做了大量工作,力求将象征性的表示和DL结合起来,但在需要更好的工具来直观DL结构的内部运行情况方面,许多开放的问题依然存在。特别是,编码解码器模型为人类在模型重量中暗含的知识的可视化和编辑提供了一个令人振奋的机会。在这项工作中,我们探索了如何为网络各部分创建抽象,作为双向图示代表。这种图形结构的改变应直接反映在基本的数以方表示中。这种双向图示通过利用编码解码器-解码器的图案识别能力以及图案上的象征性推理,使得新的神经-共振系统得以实现。预计该方法将产生与DL模型进行互动的新方法,同时也将提高学习和推理能力的组合。