学习与解释限制 (Learning with Explanation Constraints) - 专知论文

会员服务 ·

0

学习理论 · 改进模型 · 线性模型 · 变分 · 形式化 ·

2023 年 3 月 25 日

Learning with Explanation Constraints

翻译：学习与解释限制

Rattana Pukdee,Dylan Sam,J. Zico Kolter,Maria-Florina Balcan,Pradeep Ravikumar

While supervised learning assumes the presence of labeled data, we may have prior information about how models should behave. In this paper, we formalize this notion as learning from explanation constraints and provide a learning theoretic framework to analyze how such explanations can improve the learning of our models. For what models would explanations be helpful? Our first key contribution addresses this question via the definition of what we call EPAC models (models that satisfy these constraints in expectation over new data), and we analyze this class of models using standard learning theoretic tools. Our second key contribution is to characterize these restrictions (in terms of their Rademacher complexities) for a canonical class of explanations given by gradient information for linear models and two layer neural networks. Finally, we provide an algorithmic solution for our framework, via a variational approximation that achieves better performance and satisfies these constraints more frequently, when compared to simpler augmented Lagrangian methods to incorporate these explanations. We demonstrate the benefits of our approach over a large array of synthetic and real-world experiments.

翻译：虽然监督学习假设有标记数据的存在，但我们可能具有关于模型应该如何表现的先前信息。在本文中，我们把这种观念形式化为学习从解释限制中，并提供了一个学习理论框架来分析这种解释如何改进模型的学习。哪些模型会受到解释的帮助？我们的第一个关键贡献通过定义我们称之为EPAC模型的模型来回答这个问题（在新数据的期望上满足这些约束的模型），并使用标准的学习理论工具来分析这一类模型。我们的第二个关键贡献是为梯度信息所给出的用于线性模型和两层神经网络的标准解释这一类解释（以其Rademacher复杂性的形式）对这些限制进行表征。最后，我们通过一种变分近似的算法解决了我们的框架，并证明了与简单的加权拉格朗日方法相比，我们的方法在更频繁地满足这些限制的同时实现了更好的性能。我们在大量的合成和真实世界实验中证明了我们方法的优势。

0

相关内容

学习理论

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

专知会员服务

12+阅读 · 2022年3月24日

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

32+阅读 · 2022年3月9日

【NeurIPS2021】学习用于分布外预测的因果语义表示

【NeurIPS2021】学习用于分布外预测的因果语义表示

专知会员服务

18+阅读 · 2021年11月19日

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【PKDD2020教程】可解释人工智能XAI:算法到应用，200页ppt

专知会员服务

41+阅读 · 2020年10月13日

【NeurlPS2019论文总结】一致收敛可能无法解释深度学习中的泛化现象，Uniform convergence may be unable to explain generalization in deep learning

【NeurlPS2019论文总结】一致收敛可能无法解释深度学习中的泛化现象，Uniform convergence may be unable to explain generalization in deep learning

专知会员服务

15+阅读 · 2019年12月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

关于某些代数曲线K2群的研究

国家自然科学基金

1+阅读 · 2015年12月31日

长记忆波动率模型的结构性质、统计推断及应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于数据学习的高斯过程混合体的模型选择及其应用研究

国家自然科学基金

1+阅读 · 2011年12月31日

粒度支持向量机学习方法及应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

多尺度高斯过程模型及其学习曲线研究

国家自然科学基金

2+阅读 · 2009年12月31日

核子自旋、动量结构及其规范不变性研究

国家自然科学基金

0+阅读 · 2008年12月31日

海浪资料同化中背景误差的随机动力学模型及其应用

国家自然科学基金

0+阅读 · 2008年12月31日

铋基钙钛矿无铅压电陶瓷的性能调控和物理机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Explain Any Concept: Segment Anything Meets Concept-Based Explanation

Arxiv

0+阅读 · 2023年5月17日

Online Continual Learning Without the Storage Constraint

Arxiv

0+阅读 · 2023年5月16日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

A survey and taxonomy of loss functions in machine learning

Arxiv

25+阅读 · 2023年1月13日

Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems

Arxiv

37+阅读 · 2021年5月28日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

35+阅读 · 2020年9月3日

Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks

Arxiv

17+阅读 · 2018年6月5日

VIP会员

文章信息

相关主题

相关VIP内容

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

专知会员服务

12+阅读 · 2022年3月24日

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

32+阅读 · 2022年3月9日

【NeurIPS2021】学习用于分布外预测的因果语义表示

【NeurIPS2021】学习用于分布外预测的因果语义表示

专知会员服务

18+阅读 · 2021年11月19日

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【PKDD2020教程】可解释人工智能XAI:算法到应用，200页ppt

专知会员服务

41+阅读 · 2020年10月13日

【NeurlPS2019论文总结】一致收敛可能无法解释深度学习中的泛化现象，Uniform convergence may be unable to explain generalization in deep learning

【NeurlPS2019论文总结】一致收敛可能无法解释深度学习中的泛化现象，Uniform convergence may be unable to explain generalization in deep learning

专知会员服务

15+阅读 · 2019年12月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能治理的未来

模态感知的特征匹配：单一模态与跨模态技术的全面综述

无监督行人重识别研究综述

【牛津博士论文】面向神经影像应用的可扩展且可解释的空间模型

相关资讯

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

相关论文

Explain Any Concept: Segment Anything Meets Concept-Based Explanation

Arxiv

0+阅读 · 2023年5月17日

Online Continual Learning Without the Storage Constraint

Arxiv

0+阅读 · 2023年5月16日

Topological Interpretability for Deep-Learning

Arxiv

1+阅读 · 2023年5月15日

A survey and taxonomy of loss functions in machine learning

Arxiv

25+阅读 · 2023年1月13日

Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems

Arxiv

37+阅读 · 2021年5月28日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

A Survey on the Explainability of Supervised Machine Learning

Arxiv

24+阅读 · 2020年11月16日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

35+阅读 · 2020年9月3日

Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks

Arxiv

17+阅读 · 2018年6月5日

相关基金

关于某些代数曲线K2群的研究

国家自然科学基金

1+阅读 · 2015年12月31日

长记忆波动率模型的结构性质、统计推断及应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于数据学习的高斯过程混合体的模型选择及其应用研究

国家自然科学基金

1+阅读 · 2011年12月31日

粒度支持向量机学习方法及应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

多尺度高斯过程模型及其学习曲线研究

国家自然科学基金

2+阅读 · 2009年12月31日

核子自旋、动量结构及其规范不变性研究

国家自然科学基金

0+阅读 · 2008年12月31日

海浪资料同化中背景误差的随机动力学模型及其应用

国家自然科学基金

0+阅读 · 2008年12月31日

铋基钙钛矿无铅压电陶瓷的性能调控和物理机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员