神经测试台:评价联合预测 (The Neural Testbed: Evaluating Joint Predictions) - 专知论文

会员服务 ·

0

Agent · 边缘化 · 值域 · Principle · 数据生成过程 ·

2022 年 11 月 2 日

The Neural Testbed: Evaluating Joint Predictions

翻译：神经测试台:评价联合预测

Ian Osband,Zheng Wen,Seyed Mohammad Asghari,Vikranth Dwaracherla,Botao Hao,Morteza Ibrahimi,Dieterich Lawson,Xiuyuan Lu,Brendan O'Donoghue,Benjamin Van Roy

Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a range of agents using a simple neural network data generating process. Our results indicate that some popular Bayesian deep learning agents do not fare well with joint predictions, even when they can produce accurate marginal predictions. We also show that the quality of joint predictions drives performance in downstream decision tasks. We find these results are robust across choice a wide range of generative models, and highlight the practical importance of joint predictions to the community.

翻译：本文介绍《神经测试:对产生这种预测的物剂进行有控制和有原则的评估的公开来源基准》。关键是,测试床评估物剂不仅对其每种投入的边际预测质量进行评估,而且对其在许多投入方面的联合预测进行评估。我们使用简单的神经网络数据生成过程对一系列物剂进行评估。我们的结果表明,一些受欢迎的巴耶斯深层学习物剂对联合预测并不满意,即使它们能够产生准确的边际预测。我们还表明,联合预测的质量能推动下游决策任务的业绩。我们发现,这些结果在选择广泛的基因模型方面是稳健的,并突出了联合预测对社区的实际重要性。

0

相关内容

Agent

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

谷氨酸AMPA受体GluR2/GAPDH干扰肽在颞叶癫痫中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

CNTF激活的Ast与神经元间的对话交流在癫痫发病机制中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

一种新的肿瘤转移抑制基因TMSG-1在人前列腺癌中的作用机理

国家自然科学基金

0+阅读 · 2009年12月31日

Notch 信号通路在颞叶癫痫海马硬化形成中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Residual Tracking and Stopping for Solving Consistent Linear Inverse Problems with Finite Domains

Arxiv

0+阅读 · 2022年12月22日

Few-shot human motion prediction for heterogeneous sensors

Arxiv

0+阅读 · 2022年12月22日

Training language models for deeper understanding improves brain alignment

Arxiv

0+阅读 · 2022年12月21日

Learning and Evaluating Graph Neural Network Explanations based on Counterfactual and Factual Reasoning

Arxiv

17+阅读 · 2022年2月17日

Estimating Node Importance in Knowledge Graphs Using Graph Neural Networks

Arxiv

25+阅读 · 2019年5月21日

VIP会员

文章信息

相关主题

数据生成过程

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《北约认知战概念报告》

《预测促成大规模货运无人机的技术趋势与影响》报告

美海军放弃星座级转而采用国家安全巡逻舰设计

《北约作战弹性概念》报告

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Residual Tracking and Stopping for Solving Consistent Linear Inverse Problems with Finite Domains

Arxiv

0+阅读 · 2022年12月22日

Few-shot human motion prediction for heterogeneous sensors

Arxiv

0+阅读 · 2022年12月22日

Training language models for deeper understanding improves brain alignment

Arxiv

0+阅读 · 2022年12月21日

Learning and Evaluating Graph Neural Network Explanations based on Counterfactual and Factual Reasoning

Arxiv

17+阅读 · 2022年2月17日

Estimating Node Importance in Knowledge Graphs Using Graph Neural Networks

Arxiv

25+阅读 · 2019年5月21日

相关基金

谷氨酸AMPA受体GluR2/GAPDH干扰肽在颞叶癫痫中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

CNTF激活的Ast与神经元间的对话交流在癫痫发病机制中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

一种新的肿瘤转移抑制基因TMSG-1在人前列腺癌中的作用机理

国家自然科学基金

0+阅读 · 2009年12月31日

Notch 信号通路在颞叶癫痫海马硬化形成中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员