Extremely Simple Activation Shaping for Out-of-Distribution Detection - Zhuanzhi (专知) Papers

Samples · Post-processing · ImageNet (dataset) · Training data · Fine-tuning

May 1, 2023

Extremely Simple Activation Shaping for Out-of-Distribution Detection

Andrija Djurisic, Nebojsa Bozanic, Arjun Ashok, Rosanne Liu

From arXiv. Accepted paper at ICLR 2023. 22 pages (9 main + appendix), 9 figures.

The separation between training and deployment of machine learning models implies that not all scenarios encountered in deployment can be anticipated during training, and therefore relying solely on advancements in training has its limits. Out-of-distribution (OOD) detection is an important area that stress-tests a model's ability to handle unseen situations: Do models know when they don't know? Existing OOD detection methods either incur extra training steps, additional data or make nontrivial modifications to the trained network. In contrast, in this work, we propose an extremely simple, post-hoc, on-the-fly activation shaping method, ASH, where a large portion (e.g. 90%) of a sample's activation at a late layer is removed, and the rest (e.g. 10%) simplified or lightly adjusted. The shaping is applied at inference time, and does not require any statistics calculated from training data. Experiments show that such a simple treatment enhances in-distribution and out-of-distribution distinction so as to allow state-of-the-art OOD detection on ImageNet, and does not noticeably deteriorate the in-distribution accuracy. Video, animation and code can be found at: https://andrijazz.github.io/ash
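The shaping step described in the abstract can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the authors' released code: the function name `shape_activations`, its parameters, and the two modes are hypothetical. It assumes that "removed" means set to zero, that the smallest activations are the ones removed, and that "simplified" can be read as replacing the surviving values with a single constant that preserves the original sum. It operates on one sample's activation vector at inference time and uses no statistics from training data.

```python
def shape_activations(activations, prune_fraction=0.9, mode="prune"):
    """Shape one sample's late-layer activation vector (hypothetical helper).

    Assumptions not stated in the abstract: removed entries become 0.0, the
    smallest entries are the ones removed, and "binarize" replaces every
    surviving entry with one constant that preserves the original sum.
    """
    if not 0.0 <= prune_fraction < 1.0:
        raise ValueError("prune_fraction must be in [0, 1)")
    if mode not in ("prune", "binarize"):
        raise ValueError("mode must be 'prune' or 'binarize'")

    n = len(activations)
    if n == 0:
        return []

    # Number of entries that survive; always keep at least one.
    keep = max(1, n - int(round(n * prune_fraction)))

    # Indices of the `keep` largest activations (ties keep the earlier index).
    order = sorted(range(n), key=lambda i: activations[i], reverse=True)
    kept = set(order[:keep])

    if mode == "prune":
        # Survivors are left untouched; everything else is zeroed.
        return [float(a) if i in kept else 0.0 for i, a in enumerate(activations)]

    # "binarize": spread the original total evenly over the survivors.
    constant = sum(activations) / keep
    return [constant if i in kept else 0.0 for i in range(n)]


if __name__ == "__main__":
    # Toy activation vector standing in for a late layer's output.
    late_layer = [0.1, 0.5, 0.0, 2.0, 0.3, 1.2, 0.05, 0.7, 0.9, 0.4]
    print(shape_activations(late_layer, prune_fraction=0.8))
    print(shape_activations(late_layer, prune_fraction=0.8, mode="binarize"))
```

In a real network the shaped vector would be passed on to the remaining layers, and an OOD score would be computed from the resulting outputs. The choice of layer, the pruning fraction, and the scoring function are left open here because the abstract does not specify them.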
