AnoShift: 一种用于无监督异常检测的分布漂移基准测试 (AnoShift: A Distribution Shift Benchmark for Unsupervised Anomaly Detection) - 专知论文

会员服务 ·

0

基准测试 · 无监督异常检测 · 基准 · 无监督 · 独立同分布 ·

2023 年 4 月 3 日

AnoShift: A Distribution Shift Benchmark for Unsupervised Anomaly Detection

翻译：AnoShift: 一种用于无监督异常检测的分布漂移基准测试

Marius Dragoi,Elena Burceanu,Emanuela Haller,Andrei Manolache,Florin Brad

Analyzing the distribution shift of data is a growing research direction in nowadays Machine Learning (ML), leading to emerging new benchmarks that focus on providing a suitable scenario for studying the generalization properties of ML models. The existing benchmarks are focused on supervised learning, and to the best of our knowledge, there is none for unsupervised learning. Therefore, we introduce an unsupervised anomaly detection benchmark with data that shifts over time, built over Kyoto-2006+, a traffic dataset for network intrusion detection. This type of data meets the premise of shifting the input distribution: it covers a large time span ($10$ years), with naturally occurring changes over time (eg users modifying their behavior patterns, and software updates). We first highlight the non-stationary nature of the data, using a basic per-feature analysis, t-SNE, and an Optimal Transport approach for measuring the overall distribution distances between years. Next, we propose AnoShift, a protocol splitting the data in IID, NEAR, and FAR testing splits. We validate the performance degradation over time with diverse models, ranging from classical approaches to deep learning. Finally, we show that by acknowledging the distribution shift problem and properly addressing it, the performance can be improved compared to the classical training which assumes independent and identically distributed data (on average, by up to $3\%$ for our approach). Dataset and code are available at https://github.com/bit-ml/AnoShift/.

翻译：数据的分布漂移分析是当今机器学习领域的一个不断发展的研究方向，这导致出现了专门针对机器学习模型通用性研究的全新基准测试。现有的基准测试都专注于监督学习，据我们所知，还没有针对无监督学习的基准测试。因此，我们引入了一种无监督异常检测基准测试，针对Kyoto-2006+网络入侵检测数据集构建在时间轴上的数据。这种数据符合输入分布漂移的前提：它涵盖了一个很长的时间跨度（10年），因此存在自然变化（例如用户修改其行为模式和软件更新）。我们使用基本的逐特征分析，t-SNE和最优输运方法来突出显示数据的非平稳性质，并定义了AnoShift协议，将数据分为IID（独立同分布，即所有数据在同一时间段），NEAR（接近数据，即不同时期但近似分布）和FAR（远离数据，即远离分布）三个测试集。我们使用不同模型对时间性能损耗进行验证，范围从传统方法到深度学习。最后，我们表明，通过认识到分布漂移问题并恰当地解决它，性能可以相对于假设独立同分布的典型训练而得到提高（在平均值上高达3%）。数据集和代码可在https://github.com/bit-ml/AnoShift/上获得。

0

相关内容

基准测试

基准测试是指通过设计科学的测试方法、测试工具和测试系统，实现对一类测试对象的某项性能指标进行定量的和可对比的测试。

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

生成式对抗网络异常检测，GANs for Anomaly Detection

专知会员服务

34+阅读 · 2021年9月16日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【O'Reilly AI Conference 2019】使用深度学习进行异常检测以测量大型数据集的质量（Anomaly detection using deep learning to measure the quality of large datasets），BlueWhale的联合创始人兼CTO Sridhar Alla

【O'Reilly AI Conference 2019】使用深度学习进行异常检测以测量大型数据集的质量（Anomaly detection using deep learning to measure the quality of large datasets），BlueWhale的联合创始人兼CTO Sridhar Alla

专知会员服务

28+阅读 · 2019年11月5日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

动手写机器学习算法：异常检测 Anomaly Detection

动手写机器学习算法：异常检测 Anomaly Detection

七月在线实验室

11+阅读 · 2017年12月8日

基于信息熵和DCS的多基线SAR干涉理论与新方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

InSAR连接点自动稳健提取理论与方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

时间序列异常值探测的Bayes方法及其在GNSS动态数据处理中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

两样本稀疏不平衡观测的纵向数据中的检验问题

国家自然科学基金

1+阅读 · 2013年12月31日

基于剖面似然的统计推断

国家自然科学基金

0+阅读 · 2013年12月31日

基于miRNA表达异常导致Th1/Th2免疫失调的PBC发病机制及中医补虚化瘀治法研究

国家自然科学基金

0+阅读 · 2013年12月31日

不完全数据的经验似然和经验熵研究

国家自然科学基金

0+阅读 · 2011年12月31日

批次过程数据模量驱动的分布中心匹配故障诊断研究

国家自然科学基金

0+阅读 · 2011年12月31日

高频地波雷达多域协同系统建模及抗干扰方法

国家自然科学基金

1+阅读 · 2011年12月31日

非刚性变形的实时远程再现

国家自然科学基金

0+阅读 · 2011年12月31日

Beyond Individual Input for Deep Anomaly Detection on Tabular Data

Arxiv

0+阅读 · 2023年5月24日

On Context Distribution Shift in Task Representation Learning for Offline Meta RL

Arxiv

0+阅读 · 2023年5月23日

Robust Instruction Optimization for Large Language Models with Distribution Shifts

Arxiv

0+阅读 · 2023年5月23日

Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection

Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection

Arxiv

0+阅读 · 2023年5月22日

Multimodal Industrial Anomaly Detection via Hybrid Fusion

Multimodal Industrial Anomaly Detection via Hybrid Fusion

Arxiv

11+阅读 · 2023年3月1日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

40+阅读 · 2022年7月28日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark

Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark

Arxiv

46+阅读 · 2019年9月22日

Transfer Adaptation Learning: A Decade Survey

Transfer Adaptation Learning: A Decade Survey

Arxiv

37+阅读 · 2019年3月12日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

VIP会员

文章信息

相关主题

无监督异常检测

独立同分布

相关VIP内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

生成式对抗网络异常检测，GANs for Anomaly Detection

专知会员服务

34+阅读 · 2021年9月16日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

【O’Reilly讲座】基于深度学习的异常检测方法用于检测大型数据集的质量：Anomaly detection using deep learning to measure the quality of large datasets

专知会员服务

31+阅读 · 2020年1月11日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【O'Reilly AI Conference 2019】使用深度学习进行异常检测以测量大型数据集的质量（Anomaly detection using deep learning to measure the quality of large datasets），BlueWhale的联合创始人兼CTO Sridhar Alla

【O'Reilly AI Conference 2019】使用深度学习进行异常检测以测量大型数据集的质量（Anomaly detection using deep learning to measure the quality of large datasets），BlueWhale的联合创始人兼CTO Sridhar Alla

专知会员服务

28+阅读 · 2019年11月5日

热门VIP内容

开通专知VIP会员享更多权益服务

GPT-5如何对齐？从硬性拒绝到安全完成：走向以输出为中心的安全训练

【伯克利博士论文】超越人类监督的视觉智能

【ICCV2025】SO(3) 上连续非保守动力系统的预测

2025年中国数据要素行业发展研究报告

相关资讯

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

动手写机器学习算法：异常检测 Anomaly Detection

动手写机器学习算法：异常检测 Anomaly Detection

七月在线实验室

11+阅读 · 2017年12月8日

相关论文

Beyond Individual Input for Deep Anomaly Detection on Tabular Data

Arxiv

0+阅读 · 2023年5月24日

On Context Distribution Shift in Task Representation Learning for Offline Meta RL

Arxiv

0+阅读 · 2023年5月23日

Robust Instruction Optimization for Large Language Models with Distribution Shifts

Arxiv

0+阅读 · 2023年5月23日

Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection

Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection

Arxiv

0+阅读 · 2023年5月22日

Multimodal Industrial Anomaly Detection via Hybrid Fusion

Multimodal Industrial Anomaly Detection via Hybrid Fusion

Arxiv

11+阅读 · 2023年3月1日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

40+阅读 · 2022年7月28日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark

Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark

Arxiv

46+阅读 · 2019年9月22日

Transfer Adaptation Learning: A Decade Survey

Transfer Adaptation Learning: A Decade Survey

Arxiv

37+阅读 · 2019年3月12日

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Arxiv

19+阅读 · 2018年1月27日

相关基金

基于信息熵和DCS的多基线SAR干涉理论与新方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

InSAR连接点自动稳健提取理论与方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

时间序列异常值探测的Bayes方法及其在GNSS动态数据处理中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

两样本稀疏不平衡观测的纵向数据中的检验问题

国家自然科学基金

1+阅读 · 2013年12月31日

基于剖面似然的统计推断

国家自然科学基金

0+阅读 · 2013年12月31日

基于miRNA表达异常导致Th1/Th2免疫失调的PBC发病机制及中医补虚化瘀治法研究

国家自然科学基金

0+阅读 · 2013年12月31日

不完全数据的经验似然和经验熵研究

国家自然科学基金

0+阅读 · 2011年12月31日

批次过程数据模量驱动的分布中心匹配故障诊断研究

国家自然科学基金

0+阅读 · 2011年12月31日

高频地波雷达多域协同系统建模及抗干扰方法

国家自然科学基金

1+阅读 · 2011年12月31日

非刚性变形的实时远程再现

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员