比较微弱和不受监督的共振异常探测方法 (Comparing Weak- and Unsupervised Methods for Resonant Anomaly Detection)

Anomaly detection techniques are growing in importance at the Large Hadron Collider (LHC), motivated by the increasing need to search for new physics in a model-agnostic way. In this work, we provide a detailed comparative study between a well-studied unsupervised method called the autoencoder (AE) and a weakly-supervised approach based on the Classification Without Labels (CWoLa) technique. We examine the ability of the two methods to identify a new physics signal at different cross sections in a fully hadronic resonance search. By construction, the AE classification performance is independent of the amount of injected signal. In contrast, the CWoLa performance improves with increasing signal abundance. When integrating these approaches with a complete background estimate, we find that the two methods have complementary sensitivity. In particular, CWoLa is effective at finding diverse and moderately rare signals while the AE can provide sensitivity to very rare signals, but only with certain topologies. We therefore demonstrate that both techniques are complementary and can be used together for anomaly detection at the LHC.

翻译：在大型高原相撞器(LHC)中,反常探测技术越来越重要,原因是越来越需要以模型和不可知的方式搜索新的物理学。在这项工作中,我们提供了一种详细比较研究,一种研究周密的、未经监督的、称为自动编码器(AE)的方法,一种基于无标签分类(CEWLA)技术的微弱监督方法。我们研究了这两种方法在完全有时间共振的搜索中在不同交叉区段识别新的物理信号的能力。通过构建,AE分类性能独立于注入信号的数量。相反,CWLa性能随着信号的丰度的增加而提高。在将这些方法与完整的背景估计结合起来时,我们发现这两种方法具有互补的敏感性。特别是,CWOLA能够有效地找到多种和中度稀有的信号,而AE可以提供非常罕见的信号的敏感度,但只有某些表象。我们因此证明,这两种技术是相辅相成的,可以同时用于在LHCCCS发现异常现象。

相关内容

异常检测

关注 102

在数据挖掘中，异常检测（英语：anomaly detection）对不符合预期模式或数据集中其他项目的项目、事件或观测值的识别。通常异常项目会转变成银行欺诈、结构缺陷、医疗问题、文本错误等类型的问题。异常也被称为离群值、新奇、噪声、偏差和例外。特别是在检测滥用与网络入侵时，有趣性对象往往不是罕见对象，但却是超出预料的突发活动。这种模式不遵循通常统计定义中把异常点看作是罕见对象，于是许多异常检测方法（特别是无监督的方法）将对此类数据失效，除非进行了合适的聚集。相反，聚类分析算法可能可以检测出这些模式形成的微聚类。有三大类异常检测方法。[1] 在假设数据集中大多数实例都是正常的前提下，无监督异常检测方法能通过寻找与其他数据最不匹配的实例来检测出未标记测试数据的异常。监督式异常检测方法需要一个已经被标记“正常”与“异常”的数据集，并涉及到训练分类器（与许多其他的统计分类问题的关键区别是异常检测的内在不均衡性）。半监督式异常检测方法根据一个给定的正常训练数据集创建一个表示正常行为的模型，然后检测由学习模型生成的测试实例的可能性。

【NeurIPS2020】DeepMind最新《神经科学人工智能》报告，126页ppt

专知会员服务

70+阅读 · 2020年12月10日

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

专知会员服务

46+阅读 · 2020年7月29日

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日