RePAD: Real-time Proactive Anomaly Detection for Time Series

During the past decade, many anomaly detection approaches have been introduced in fields such as network monitoring, fraud detection, and intrusion detection. However, they require an understanding of the data pattern and often need a long offline period to build a model or network for the target data. Providing real-time and proactive anomaly detection for streaming time series without human intervention or domain knowledge is highly valuable, since it greatly reduces human effort and enables appropriate countermeasures to be taken before disastrous damage, a failure, or another harmful event occurs. However, this issue has not yet been well studied. To address it, this paper proposes RePAD, a Real-time Proactive Anomaly Detection algorithm for streaming time series based on Long Short-Term Memory (LSTM). RePAD uses short-term historical data points to predict the upcoming data point and to determine whether it signals that an anomaly is likely to happen in the near future. By dynamically adjusting the detection threshold over time, RePAD is able to tolerate minor pattern changes in the time series and to detect anomalies either proactively or on time. Experiments on two time series datasets from the Numenta Anomaly Benchmark demonstrate that RePAD is able to proactively detect anomalies and provide early warnings in real time without human intervention or domain knowledge.
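The abstract describes two ideas: predicting the upcoming point from a short window of recent points, and adjusting the detection threshold as the stream evolves. The sketch below illustrates only that general loop and is not the paper's implementation. The predictor is a plain moving average standing in for the LSTM, and the threshold rule (mean plus `k` standard deviations of past prediction errors) is an assumption, since the abstract does not state the rule RePAD uses. The names `naive_forecast`, `detect_stream`, `window_size`, `min_errors`, and `k` are all hypothetical.

```python
from collections import deque
from statistics import mean, pstdev


def naive_forecast(window):
    """Stand-in predictor: average of the recent window (NOT an LSTM)."""
    return sum(window) / len(window)


def detect_stream(values, window_size=3, min_errors=5, k=3.0):
    """Return indices whose prediction error exceeds an adaptive threshold.

    Assumptions (not taken from the abstract):
      - the threshold is mean + k * std of all past prediction errors;
      - no point is flagged until `min_errors` errors have been observed.
    """
    history = deque(maxlen=window_size)  # short-term historical points
    errors = []                          # past absolute prediction errors
    flagged = []

    for i, value in enumerate(values):
        if len(history) == window_size:
            predicted = naive_forecast(history)
            error = abs(value - predicted)

            if len(errors) >= min_errors:
                # The threshold is recomputed at every step, so it
                # follows gradual changes in the error level.
                threshold = mean(errors) + k * pstdev(errors)
                if error > threshold:
                    flagged.append(i)

            errors.append(error)
        history.append(value)

    return flagged


if __name__ == "__main__":
    stream = [10.0, 10.2, 9.9, 10.1, 10.0, 9.8, 10.2,
              10.1, 9.9, 10.0, 10.1, 25.0, 10.0, 10.1]
    print(detect_stream(stream))
```

In this toy stream the spike at index 11 is the only point flagged. Because the spike's own error is added to the error history, the threshold rises sharply afterwards; a practical detector would need to decide how flagged points feed back into the threshold, which this sketch does not address.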


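Related content

Anomaly detection

In data mining, anomaly detection is the identification of items, events, or observations that do not conform to an expected pattern or to the other items in a dataset. Anomalous items typically translate into problems such as bank fraud, structural defects, medical problems, or errors in text. Anomalies are also referred to as outliers, novelties, noise, deviations, and exceptions. In particular, when detecting abuse and network intrusion, the interesting objects are often not rare objects but unexpected bursts of activity. This pattern does not fit the usual statistical definition of an outlier as a rare object, so many anomaly detection methods (in particular unsupervised methods) will fail on such data unless it has been aggregated appropriately. A cluster analysis algorithm, by contrast, may be able to detect the micro-clusters formed by these patterns. There are three broad categories of anomaly detection methods. [1] Unsupervised anomaly detection methods detect anomalies in unlabeled test data by looking for the instances that fit least well with the rest of the data, under the assumption that the majority of instances in the dataset are normal. Supervised anomaly detection methods require a dataset that has been labeled as "normal" and "abnormal" and involve training a classifier (the key difference from many other statistical classification problems is the inherently unbalanced nature of anomaly detection). Semi-supervised anomaly detection methods build a model of normal behavior from a given normal training dataset and then test the likelihood that a test instance was generated by the learned model.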
