智能论文笔记

Denoising Architecture for Unsupervised Anomaly Detection in Time-Series

Wadie Skaf , Tomáš Horváth

分类：机器学习 | 人工智能

2022-08-30

时间序列的异常提供了各个行业的关键方案的见解，从银行和航空航天到信息技术，安全和医学。但是，由于异常的定义，经常缺乏标签以及此类数据中存在的极为复杂的时间相关性，因此识别时间序列数据中的异常尤其具有挑战性。LSTM自动编码器是基于长期短期内存网络的异常检测的编码器传统方案，该方案学会重建时间序列行为，然后使用重建错误来识别异常。我们将Denoising Architecture作为对该LSTM编码模型模型的补充，并研究其对现实世界以及人为生成的数据集的影响。我们证明了所提出的体系结构既提高了准确性和训练速度，从而使LSTM自动编码器更有效地用于无监督的异常检测任务。

translated by 谷歌翻译

Deep Learning for Time Series Anomaly Detection: A Survey

Zahra Zamanzadeh Darban , Geoffrey I. Webb , Shirui Pan , Charu C. Aggarwal , Mahsa Salehi

分类：机器学习 | 人工智能

2022-11-09

Time series anomaly detection has applications in a wide range of research fields and applications, including manufacturing and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, or heart fluttering, and is therefore of particular interest. The large size and complex patterns of time series have led researchers to develop specialised deep learning models for detecting anomalous patterns. This survey focuses on providing structured and comprehensive state-of-the-art time series anomaly detection models through the use of deep learning. It providing a taxonomy based on the factors that divide anomaly detection models into different categories. Aside from describing the basic anomaly detection technique for each category, the advantages and limitations are also discussed. Furthermore, this study includes examples of deep anomaly detection in time series across various application domains in recent years. It finally summarises open issues in research and challenges faced while adopting deep anomaly detection models.

translated by 谷歌翻译

Smart Meter Data Anomaly Detection using Variational Recurrent Autoencoders with Attention

Wenjing Dai , Xiufeng Liu , Alfred Heller , Per Sieverts Nielsen

分类：机器学习

2022-06-08

在能源系统的数字化中，传感器和智能电表越来越多地用于监视生产，运行和需求。基于智能电表数据的异常检测对于在早期阶段识别潜在的风险和异常事件至关重要，这可以作为及时启动适当动作和改善管理的参考。但是，来自能源系统的智能电表数据通常缺乏标签，并且包含噪声和各种模式，而没有明显的周期性。同时，在不同的能量场景中对异常的模糊定义和高度复杂的时间相关性对异常检测构成了巨大的挑战。许多传统的无监督异常检测算法（例如基于群集或基于距离的模型）对噪声不强大，也不完全利用时间序列中的时间依赖性以及在多个变量（传感器）中的其他依赖关系。本文提出了一种基于带有注意机制的变异复发自动编码器的无监督异常检测方法。凭借来自智能电表的“肮脏”数据，我们的方法预示了缺失的值和全球异常，以在训练中缩小其贡献。本文与基于VAE的基线方法和其他四种无监督的学习方法进行了定量比较，证明了其有效性和优势。本文通过一项实际案例研究进一步验证了所提出的方法，该研究方法是检测工业加热厂的供水温度异常。

translated by 谷歌翻译

Generative Anomaly Detection for Time Series Datasets

Zhuangwei Kang , Ayan Mukhopadhyay , Aniruddha Gokhale , Shijie Wen , Abhishek Dubey

分类：机器学习 | 人工智能

2022-06-28

在智能交通系统中，交通拥堵异常检测至关重要。运输机构的目标有两个方面：监视感兴趣领域的一般交通状况，并在异常拥堵状态下定位道路细分市场。建模拥塞模式可以实现这些目标，以实现全市道路的目标，相当于学习多元时间序列（MTS）的分布。但是，现有作品要么不可伸缩，要么无法同时捕获MTS中的空间信息。为此，我们提出了一个由数据驱动的生成方法组成的原则性和全面的框架，该方法可以执行可拖动的密度估计来检测流量异常。我们的方法在特征空间中的第一群段段，然后使用条件归一化流以在无监督的设置下在群集级别识别异常的时间快照。然后，我们通过在异常群集上使用内核密度估计器来识别段级别的异常。关于合成数据集的广泛实验表明，我们的方法在召回和F1得分方面显着优于几种最新的拥塞异常检测和诊断方法。我们还使用生成模型来采样标记的数据，该数据可以在有监督的环境中训练分类器，从而减轻缺乏在稀疏设置中进行异常检测的标记数据。

translated by 谷歌翻译

Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection

Christopher P. Ley , Jorge F. Silva

分类：机器学习

2022-12-05

A new Lossy Causal Temporal Convolutional Neural Network Autoencoder for anomaly detection is proposed in this work. Our framework uses a rate-distortion loss and an entropy bottleneck to learn a compressed latent representation for the task. The main idea of using a rate-distortion loss is to introduce representation flexibility that ignores or becomes robust to unlikely events with distinctive patterns, such as anomalies. These anomalies manifest as unique distortion features that can be accurately detected in testing conditions. This new architecture allows us to train a fully unsupervised model that has high accuracy in detecting anomalies from a distortion score despite being trained with some portion of unlabelled anomalous data. This setting is in stark contrast to many of the state-of-the-art unsupervised methodologies that require the model to be only trained on "normal data". We argue that this partially violates the concept of unsupervised training for anomaly detection as the model uses an informed decision that selects what is normal from abnormal for training. Additionally, there is evidence to suggest it also effects the models ability at generalisation. We demonstrate that models that succeed in the paradigm where they are only trained on normal data fail to be robust when anomalous data is injected into the training. In contrast, our compression-based approach converges to a robust representation that tolerates some anomalous distortion. The robust representation achieved by a model using a rate-distortion loss can be used in a more realistic unsupervised anomaly detection scheme.

translated by 谷歌翻译

AER: Auto-Encoder with Regression for Time Series Anomaly Detection

Lawrence Wong , Dongyu Liu , Laure Berti-Equille , Sarah Alnegheimish , Kalyan Veeramachaneni

分类：机器学习 | (统计)机器学习

2022-12-27

Anomaly detection on time series data is increasingly common across various industrial domains that monitor metrics in order to prevent potential accidents and economic losses. However, a scarcity of labeled data and ambiguous definitions of anomalies can complicate these efforts. Recent unsupervised machine learning methods have made remarkable progress in tackling this problem using either single-timestamp predictions or time series reconstructions. While traditionally considered separately, these methods are not mutually exclusive and can offer complementary perspectives on anomaly detection. This paper first highlights the successes and limitations of prediction-based and reconstruction-based methods with visualized time series signals and anomaly scores. We then propose AER (Auto-encoder with Regression), a joint model that combines a vanilla auto-encoder and an LSTM regressor to incorporate the successes and address the limitations of each method. Our model can produce bi-directional predictions while simultaneously reconstructing the original time series by optimizing a joint objective function. Furthermore, we propose several ways of combining the prediction and reconstruction errors through a series of ablation studies. Finally, we compare the performance of the AER architecture against two prediction-based methods and three reconstruction-based methods on 12 well-known univariate time series datasets from NASA, Yahoo, Numenta, and UCR. The results show that AER has the highest averaged F1 score across all datasets (a 23.5% improvement compared to ARIMA) while retaining a runtime similar to its vanilla auto-encoder and regressor components. Our model is available in Orion, an open-source benchmarking tool for time series anomaly detection.

translated by 谷歌翻译

Smart Metering System Capable of Anomaly Detection by Bi-directional LSTM Autoencoder

Sangkeum Lee , Hojun Jin , Sarvar Hussain Nengroo , Yoonmee Doh , Chungho Lee , Taewook Heo , Dongsoo Har

分类：机器学习

2021-12-06

异常检测涉及广泛的应用，如故障检测，系统监控和事件检测。识别从智能计量系统获得的计量数据的异常是提高电力系统的可靠性，稳定性和效率的关键任务。本文介绍了异常检测过程，以发现在智能计量系统中观察到的异常值。在所提出的方法中，使用双向长短期存储器（BILSTM）的AutoEncoder并找到异常数据点。它通过具有非异常数据的AutoEncoder计算重建错误，并且将分类为异常的异常值通过预定义的阈值与非异常数据分离。基于Bilstm AutoEncoder的异常检测方法用来自985户家庭收集的4种能源电力/水/加热/热水的计量数据进行测试。

translated by 谷歌翻译

Time Series Anomaly Detection for Cyber-Physical Systems via Neural System Identification and Bayesian Filtering

Cheng Feng , Pengwei Tian

分类：机器学习 | (统计)机器学习

2021-06-15

Aiot技术的最新进展导致利用机器学习算法来检测网络物理系统（CPS）的操作失败的越来越受欢迎。在其基本形式中，异常检测模块从物理工厂监控传感器测量和致动器状态，并检测这些测量中的异常以识别异常操作状态。然而，由于该模型必须在存在高度复杂的系统动态和未知量的传感器噪声的情况下准确地检测异常，构建有效的异常检测模型是挑战性的。在这项工作中，我们提出了一种新的时序序列异常检测方法，称为神经系统识别和贝叶斯滤波（NSIBF），其中特制的神经网络架构被构成系统识别，即捕获动态状态空间中CP的动态模型;然后，通过跟踪系统的隐藏状态的不确定性随着时间的推移，自然地施加贝叶斯滤波算法的顶部。我们提供定性的和定量实验，并在合成和三个现实世界CPS数据集上具有所提出的方法，表明NSIBF对最先进的方法比较了对CPS中异常检测的最新方法。

translated by 谷歌翻译

RUAD: unsupervised anomaly detection in HPC systems

Martin Molan , Andrea Borghesi , Daniele Cesarini , Luca Benini , Andrea Bartolini

分类：机器学习 | 人工智能

2022-08-28

现代高性能计算（HPC）系统的复杂性日益增加，需要引入自动化和数据驱动的方法，以支持系统管理员为增加系统可用性的努力。异常检测是改善可用性不可或缺的一部分，因为它减轻了系统管理员的负担，并减少了异常和解决方案之间的时间。但是，对当前的最新检测方法进行了监督和半监督，因此它们需要具有异常的人体标签数据集 - 在生产HPC系统中收集通常是不切实际的。基于聚类的无监督异常检测方法，旨在减轻准确的异常数据的需求，到目前为止的性能差。在这项工作中，我们通过提出RUAD来克服这些局限性，RUAD是一种新型的无监督异常检测模型。 Ruad比当前的半监督和无监督的SOA方法取得了更好的结果。这是通过考虑数据中的时间依赖性以及在模型体系结构中包括长短期限内存单元的实现。提出的方法是根据tier-0系统（带有980个节点的Cineca的Marconi100的完整历史）评估的。 RUAD在半监督训练中达到曲线（AUC）下的区域（AUC）为0.763，在无监督的训练中达到了0.767的AUC，这改进了SOA方法，在半监督训练中达到0.747的AUC，无需训练的AUC和0.734的AUC在无处不在的AUC中提高了AUC。训练。它还大大优于基于聚类的当前SOA无监督的异常检测方法，其AUC为0.548。

translated by 谷歌翻译

A Comparative Study of Detecting Anomalies in Time Series Data Using LSTM and TCN Models

Saroj Gopali , Faranak Abri , Sima Siami-Namini , Akbar Siami Namin

分类：机器学习

2021-12-17

存在几种数据驱动方法，使我们的模型时间序列数据能够包括传统的基于回归的建模方法（即，Arima）。最近，在时间序列分析和预测的背景下介绍和探索了深度学习技术。询问的主要研究问题是在预测时间序列数据中的深度学习技术中的这些变化的性能。本文比较了两个突出的深度学习建模技术。比较了经常性的神经网络（RNN）长的短期记忆（LSTM）和卷积神经网络（CNN）基于基于TCN的时间卷积网络（TCN），并报告了它们的性能和训练时间。根据我们的实验结果，两个建模技术都表现了相当具有基于TCN的模型优于LSTM略微。此外，基于CNN的TCN模型比基于RNN的LSTM模型更快地构建了稳定的模型。

translated by 谷歌翻译

Adaptive Memory Networks with Self-supervised Learning for Unsupervised Anomaly Detection

Yuxin Zhang , Jindong Wang , Yiqiang Chen , Han Yu , Tao Qin

分类：机器学习 | 人工智能

2022-01-03

无监督的异常检测旨在通过在正常数据上训练来建立模型以有效地检测看不见的异常。尽管以前的基于重建的方法取得了富有成效的进展，但由于两个危急挑战，他们的泛化能力受到限制。首先，训练数据集仅包含正常模式，这限制了模型泛化能力。其次，现有模型学到的特征表示通常缺乏代表性，妨碍了保持正常模式的多样性的能力。在本文中，我们提出了一种称为自适应存储器网络的新方法，具有自我监督的学习（AMSL）来解决这些挑战，并提高无监督异常检测中的泛化能力。基于卷积的AutoEncoder结构，AMSL包含一个自我监督的学习模块，以学习一般正常模式和自适应内存融合模块来学习丰富的特征表示。四个公共多变量时间序列数据集的实验表明，与其他最先进的方法相比，AMSL显着提高了性能。具体而言，在具有9亿个样本的最大帽睡眠阶段检测数据集上，AMSL以精度和F1分数\ TextBF {4} \％+优于第二个最佳基线。除了增强的泛化能力之外，AMSL还针对输入噪声更加强大。

translated by 谷歌翻译

Forecast-based Multi-aspect Framework for Multivariate Time-series Anomaly Detection

Lan Wang , Yusan Lin , Yuhang Wu , Huiyuan Chen , Fei Wang , Hao Yang

分类：机器学习

2022-01-13

今天的网络世界难以多变量。在极端品种中收集的指标需要多变量算法以正确检测异常。然而，基于预测的算法，如被广泛证明的方法，通常在数据集中进行次优或不一致。一个关键的常见问题是他们努力成为一个尺寸适合的，但异常在自然中是独特的。我们提出了一种裁定到这种区别的方法。提出FMUAD - 一种基于预测，多方面，无监督的异常检测框架。FMUAD明确，分别捕获异常类型的签名性状 - 空间变化，时间变化和相关变化 - 与独立模块。然后，模块共同学习最佳特征表示，这是非常灵活和直观的，与类别中的大多数其他模型不同。广泛的实验表明我们的FMUAD框架始终如一地优于其他最先进的预测的异常探测器。

translated by 谷歌翻译

Unsupervised Anomaly Detection in Time-series: An Extensive Evaluation and Analysis of State-of-the-art Methods

Nesryne Mejri , Laura Lopez-Fuentes , Kankana Roy , Pavel Chernakov , Enjie Ghorbel , Djamila Aouada

分类：机器学习

2022-12-06

Unsupervised anomaly detection in time-series has been extensively investigated in the literature. Notwithstanding the relevance of this topic in numerous application fields, a complete and extensive evaluation of recent state-of-the-art techniques is still missing. Few efforts have been made to compare existing unsupervised time-series anomaly detection methods rigorously. However, only standard performance metrics, namely precision, recall, and F1-score are usually considered. Essential aspects for assessing their practical relevance are therefore neglected. This paper proposes an original and in-depth evaluation study of recent unsupervised anomaly detection techniques in time-series. Instead of relying solely on standard performance metrics, additional yet informative metrics and protocols are taken into account. In particular, (1) more elaborate performance metrics specifically tailored for time-series are used; (2) the model size and the model stability are studied; (3) an analysis of the tested approaches with respect to the anomaly type is provided; and (4) a clear and unique protocol is followed for all experiments. Overall, this extensive analysis aims to assess the maturity of state-of-the-art time-series anomaly detection, give insights regarding their applicability under real-world setups and provide to the community a more complete evaluation protocol.

translated by 谷歌翻译

Anomaly Detection with Adversarially Learned Perturbations of Latent Space

Vahid Reza Khazaie , Anthony Wong , John Taylor Jewell , Yalda Mohsenzadeh

分类：计算机视觉 | 人工智能

2022-07-03

异常检测是确定不符合正常数据分布的样品。由于异常数据的无法获得，培训监督的深神经网络是一项繁琐的任务。因此，无监督的方法是解决此任务的常见方法。深度自动编码器已被广泛用作许多无监督的异常检测方法的基础。但是，深层自动编码器的一个显着缺点是，它们通过概括重建异常值来提供不足的表示异常检测的表示。在这项工作中，我们设计了一个对抗性框架，该框架由两个竞争组件组成，一个对抗性变形者和一个自动编码器。对抗性变形器是一种卷积编码器，学会产生有效的扰动，而自动编码器是一个深层卷积神经网络，旨在重建来自扰动潜在特征空间的图像。这些网络经过相反的目标训练，在这种目标中，对抗性变形者会产生用于编码器潜在特征空间的扰动，以最大化重建误差，并且自动编码器试图中和这些扰动的效果以最大程度地减少它。当应用于异常检测时，该提出的方法会由于对特征空间的扰动应用而学习语义上的富裕表示。所提出的方法在图像和视频数据集上的异常检测中优于现有的最新方法。

translated by 谷歌翻译

RePAD: Real-time Proactive Anomaly Detection for Time Series

Ming-Chang Lee , Jia-Chun Lin , Ernst Gunnar Gran

分类：机器学习 | (统计)机器学习

2020-01-24

During the past decade, many anomaly detection approaches have been introduced in different fields such as network monitoring, fraud detection, and intrusion detection. However, they require understanding of data pattern and often need a long off-line period to build a model or network for the target data. Providing real-time and proactive anomaly detection for streaming time series without human intervention and domain knowledge is highly valuable since it greatly reduces human effort and enables appropriate countermeasures to be undertaken before a disastrous damage, failure, or other harmful event occurs. However, this issue has not been well studied yet. To address it, this paper proposes RePAD, which is a Real-time Proactive Anomaly Detection algorithm for streaming time series based on Long Short-Term Memory (LSTM). RePAD utilizes short-term historic data points to predict and determine whether or not the upcoming data point is a sign that an anomaly is likely to happen in the near future. By dynamically adjusting the detection threshold over time, RePAD is able to tolerate minor pattern change in time series and detect anomalies either proactively or on time. Experiments based on two time series datasets collected from the Numenta Anomaly Benchmark demonstrate that RePAD is able to proactively detect anomalies and provide early warnings in real time without human intervention and domain knowledge.

translated by 谷歌翻译

Calibrated One-class Classification for Unsupervised Time Series Anomaly Detection

Hongzuo Xu , Yijie Wang , Songlei Jian , Qing Liao , Yongjun Wang , Guansong Pang

分类：机器学习 | 人工智能

2022-07-25

无监督的时间序列异常检测对各种域中目标系统的潜在故障有助于。当前的最新时间序列异常检测器主要集中于设计高级神经网络结构和新的重建/预测学习目标，以尽可能准确地学习数据正常（正常模式和行为）。但是，这些单级学习方法可以被训练数据中未知异常（即异常污染）所欺骗。此外，他们的正常学习也缺乏对感兴趣异常的知识。因此，他们经常学习一个有偏见的，不准确的正态边界。本文提出了一种新型的单级学习方法，称为校准的一级分类，以解决此问题。我们的单级分类器以两种方式进行校准：（1）通过适应性地惩罚不确定的预测，这有助于消除异常污染的影响，同时强调单级模型对一级模型有信心的预测，并通过区分正常情况来确定（2）来自本机异常示例的样本，这些样本是根据原始数据基于原始数据模拟真实时间序列异常行为的。这两个校准导致耐污染的，异常的单级学习，从而产生了显着改善的正态性建模。对六个现实世界数据集进行的广泛实验表明，我们的模型大大优于12个最先进的竞争对手，并获得了6％-31％的F1分数提高。源代码可在\ url {https://github.com/xuhongzuo/couta}中获得。

translated by 谷歌翻译

DASVDD: Deep Autoencoding Support Vector Data Descriptor for Anomaly Detection

Hadi Hojjati , Narges Armanfard

分类：机器学习

2021-06-09

半监督异常检测旨在使用在正常数据上培训的模型来检测来自正常样本的异常。随着近期深度学习的进步，研究人员设计了高效的深度异常检测方法。现有作品通常使用神经网络将数据映射到更具内容性的表示中，然后应用异常检测算法。在本文中，我们提出了一种方法，DASVDD，它共同学习AutoEncoder的参数，同时最小化其潜在表示上的封闭超球的音量。我们提出了一个异常的分数，它是自动化器的重建误差和距离潜在表示中封闭边距中心的距离的组合。尽量减少这种异常的分数辅助我们在培训期间学习正常课程的潜在分布。包括异常分数中的重建错误确保DESVDD不受常见的极度崩溃问题，因为DESVDD模型不会收敛到映射到潜在表示中的恒定点的常量点。几个基准数据集上的实验评估表明，该方法优于常用的最先进的异常检测算法，同时在不同的异常类中保持鲁棒性能。

translated by 谷歌翻译

Towards a Rigorous Evaluation of Time-series Anomaly Detection

Siwon Kim , Kukjin Choi , Hyun-Soo Choi , Byunghan Lee , Sungroh Yoon

分类：机器学习 | 人工智能

2021-09-11

近年来，提出了关于时间序列异常检测（TAD）的研究报告基准TAD数据集中的高F1分数，给出了TAD的清晰改进的印象。然而，大多数研究在评分之前应用了一个名为Point调整（PA）的特殊评估协议。在本文中，我们理论上实验揭示了PA协议具有高估检测性能的可能性;也就是说，即使是随机异常的分数也可以容易地变成最先进的TAD方法。因此，应用PA协议后的TAD方法的比较可能导致误导排名。此外，我们通过表示未经训练的模型对现有方法获得了可比的检测性能，即使禁止禁止，我们会解决现有TAD方法的潜力。根据我们的调查结果，我们提出了一种新的基线和评估议定书。我们预计我们的研究将有助于对TAD严格评估，并导致未来的研究进一步改善。

translated by 谷歌翻译

Memory-free Online Change-point Detection: A Novel Neural Network Approach

Zahra Atashgahi , Decebal Constantin Mocanu , Raymond Veldhuis , Mykola Pechenizkiy

分类：机器学习 | 人工智能

2022-07-08

检测数据分布突然变化的变更点检测（CPD）被认为是时间序列分析中最重要的任务之一。尽管关于离线CPD的文献广泛，但无监督的在线CPD仍面临主要挑战，包括可扩展性，超参数调整和学习限制。为了减轻其中一些挑战，在本文中，我们提出了一种新颖的深度学习方法，用于从多维时间序列中无监督的在线CPD，名为Adaptive LSTM-AUTOENOCODER变更点检测（ALACPD）。 ALACPD利用了基于LSTM-AutoEncoder的神经网络来执行无监督的在线CPD。它连续地适应了传入的样本，而无需保留先前接收的输入，因此没有内存。我们对几个实际时间序列的CPD基准进行了广泛的评估。我们表明，在时间序列细分的质量方面，ALACPD平均在最先进的CPD算法中排名第一，并且就估计更改点的准确性而言，它与表现最好。 ALACPD的实现可在Github \ footNote {\ url {https://github.com/zahraatashgahi/alacpd}}上在线获得。

translated by 谷歌翻译

ReRe: A Lightweight Real-time Ready-to-Go Anomaly Detection Approach for Time Series

Ming-Chang Lee , Jia-Chun Lin , Ernst Gunnar Gran

分类：机器学习 | (统计)机器学习

2020-04-05

Anomaly detection is an active research topic in many different fields such as intrusion detection, network monitoring, system health monitoring, IoT healthcare, etc. However, many existing anomaly detection approaches require either human intervention or domain knowledge, and may suffer from high computation complexity, consequently hindering their applicability in real-world scenarios. Therefore, a lightweight and ready-to-go approach that is able to detect anomalies in real-time is highly sought-after. Such an approach could be easily and immediately applied to perform time series anomaly detection on any commodity machine. The approach could provide timely anomaly alerts and by that enable appropriate countermeasures to be undertaken as early as possible. With these goals in mind, this paper introduces ReRe, which is a Real-time Ready-to-go proactive Anomaly Detection algorithm for streaming time series. ReRe employs two lightweight Long Short-Term Memory (LSTM) models to predict and jointly determine whether or not an upcoming data point is anomalous based on short-term historical data points and two long-term self-adaptive thresholds. Experiments based on real-world time-series datasets demonstrate the good performance of ReRe in real-time anomaly detection without requiring human intervention or domain knowledge.

translated by 谷歌翻译