Smart Paper Notes

A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent Manufacturing

Haiming Yao, Wenyong Yu, Xue Wang

Categories: Computer Vision | Artificial Intelligence

2022-06-22

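Recent advances in the industrial inspection of textured surfaces, in the form of visual inspection, have made such inspection possible for efficient, flexible manufacturing systems. We propose an unsupervised feature memory rearrangement network (FMR-Net) to accurately detect various texture defects simultaneously. Consistent with mainstream methods, we adopt the idea of background reconstruction; however, we innovatively use artificial synthetic defects to enable the model to recognize anomalies, whereas conventional wisdom relies only on defect-free samples. First, we employ an encoding module to obtain multi-scale features of the textured surface. Subsequently, a contrastive-learning-based memory feature module (CMFM) is proposed to obtain discriminative representations and to construct a normal feature memory bank in the latent space, which can serve for defect substitution and fast anomaly scoring at the patch level. Next, a novel global feature rearrangement module (GFRM) is proposed to further suppress the reconstruction of residual defects. Finally, a decoding module uses the restored features to reconstruct the normal texture background. In addition, to improve inspection performance, a two-phase training strategy is used for accurate defect restoration refinement, and we exploit a multimodal inspection method to achieve noise-robust defect localization. We verify our method through extensive experiments and test its practical deployment in collaborative edge-cloud intelligent manufacturing scenarios by means of a multi-level detection method, showing that FMR-Net achieves state-of-the-art inspection accuracy and has great potential for use in edge-computing-enabled smart industries.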
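
The abstract above mentions a memory bank of normal features used for fast patch-level anomaly scoring. One common way to realise such scoring, offered here only as a minimal sketch and not as the authors' implementation, is to score each patch feature by its distance to the nearest stored normal feature. The function name `patch_anomaly_scores`, the Euclidean metric, and the toy 2-D features are all assumptions made for illustration.

```python
import math

def patch_anomaly_scores(patch_features, memory_bank):
    """Score each patch by its Euclidean distance to the nearest normal feature.

    Assumption: nearest-neighbour distance stands in for the paper's scoring rule.
    """
    scores = []
    for feat in patch_features:
        nearest = min(math.dist(feat, item) for item in memory_bank)
        scores.append(nearest)
    return scores

# Hypothetical 2-D features; real latent features would be much higher-dimensional.
memory_bank = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
patches = [(0.1, 0.0), (3.0, 4.0)]  # first is close to normal, second is far away
scores = patch_anomaly_scores(patches, memory_bank)
```

A patch whose feature lies far from every stored normal feature receives a larger score, so thresholding `scores` gives a coarse patch-level defect decision.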

translated by Google Translate

Deep Learning for Unsupervised Anomaly Localization in Industrial Images: A Survey

Xian Tao, Xinyi Gong, Xin Zhang, Shaohua Yan, Chandranath Adak

Categories: Computer Vision

2022-07-21

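Currently, deep learning-based visual inspection has been highly successful with the help of supervised learning methods. However, in real industrial scenarios, the scarcity of defect samples, the cost of annotation, and the lack of prior knowledge about defects may render supervised methods ineffective. In recent years, unsupervised anomaly localization algorithms have become widely used in industrial inspection tasks. This paper aims to help researchers in this field by comprehensively surveying recent achievements in unsupervised anomaly localization in industrial images using deep learning. The survey reviews more than 120 significant publications covering different aspects of anomaly localization, mainly the concepts, challenges, taxonomies, benchmark datasets, and quantitative performance comparisons of the methods reviewed. In reviewing the achievements to date, the paper provides detailed predictions and analysis of several future research directions. This review offers detailed technical information for researchers who are interested in industrial anomaly localization and who wish to apply it to anomaly localization in other fields.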

Clear Memory-Augmented Auto-Encoder for Surface Defect Detection

Wei Luo, Tongzhi Niu, Lixin Tang, Wenyong Yu, Bin Li

Categories: Computer Vision | Artificial Intelligence

2022-08-08

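In surface defect detection, owing to the extreme imbalance between the numbers of positive and negative samples, anomaly detection methods based on positive samples have received increasing attention. Reconstruction-based methods in particular are the most popular. However, existing methods either struggle to repair abnormal foregrounds or fail to reconstruct clear backgrounds. We therefore propose a clear memory-augmented auto-encoder (CMA-AE). First, we propose a novel clear memory-augmented module that combines the encoding and the memory encoding in a forget-and-input manner, thereby repairing abnormal foregrounds while preserving clear backgrounds. Second, a general artificial anomaly generation algorithm is proposed to simulate anomalies that are as realistic and feature-rich as possible. Finally, we propose a novel multi-scale feature residual detection method for defect segmentation, which makes defect localization more accurate. CMA-AE is compared with 11 state-of-the-art methods on five benchmark datasets and shows an average improvement of 18.6% in F1-measure.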

Generalizable Industrial Visual Anomaly Detection with Self-Induction Vision Transformer

Haiming Yao, Wenyong Yu

Categories: Computer Vision

2022-11-22

Industrial vision anomaly detection plays a critical role in advanced intelligent manufacturing, but several limitations still need to be addressed. First, existing reconstruction-based methods struggle with the identity mapping of trivial shortcuts, where the reconstruction error gap between normal and abnormal samples becomes negligible, leading to inferior detection capability. Second, previous studies have mainly concentrated on convolutional neural network (CNN) models that capture the local semantics of objects and neglect the global context, which also results in inferior performance. Moreover, existing studies follow an individual learning fashion in which a detection model handles only one product category, while generalizable detection across multiple categories has not been explored. To tackle these limitations, we propose a self-induction vision Transformer (SIVT) for unsupervised, generalizable, multi-category industrial visual anomaly detection and localization. The proposed SIVT first extracts discriminative features from a pre-trained CNN as property descriptors. Then, the self-induction vision Transformer reconstructs the extracted features in a self-supervised fashion, where auxiliary induction tokens are additionally introduced to induce the semantics of the original signal. Finally, abnormal properties are detected using the semantic feature residual difference. We evaluated SIVT on the existing MVTec AD benchmark; the results show that the proposed method advances state-of-the-art detection performance with an improvement of 2.8-6.3 in AUROC and 3.3-7.6 in AP.
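
The last step in the abstract above, detecting anomalies from the feature residual, can be illustrated with a small sketch. It assumes, for illustration only, that the anomaly map is the per-location L2 norm of the difference between extracted and reconstructed feature vectors and that the image-level score is the map's maximum; the names `residual_anomaly_map` and `image_score` and the toy 2x2 feature grid are hypothetical and not taken from the paper.

```python
import math

def residual_anomaly_map(features, reconstructed):
    """Per-location L2 norm of the residual between two feature grids.

    Both inputs are H x W grids whose cells are channel vectors (tuples).
    """
    return [
        [math.dist(f, r) for f, r in zip(f_row, r_row)]
        for f_row, r_row in zip(features, reconstructed)
    ]

def image_score(anomaly_map):
    """Image-level score: the largest residual anywhere in the map (an assumption)."""
    return max(max(row) for row in anomaly_map)

# Hypothetical 2x2 grid with 2-channel features; only one location is poorly reconstructed.
features = [[(1.0, 0.0), (0.0, 1.0)], [(1.0, 1.0), (2.0, 2.0)]]
reconstructed = [[(1.0, 0.0), (0.0, 1.0)], [(1.0, 1.0), (2.0, -1.0)]]
amap = residual_anomaly_map(features, reconstructed)
score = image_score(amap)
```

Locations the model reconstructs faithfully get a residual near zero, while a location it cannot reproduce stands out in `amap`.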

A Survey on Unsupervised Visual Industrial Anomaly Detection Algorithms

Yajie Cui, Zhaoxiang Liu, Shiguo Lian

Categories: Computer Vision

2022-04-24

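In line with the development of Industry 4.0, the field of surface defect detection is attracting more and more attention. Improving efficiency and saving labor costs have steadily become matters of great concern in industry, where in recent years deep learning-based algorithms have performed better than traditional visual inspection methods. Existing deep learning-based algorithms, however, are biased towards supervised learning, which not only requires a huge amount of labeled data and considerable labor but is also inefficient and has certain limitations. In contrast, recent research shows that unsupervised learning has great potential for overcoming these disadvantages in visual industrial anomaly detection. In this survey, we summarize current challenges and provide a thorough overview of recently proposed unsupervised algorithms for visual industrial anomaly detection, covering five categories whose innovations and frameworks are described in detail. Information on publicly available datasets containing surface image samples is also provided. By comparing different classes of methods, we summarize the advantages and disadvantages of anomaly detection algorithms. The survey is expected to help both the research community and industry develop a broader, cross-domain perspective.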

Visual Anomaly Detection Via Partition Memory Bank Module and Error Estimation

Peng Xing, Zechao Li

Categories: Computer Vision

2022-09-26

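Reconstruction methods based on memory modules for visual anomaly detection attempt to narrow the reconstruction error for normal samples while enlarging it for anomalous samples. Unfortunately, existing memory modules are not fully suited to the anomaly detection task, and the reconstruction error of anomalous samples remains small. To this end, this work proposes a new unsupervised visual anomaly detection method that jointly learns effective normal features and eliminates unfavorable reconstruction errors. Specifically, a novel Partition Memory Bank (PMB) module is proposed to effectively learn and store detailed features of normal samples with semantic integrity. It develops a new partition mechanism and a unique query generation method to preserve context information and thereby improve the learning ability of the memory module. The proposed PMB and skip connections are explored alternately to make the reconstruction of abnormal samples worse. To obtain more precise anomaly localization results and to solve the problem of cumulative reconstruction error, a novel Histogram Error Estimation module is proposed that adaptively eliminates unfavorable errors using the histogram of the difference image. It improves anomaly localization performance without increasing the cost. To evaluate the effectiveness of the proposed method for anomaly detection and localization, extensive experiments are conducted on three widely used anomaly detection datasets. The encouraging performance of the proposed method compared with recent memory-module-based approaches demonstrates its superiority.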

PAEDID: Patch Autoencoder Based Deep Image Decomposition For Pixel-level Defective Region Segmentation

Shancong Mou, Meng Cao, Haoping Bai, Ping Huang, Jianjun Shi, Jiulong Shan

Categories: Computer Vision | Machine Learning

2022-03-28

Unsupervised pixel-level defective region segmentation is an important task in image-based anomaly detection for various industrial applications. The state-of-the-art methods each have their own advantages and limitations: matrix-decomposition-based methods are robust to noise but cannot model complex background images; representation-based methods are good at localizing defective regions but extract their shape contours inaccurately; and reconstruction-based methods produce defective regions that match the ground-truth shape contours well but are noisy. To combine the strengths of these approaches, we present an unsupervised patch autoencoder based deep image decomposition (PAEDID) method for defective region segmentation. In the training stage, we learn the common background as a deep image prior using a patch autoencoder (PAE) network. In the inference stage, we formulate anomaly detection as an image decomposition problem with the deep image prior and domain-specific regularizations. With the proposed approach, the defective regions in an image can be accurately extracted in an unsupervised fashion. We demonstrate the effectiveness of the PAEDID method in simulation studies and on an industrial dataset in a case study.

Self-Supervised Guided Segmentation Framework for Unsupervised Anomaly Detection

Peng Xing, Yanpeng Sun, Zechao Li

Categories: Computer Vision

2022-09-26

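Unsupervised anomaly detection is a challenging task in industrial applications because collecting sufficient anomalous samples is impractical. In this paper, a novel Self-Supervised Guided Segmentation Framework (SGSF) is proposed by jointly exploring an effective generation method for forged anomalous samples and the use of normal sample features as guidance information for segmentation-based anomaly detection. Specifically, to ensure that the generated forged anomalous samples benefit model training, a Saliency Augmentation Module (SAM) is proposed. SAM introduces a saliency map to generate a saliency Perlin noise map and develops an adaptive segmentation strategy to generate irregular masks in the salient region. The masks are then used to generate forged anomalous samples as negative samples for training. Unfortunately, the distribution gap between forged and real anomalous samples makes it difficult for a model trained on forged samples to locate real anomalies effectively. To this end, a Self-supervised Guidance Network (SGN) is proposed. It uses a self-supervised module to extract features that are noise-free and contain normal semantic information as prior knowledge for the segmentation module. Equipped with knowledge of normal patterns, the segmentation module segments out the regions that differ from the guidance features. To evaluate the effectiveness of SGSF for anomaly detection, extensive experiments are conducted on three anomaly detection datasets. The experimental results show that SGSF achieves state-of-the-art anomaly detection results.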

Prototypical Residual Networks for Anomaly Detection and Localization

Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang

Categories: Computer Vision

2022-12-05

Anomaly detection and localization are widely used in industrial manufacturing for their efficiency and effectiveness. Anomalies are rare and hard to collect, and supervised models trained on only a handful of abnormal samples easily over-fit to these seen anomalies, producing unsatisfactory performance. On the other hand, anomalies are typically subtle, hard to discern, and varied in appearance, making them difficult to detect, let alone to locate. To address these issues, we propose a framework called Prototypical Residual Network (PRN), which learns feature residuals of varying scales and sizes between anomalous and normal patterns to accurately reconstruct the segmentation maps of anomalous regions. PRN mainly consists of two parts: multi-scale prototypes that explicitly represent the residual features of anomalies with respect to normal patterns, and a multi-size self-attention mechanism that enables variable-sized anomalous feature learning. In addition, we present a variety of anomaly generation strategies that consider both seen and unseen appearance variance to enlarge and diversify anomalies. Extensive experiments on the challenging and widely used MVTec AD benchmark show that PRN outperforms current state-of-the-art unsupervised and supervised methods. We further report SOTA results on three additional datasets to demonstrate the effectiveness and generalizability of PRN.

Computer Vision on X-ray Data in Industrial Production and Security Applications: A survey

Mehdi Rafiei, Jenni Raitoharju, Alexandros Iosifidis

Categories: Computer Vision

2022-11-10

X-ray imaging technology has been used for decades in clinical tasks to reveal the internal condition of different organs, and in recent years, it has become more common in other areas such as industry, security, and geography. The recent development of computer vision and machine learning techniques has also made it easier to automatically process X-ray images and several machine learning-based object (anomaly) detection, classification, and segmentation methods have been recently employed in X-ray image analysis. Due to the high potential of deep learning in related image processing applications, it has been used in most of the studies. This survey reviews the recent research on using computer vision and machine learning for X-ray analysis in industrial production and security applications and covers the applications, techniques, evaluation metrics, datasets, and performance comparison of those techniques on publicly available datasets. We also highlight some drawbacks in the published research and give recommendations for future research in computer vision-based X-ray analysis.

Deep Learning for Time Series Anomaly Detection: A Survey

Zahra Zamanzadeh Darban, Geoffrey I. Webb, Shirui Pan, Charu C. Aggarwal, Mahsa Salehi

Categories: Machine Learning | Artificial Intelligence

2022-11-09

Time series anomaly detection has applications in a wide range of research fields, including manufacturing and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, or heart fluttering, and is therefore of particular interest. The large size and complex patterns of time series have led researchers to develop specialised deep learning models for detecting anomalous patterns. This survey provides a structured and comprehensive overview of state-of-the-art deep learning models for time series anomaly detection. It provides a taxonomy based on the factors that divide anomaly detection models into different categories. Aside from describing the basic anomaly detection technique for each category, it also discusses their advantages and limitations. Furthermore, this study includes examples of deep anomaly detection in time series across various application domains in recent years. It finally summarises open issues in research and challenges faced while adopting deep anomaly detection models.

A Unified Survey on Anomaly, Novelty, Open-Set, and Out-of-Distribution Detection: Solutions and Future Challenges

Mohammadreza Salehi, Hossein Mirzaei, Dan Hendrycks, Yixuan Li, Mohammad Hossein Rohban, Mohammad Sabokrou

Categories: Computer Vision | Machine Learning

2021-10-26

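Machine learning models often encounter samples that diverge from the training distribution. Failing to recognize an out-of-distribution (OOD) sample, and consequently assigning it to an in-class label, significantly compromises the reliability of a model. The problem has gained significant attention because of its importance for safely deploying models in open-world settings. Detecting OOD samples is challenging because modeling all possible unknown distributions is intractable. To date, several research domains have tackled the problem of detecting unfamiliar samples, including anomaly detection, novelty detection, one-class learning, open-set recognition, and out-of-distribution detection. Despite having similar and shared concepts, out-of-distribution, open-set, and anomaly detection have been investigated independently. Accordingly, these research avenues have not cross-pollinated, which creates research barriers. While some surveys intend to provide an overview of these approaches, they seem to focus only on a specific domain without examining the relationships between domains. This survey aims to provide a cross-domain and comprehensive review of numerous eminent works in the respective areas while identifying their commonalities. Researchers can benefit from the overview of research advances in different fields and develop future methodology synergistically. Furthermore, to the best of our knowledge, while there are surveys on anomaly detection or one-class learning, there is no comprehensive or up-to-date survey on out-of-distribution detection, which our survey covers extensively. Finally, with a unified cross-domain perspective, we discuss and shed light on future lines of research, intending to bring these fields closer together.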

A Lightweight Reconstruction Network for Surface Defect Inspection

Chao Hu, Jian Yao, Weijie Wu, Weibin Qiu, Liqiang Zhu

Categories: Computer Vision | Machine Learning

2022-12-25

Currently, most deep learning methods cannot cope with the scarcity of industrial product defect samples and the significant differences in their characteristics. This paper proposes an unsupervised defect detection algorithm based on a reconstruction network, which is trained using only easily obtained defect-free samples. The network includes two parts: image reconstruction and surface defect area detection. The reconstruction network is designed as a fully convolutional autoencoder with a lightweight structure. Only a small number of normal samples are used for training, so that the reconstruction network learns to generate a defect-free reconstructed image. A function combining structural loss and $L_1$ loss is proposed as the loss function of the reconstruction network to address the poor detection of defects on irregularly textured surfaces. Further, the residual between the reconstructed image and the image under test is used as the candidate defect region, and conventional image operations then locate the defect. The proposed algorithm is evaluated on multiple defect image datasets. Compared with other similar algorithms, the results show that it has strong robustness and accuracy.
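
The loss described above combines a structural term with an $L_1$ term. The abstract gives neither the exact structural measure nor the weighting, so the following is only a sketch under stated assumptions: the structural term is taken to be 1 minus a single-window (global) SSIM, pixel values lie in [0, 1], and the two terms are mixed by a hypothetical weight `alpha`. Images are flattened into plain Python lists to keep the example dependency-free.

```python
def mean(values):
    return sum(values) / len(values)

def global_ssim(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    """SSIM computed over the whole image as one window (a simplification).

    c1 and c2 are the usual stabilising constants for a dynamic range of 1.
    """
    mx, my = mean(x), mean(y)
    var_x = mean([(a - mx) ** 2 for a in x])
    var_y = mean([(b - my) ** 2 for b in y])
    cov = mean([(a - mx) * (b - my) for a, b in zip(x, y)])
    numerator = (2 * mx * my + c1) * (2 * cov + c2)
    denominator = (mx ** 2 + my ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator

def l1_loss(x, y):
    """Mean absolute pixel difference."""
    return mean([abs(a - b) for a, b in zip(x, y)])

def reconstruction_loss(x, y, alpha=0.5):
    """Weighted sum of a structural term and an L1 term; alpha is an assumed weight."""
    return alpha * (1.0 - global_ssim(x, y)) + (1.0 - alpha) * l1_loss(x, y)

# Hypothetical flattened 2x2 images with intensities in [0, 1].
image = [0.1, 0.5, 0.9, 0.3]
reconstruction = [0.2, 0.4, 0.7, 0.6]
loss_same = reconstruction_loss(image, image)
loss_diff = reconstruction_loss(image, reconstruction)
```

A perfect reconstruction drives both terms to zero, while any mismatch in intensity or structure makes the loss positive. A practical implementation would normally use a windowed SSIM inside a deep learning framework rather than this global variant.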

HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization

E. Mathian, H. Liu, L. Fernandez-Cuesta, D. Samaras, M. Foll, L. Chen

Categories: Computer Vision | Artificial Intelligence

2022-08-06

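Unsupervised anomaly detection and localization is a crucial task, as it is impossible to collect and label all possible anomalies. Many studies have emphasized the importance of integrating local and global information to achieve accurate anomaly segmentation. To this end, there has been growing interest in the Transformer, which allows long-range content interactions to be modeled. However, global interactions through self-attention are generally too expensive for most image scales. In this study, we introduce HaloAE, the first auto-encoder based on a local 2D version of the Transformer with HaloNet. With HaloAE, we have created a hybrid model that combines convolution and local 2D block-wise self-attention layers and jointly performs anomaly detection and segmentation through a single model. We achieved competitive results on the MVTec dataset, suggesting that vision models incorporating the Transformer can benefit from a local computation of the self-attention operation, and paving the way for other applications.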

Y-GAN: Learning Dual Data Representations for Efficient Anomaly Detection

Marija Ivanovska, Vitomir Štruc

Categories: Computer Vision

2021-09-28

We propose a novel reconstruction-based model for anomaly detection, called Y-GAN. The model consists of a Y-shaped auto-encoder and represents images in two separate latent spaces. The first captures meaningful image semantics, key for representing (normal) training data, whereas the second encodes low-level residual image characteristics. To ensure the dual representations encode mutually exclusive information, a disentanglement procedure is designed around a latent (proxy) classifier. Additionally, a novel consistency loss is proposed to prevent information leakage between the latent spaces. The model is trained in a one-class learning setting using normal training data only. Due to the separation of semantically relevant and residual information, Y-GAN is able to derive informative data representations that allow for efficient anomaly detection across a diverse set of anomaly detection tasks. The model is evaluated in comprehensive experiments against several recent anomaly detection models on four popular datasets: MNIST, FMNIST, CIFAR10, and PlantVillage.

ARCADE: Adversarially Regularized Convolutional Autoencoder for Network Anomaly Detection

Willian T. Lunardi, Martin Andreoni Lopez, Jean-Pierre Giacalone

Categories: Machine Learning

2022-05-03

As the number of heterogeneous IP-connected devices and the traffic volume increase, so does the potential for security breaches. The undetected exploitation of these breaches can bring severe cybersecurity and privacy risks. Anomaly-based intrusion detection systems (IDSs) play an essential role in network security. In this paper, we present a practical unsupervised anomaly-based deep learning detection system called ARCADE (Adversarially Regularized Convolutional Autoencoder for unsupervised network anomaly DEtection). With a convolutional autoencoder (AE), ARCADE automatically builds a profile of the normal traffic using a subset of raw bytes of a few initial packets of network flows, so that potential network anomalies and intrusions can be efficiently detected before they cause more damage to the network. ARCADE is trained exclusively on normal traffic. An adversarial training strategy is proposed to regularize and decrease the AE's capability to reconstruct network flows that are outside the normal distribution, thereby improving its anomaly detection capabilities. The proposed approach is more effective than state-of-the-art deep learning approaches for network anomaly detection. Even when examining only two initial packets of a network flow, ARCADE can effectively detect malware infection and network attacks. ARCADE has 20 times fewer parameters than the baselines, achieving significantly faster detection speed and reaction time.

An Overview on the Generation and Detection of Synthetic and Manipulated Satellite Images

Lydia Abady, Edoardo Daniele Cannas, Paolo Bestagini, Benedetta Tondi, Stefano Tubaro, Mauro Barni

Categories: Computer Vision

2022-09-19

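Owing to falling technology costs and the growing number of satellite launches, satellite images are becoming more popular and easier to obtain. Besides serving benevolent purposes, satellite data can also be used for malicious ends such as misinformation. In fact, satellite images can be easily manipulated with general-purpose image editing tools. Moreover, with the surge of deep neural networks (DNNs) that can generate realistic synthetic imagery in various domains, additional threats related to the spread of synthetically generated satellite images are emerging. In this paper, we review the state of the art (SOTA) on the generation and manipulation of satellite images. In particular, we focus both on the generation of synthetic satellite imagery from scratch and on the semantic manipulation of satellite images by means of image-transfer techniques, including the transformation of images obtained from one type of sensor into another. We also describe the forensic detection techniques that have been studied so far to classify and detect synthetic image forgeries. While we focus mostly on forensic techniques explicitly tailored to the detection of AI-generated synthetic content, we also review some methods designed for general splicing detection, which can in principle also be used to spot AI-manipulated images.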

A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods

Mohammad Baradaran, Robert Bergevin

Categories: Computer Vision

2021-11-02

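Video anomaly detection is currently one of the hot research topics in computer vision, as abnormal events contain a large amount of information. Anomalies are among the main detection targets in surveillance systems and usually require real-time action. Given the limited availability of labeled data for training (that is, there is not enough labeled data for abnormal events), semi-supervised anomaly detection approaches have recently gained interest. This paper introduces researchers in the field to a new perspective and reviews recent deep learning-based semi-supervised video anomaly detection approaches according to the common strategies they use for anomaly detection. Our goal is to help researchers develop more effective video anomaly detection methods. Because choosing the right deep neural network (DNN) plays an important role in several parts of this task, a quick comparative review of DNNs is presented first. Unlike previous surveys, DNNs are reviewed from a spatiotemporal feature extraction viewpoint tailored to video anomaly detection. This part of the review can help researchers select suitable networks for different parts of their methods. Moreover, some state-of-the-art anomaly detection methods are critically surveyed according to their detection strategies. The review provides a novel and in-depth look at existing methods and identifies their shortcomings, which can serve as hints for future work.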

Deep Autoencoders for Anomaly Detection in Textured Images using CW-SSIM

Andrea Bionda, Luca Frittoli, Giacomo Boracchi

Categories: Computer Vision | Machine Learning

2022-08-30

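Detecting anomalous regions in images is a frequently encountered problem in industrial monitoring. A relevant example is the analysis of tissues and other products that conform to a specific texture under normal conditions, while defects introduce changes in the normal pattern. We address the anomaly detection problem by training a deep autoencoder, and we show that adopting a loss function based on Complex Wavelet Structural Similarity (CW-SSIM) yields superior detection performance on this type of image compared with traditional autoencoder loss functions. Our experiments on well-known anomaly detection benchmarks show that a simple model trained with this loss function can achieve performance comparable or superior to state-of-the-art methods that rely on deeper, larger, and more computationally demanding neural networks.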

Stroke-Based Scene Text Erasing Using Synthetic Data for Training

Zhengmi Tang, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

Categories: Computer Vision

2021-04-23

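Scene text erasing, which replaces text regions in natural images with plausible content, has drawn significant attention in the computer vision community in recent years. Scene text erasing involves two potential subtasks: text detection and image inpainting. Both subtasks require considerable data to perform well; however, the lack of a large-scale real-world scene text removal dataset prevents existing methods from realizing their potential. To compensate for the lack of paired real-world data, we made extensive use of synthetic text with additional enhancement and then trained our model only on the dataset generated by the improved synthetic text engine. Our proposed network contains a stroke mask prediction module and a background inpainting module; it extracts the text strokes from a cropped text image as relatively small holes, preserving more background content for better inpainting results. The model can partially erase text instances in a scene image given a bounding box, or work with an existing scene text detector for automatic scene text erasing. Qualitative and quantitative evaluation on the SCUT-Syn, ICDAR2013, and SCUT-EnsText datasets shows that our method significantly outperforms existing state-of-the-art methods even when they are trained on real-world data.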
