智能论文笔记

Quaternion-based dynamic mode decomposition for background modeling in color videos

Juan Han , Kit Ian Kou , Jifei Miao

分类：计算机视觉

2021-12-28

场景背景初始化（SBI）是计算机愿景中的具有挑战性之一。动态模式分解（DMD）是最近提出的方法，以鲁布妥地将视频序列分解为背景模型和相应的前景部分。然而，该方法需要将彩色图像转换为用于处理的灰度图像，这导致忽略彩色图像的三个通道之间的耦合信息。在本研究中，我们提出了一种基于四元数的DMD（Q-DMD），其通过四元数矩阵分析扩展了DMD，以便完全保留彩色图像的固有色彩结构和彩色视频。我们利用四元数矩阵的标准特征值来计算其光谱分解并计算相应的Q-DMD模式和特征值。公开的基准数据集上的结果证明我们的Q-DMD优于确切的DMD方法，实验结果还表明，我们的方法的性能与最先进的模式相当。

translated by 谷歌翻译

Exact Decomposition of Joint Low Rankness and Local Smoothness Plus Sparse Matrices

Jiangjun Peng , Yao Wang , Hongying Zhang , Jianjun Wang , Deyu Meng

分类：计算机视觉

2022-01-29

It is known that the decomposition in low-rank and sparse matrices (\textbf{L+S} for short) can be achieved by several Robust PCA techniques. Besides the low rankness, the local smoothness (\textbf{LSS}) is a vitally essential prior for many real-world matrix data such as hyperspectral images and surveillance videos, which makes such matrices have low-rankness and local smoothness properties at the same time. This poses an interesting question: Can we make a matrix decomposition in terms of \textbf{L\&LSS +S } form exactly? To address this issue, we propose in this paper a new RPCA model based on three-dimensional correlated total variation regularization (3DCTV-RPCA for short) by fully exploiting and encoding the prior expression underlying such joint low-rank and local smoothness matrices. Specifically, using a modification of Golfing scheme, we prove that under some mild assumptions, the proposed 3DCTV-RPCA model can decompose both components exactly, which should be the first theoretical guarantee among all such related methods combining low rankness and local smoothness. In addition, by utilizing Fast Fourier Transform (FFT), we propose an efficient ADMM algorithm with a solid convergence guarantee for solving the resulting optimization problem. Finally, a series of experiments on both simulations and real applications are carried out to demonstrate the general validity of the proposed 3DCTV-RPCA model.

translated by 谷歌翻译

Tensor Robust PCA with Nonconvex and Nonlocal Regularization

Xiaoyu Geng , Qiang Guo , Shuaixiong Hui , Caiming Zhang

分类：计算机视觉

2022-11-04

Tensor robust principal component analysis (TRPCA) is a promising way for low-rank tensor recovery, which minimizes the convex surrogate of tensor rank by shrinking each tensor singular values equally. However, for real-world visual data, large singular values represent more signifiant information than small singular values. In this paper, we propose a nonconvex TRPCA (N-TRPCA) model based on the tensor adjustable logarithmic norm. Unlike TRPCA, our N-TRPCA can adaptively shrink small singular values more and shrink large singular values less. In addition, TRPCA assumes that the whole data tensor is of low rank. This assumption is hardly satisfied in practice for natural visual data, restricting the capability of TRPCA to recover the edges and texture details from noisy images and videos. To this end, we integrate nonlocal self-similarity into N-TRPCA, and further develop a nonconvex and nonlocal TRPCA (NN-TRPCA) model. Specifically, similar nonlocal patches are grouped as a tensor and then each group tensor is recovered by our N-TRPCA. Since the patches in one group are highly correlated, all group tensors have strong low-rank property, leading to an improvement of recovery performance. Experimental results demonstrate that the proposed NN-TRPCA outperforms some existing TRPCA methods in visual data recovery. The demo code is available at https://github.com/qguo2010/NN-TRPCA.

translated by 谷歌翻译

Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode Decomposition and Res-UNet+ Neural Networks

Tianya T. Zhang , Peter J. Jin

分类：计算机视觉

2022-01-13

本文提出了一种机器学习增强的纵向扫描线方法，用于从大角度交通摄像机中提取车辆轨迹。通过将空间颞映射（STMAP）分解到稀疏前景和低秩背景，应用动态模式分解（DMD）方法来提取车辆股线。通过调整两个普遍的深度学习架构，设计了一个名为Res-Unet +的深神经网络。 RES-UNET +神经网络显着提高了基于STMAP的车辆检测的性能，DMD模型提供了许多有趣的见解，了解由Stmap保留的潜在空间结构的演变。与先前的图像处理模型和主流语义分割深神经网络进行比较模型输出。经过彻底的评估后，证明该模型对许多具有挑战性的因素来说是准确和强大的。最后但并非最不重要的是，本文从根本上解决了NGSIM轨迹数据中发现了许多质量问题。清除清洁的高质量轨迹数据，以支持交通流量和微观车辆控制的未来理论和建模研究。该方法是用于基于视频的轨迹提取的可靠解决方案，并且具有广泛的适用性。

translated by 谷歌翻译

Robust Implementation of Foreground Extraction and Vessel Segmentation for X-ray Coronary Angiography Image Sequence

Zeyu Fu , Zhuang Fu , Chenzhuo Lv , Jun Yan

分类：计算机视觉

2022-09-15

从X射线冠状动脉造影（XCA）图像序列中提取对比度的血管对于直觉诊断和治疗具有重要的临床意义。在这项研究中，XCA图像序列O被认为是三维张量输入，血管层H是稀疏张量，而背景层B是低级别张量。使用张量核标准（TNN）最小化，提出了一种基于张量的强稳定主成分分析（TRPCA）的新型血管层提取方法。此外，考虑了血管的不规则运动和周围无关组织的动态干扰，引入了总变化（TV）正规化时空约束，以分离动态背景E。 - 阶段区域生长（TSRG）方法用于血管增强和分割。全局阈值分割用作获得主分支的预处理，并使用ra样特征（RLF）滤波器来增强和连接破碎的小段，最终的容器掩模是通过结合两个中间结果来构建的。我们评估了TV-TRPCA算法的前景提取的可见性以及TSRG算法在真实临床XCA图像序列和第三方数据库上的血管分割的准确性。定性和定量结果都验证了所提出的方法比现有的最新方法的优越性。

translated by 谷歌翻译

Tensor Full Feature Measure and Its Nonconvex Relaxation Applications to Tensor Recovery

Hongbing Zhang , Xinyi Liu , Hongtao Fan , Yajing Li , Yinlin Ye

分类：计算机视觉 | 机器学习

2021-09-25

张量稀疏建模是一种有希望的方法，在整个科学和工程学中，取得了巨大的成功。众所周知，实际应用中的各种数据通常由多种因素产生，因此使用张量表示包含多个因素内部结构的数据。但是，与矩阵情况不同，构建合理的稀疏度量张量是一项相对困难且非常重要的任务。因此，在本文中，我们提出了一种称为张量全功能度量（FFM）的新张量稀疏度度量。它可以同时描述张量的每个维度的特征信息以及两个维度之间的相关特征，并将塔克等级与张量管等级连接。这种测量方法可以更全面地描述张量的稀疏特征。在此基础上，我们建立了其非凸放松，并将FFM应用于低级张量完成（LRTC）和张量鲁棒的主成分分析（TRPCA）。提出了基于FFM的LRTC和TRPCA模型，并开发了两种有效的交替方向乘数法（ADMM）算法来求解所提出的模型。各种实际数值实验证实了超出最先进的方法的优势。

translated by 谷歌翻译

"Sparse + Low-Rank'' Tensor Completion Approach for Recovering Images and Videos

Chenjian Pan , Chen Ling , Hongjin He , Liqun Qi , Yanwei Xu

分类：计算机视觉

2021-10-18

从高度不足的数据中恢复颜色图像和视频是面部识别和计算机视觉中的一项基本且具有挑战性的任务。通过颜色图像和视频的多维性质，在本文中，我们提出了一种新颖的张量完成方法，该方法能够有效探索离散余弦变换（DCT）下张量数据的稀疏性。具体而言，我们介绍了两个``稀疏 +低升级''张量完成模型，以及两种可实现的算法来找到其解决方案。第一个是基于DCT的稀疏加权核标准诱导低级最小化模型。第二个是基于DCT的稀疏加上$ P $换图映射引起的低秩优化模型。此外，我们因此提出了两种可实施的增强拉格朗日算法，以解决基础优化模型。一系列数值实验在内，包括颜色图像介入和视频数据恢复表明，我们所提出的方法的性能要比许多现有的最新张量完成方法更好，尤其是对于缺少数据比率较高的情况。

translated by 谷歌翻译

Non-Convex Tensor Low-Rank Approximation for Infrared Small Target Detection

Ting Liu , Jungang Yang , Boyang Li , Chao Xiao , Yang Sun , Yingqian Wang , Wei An

分类：计算机视觉

2021-05-31

红外小目标检测是红外系统中的重要基本任务。因此，已经提出了许多红外小目标检测方法，其中低级模型已被用作强大的工具。然而，基于低级别的方法为不同的奇异值分配相同的权重，这将导致背景估计不准确。考虑到不同的奇异值具有不同的重要性，并且应判别处理，本文提出了一种用于红外小目标检测的非凸张力低秩近似（NTLA）方法。在我们的方法中，NTLA正则化将不同的权重自适应分配给不同的奇异值以进行准确背景估计。基于所提出的NTLA，我们提出了不对称的空间 - 时间总变化（ASTTV）正则化，以实现复杂场景中的更准确的背景估计。与传统的总变化方法相比，ASTTV利用不同的平滑度强度进行空间和时间正则化。我们设计了一种有效的算法来查找我们方法的最佳解决方案。与一些最先进的方法相比，所提出的方法达到各种评估指标的改进。各种复杂场景的广泛实验结果表明，我们的方法具有强大的鲁棒性和低误报率。代码可在https://github.com/liuting20a/asttv-ntla获得。

translated by 谷歌翻译

Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra

Liang Liao , Sen Lin , Lun Li , Xiuwei Zhang , Song Zhao , Yan Wang , Xinqiang Wang , Qi Gao , Jingyu Wang

分类：机器学习

2022-02-01

在大数据的时代，通过单数值分解的图像近似近似。但是，奇异值分解（SVD）仅用于订单两个数据，即矩阵。有必要将高阶输入变成矩阵或将其分解为一系列订单两个切片，以解决具有SVD的多光谱图像和视频等高阶数据。高阶奇异值分解（HOSVD）扩展了SVD，可以使用一些排名一的组件的总和近似高阶数据。我们考虑将HOSVD推广到有限维度的代数上的问题。该代数（称为T-Algebra）概括了复数。代数的元素（称为t-scalars）是固定大小的复数阵列。可以将矩阵和张量概括在T量标准上，然后扩展许多规范矩阵和张量算法，包括HOSVD，以获得更高的性能版本。 HOSVD的概括称为THOSVD。交替的算法可以进一步提高其近似多路数据的性能。 THOSVD还统一了广泛的主要组件分析算法。为了利用T-scalars进行近似图像利用广义算法的潜力，我们使用像素邻域策略将每个像素转换为“更深入”的T-Scalar。公开图像的实验表明，T型量表的广义算法，即ThoSVD，与其规范对应物进行了优惠。

translated by 谷歌翻译

Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions

Nathan Halko , Per-Gunnar Martinsson , Joel A. Tropp

分类：

2009-09-22

Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets.This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed-either explicitly or implicitly-to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, speed, and robustness. These claims are supported by extensive numerical experiments and a detailed error analysis.The specific benefits of randomized techniques depend on the computational environment. Consider the model problem of finding the k dominant components of the singular value decomposition of an m × n matrix. (i) For a dense input matrix, randomized algorithms require O(mn log(k)) floating-point operations (flops) in contrast with O(mnk) for classical algorithms. (ii) For a sparse input matrix, the flop count matches classical Krylov subspace methods, but the randomized approach is more robust and can easily be reorganized to exploit multi-processor architectures. (iii) For a matrix that is too large to fit in fast memory, the randomized techniques require only a constant number of passes over the data, as opposed to O(k) passes for classical algorithms. In fact, it is sometimes possible to perform matrix approximation with a single pass over the data.

translated by 谷歌翻译

Rigorous data-driven computation of spectral properties of Koopman operators for dynamical systems

Matthew J. Colbrook , Alex Townsend

分类：机器学习

2021-11-29

Koopman运算符是无限维的运算符，可全球线性化非线性动态系统，使其光谱信息可用于理解动态。然而，Koopman运算符可以具有连续的光谱和无限维度的子空间，使得它们的光谱信息提供相当大的挑战。本文介绍了具有严格融合的数据驱动算法，用于从轨迹数据计算Koopman运算符的频谱信息。我们引入了残余动态模式分解（ResDMD），它提供了第一种用于计算普通Koopman运算符的Spectra和PseudtoStra的第一种方案，无需光谱污染。使用解析器操作员和RESDMD，我们还计算与测量保存动态系统相关的光谱度量的平滑近似。我们证明了我们的算法的显式收敛定理，即使计算连续频谱和离散频谱的密度，也可以实现高阶收敛即使是混沌系统。我们展示了在帐篷地图，高斯迭代地图，非线性摆，双摆，洛伦茨系统和11美元延长洛伦兹系统的算法。最后，我们为具有高维状态空间的动态系统提供了我们的算法的核化变体。这使我们能够计算与具有20,046维状态空间的蛋白质分子的动态相关的光谱度量，并计算出湍流流过空气的误差界限的非线性Koopman模式，其具有雷诺数为$> 10 ^ 5 $。一个295,122维的状态空间。

translated by 谷歌翻译

Digital Twin Data Modelling by Randomized Orthogonal Decomposition and Deep Learning

Diana Alina Bistrian , Omer San , Ionel Michael Navon

分类：机器学习

2022-06-17

数字双胞胎是一个代孕模型，具有反映原始过程行为的主要功能。将动力学过程与降低复杂性的数字双模型相关联具有很大的优势，可以将动力学以高精度和CPU时间和硬件的成本降低到遭受重大变化的时间表，因此很难探索。本文介绍了一个新的框架，用于创建有效的数字双流体流量流量。我们介绍了一种新型算法，该算法结合了基于Krylov的动态模式分解的优势和正确的正交分解，并优于选择最有影响力的模式。我们证明，随机正交分解算法提供了比SVD经验正交分解方法的几个优点，并减轻了对多目标优化问题的投影误差。我们涉及最先进的艺术人工智能（DL）以执行实时的实时学习（DL）数字双胞胎模型的自适应校准，富裕性的增加。该输出是流体流动动力学的高保真数字双数据数据模型，具有降低的复杂性。在三波现象的数值模拟中，随着复杂性的增加，研究了新的建模工具。我们表明，输出与原始源数据一致。我们在数值准确性和计算效率方面对新数字数据模型的性能进行彻底评估，包括时间模拟响应功能研究。

translated by 谷歌翻译

Tensor Factorization via Transformed Tensor-Tensor Product for Image Alignment

Sijia Xia , Duo Qiu , Xiongjun Zhang

分类：计算机视觉

2022-12-12

In this paper, we study the problem of a batch of linearly correlated image alignment, where the observed images are deformed by some unknown domain transformations, and corrupted by additive Gaussian noise and sparse noise simultaneously. By stacking these images as the frontal slices of a third-order tensor, we propose to utilize the tensor factorization method via transformed tensor-tensor product to explore the low-rankness of the underlying tensor, which is factorized into the product of two smaller tensors via transformed tensor-tensor product under any unitary transformation. The main advantage of transformed tensor-tensor product is that its computational complexity is lower compared with the existing literature based on transformed tensor nuclear norm. Moreover, the tensor $\ell_p$ $(0<p<1)$ norm is employed to characterize the sparsity of sparse noise and the tensor Frobenius norm is adopted to model additive Gaussian noise. A generalized Gauss-Newton algorithm is designed to solve the resulting model by linearizing the domain transformations and a proximal Gauss-Seidel algorithm is developed to solve the corresponding subproblem. Furthermore, the convergence of the proximal Gauss-Seidel algorithm is established, whose convergence rate is also analyzed based on the Kurdyka-$\L$ojasiewicz property. Extensive numerical experiments on real-world image datasets are carried out to demonstrate the superior performance of the proposed method as compared to several state-of-the-art methods in both accuracy and computational time.

translated by 谷歌翻译

Roadside Lidar Vehicle Detection and Tracking Using Range And Intensity Background Subtraction

Tianya Zhang , Peter J. Jin

分类：计算机视觉

2022-01-13

在本文中，我们使用两个无监督的学习算法的组合介绍了路边激光雷达物体检测的解决方案。 3D点云数据首先将球形坐标转换成球形坐标并使用散列函数填充到方位角网格矩阵中。之后，RAW LIDAR数据被重新排列成空间 - 时间数据结构，以存储范围，方位角和强度的信息。基于强度信道模式识别，应用动态模式分解方法将点云数据分解成低级背景和稀疏前景。三角算法根据范围信息，自动发现分割值以将移动目标与静态背景分开。在强度和范围背景减法之后，将使用基于密度的检测器检测到前景移动物体，并编码到状态空间模型中以进行跟踪。所提出的模型的输出包括车辆轨迹，可以实现许多移动性和安全应用。该方法针对商业流量数据收集平台进行了验证，并证明了对基础设施激光雷达对象检测的高效可靠的解决方案。与之前的方法相比，该方法直接处理散射和离散点云，所提出的方法可以建立3D测量数据的复杂线性关系较小，这捕获了我们经常需要的空间时间结构。

translated by 谷歌翻译

Quasi Non-Negative Quaternion Matrix Factorization with Application to Color Face Recognition

Yifen Ke , Changfeng Ma , Zhigang Jia , Yajun Xie , Riwei Liao

分类：计算机视觉

2022-11-30

To address the non-negativity dropout problem of quaternion models, a novel quasi non-negative quaternion matrix factorization (QNQMF) model is presented for color image processing. To implement QNQMF, the quaternion projected gradient algorithm and the quaternion alternating direction method of multipliers are proposed via formulating QNQMF as the non-convex constraint quaternion optimization problems. Some properties of the proposed algorithms are studied. The numerical experiments on the color image reconstruction show that these algorithms encoded on the quaternion perform better than these algorithms encoded on the red, green and blue channels. Furthermore, we apply the proposed algorithms to the color face recognition. Numerical results indicate that the accuracy rate of face recognition on the quaternion model is better than on the red, green and blue channels of color image as well as single channel of gray level images for the same data, when large facial expressions and shooting angle variations are presented.

translated by 谷歌翻译

Color Image Inpainting via Robust Pure Quaternion Matrix Completion: Error Bound and Weighted Loss

Junren Chen , Michael K. Ng

分类：计算机视觉

2022-02-04

在本文中，我们将颜色图像插入作为纯季基矩阵完成问题。在文献中，季节矩阵完成的理论保证并不确定。我们的主要目的是提出一个新的最小化问题，并将核标准和三个通道之间的二次损失相结合。为了填补理论空缺，我们获得了在干净和损坏的政权中绑定的错误，这依赖于四元素矩阵的一些新结果。在强大的完成中考虑了一般的高斯噪音，所有观察都被损坏。由于界限的动机，我们建议通过二次损失中的跨通道重量来处理不平衡或相关的噪声，这是重新平衡噪声水平或消除噪声相关性的主要目的。提供了有关合成和颜色图像数据的广泛实验结果，以确认和证明我们的理论发现。

translated by 谷歌翻译

Robust Principal Component Analysis?

Emmanuel J. Candes , Xiaodong Li , Yi Ma , John Wright

分类：

2009-12-18

This paper is about a curious phenomenon. Suppose we have a data matrix, which is the superposition of a low-rank component and a sparse component. Can we recover each component individually? We prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the 1 norm. This suggests the possibility of a principled approach to robust principal component analysis since our methodology and results assert that one can recover the principal components of a data matrix even though a positive fraction of its entries are arbitrarily corrupted. This extends to the situation where a fraction of the entries are missing as well. We discuss an algorithm for solving this optimization problem, and present applications in the area of video surveillance, where our methodology allows for the detection of objects in a cluttered background, and in the area of face recognition, where it offers a principled way of removing shadows and specularities in images of faces.

translated by 谷歌翻译

Hyperspectral Mixed Noise Removal via Subspace Representation and Weighted Low-rank Tensor Regularization

Hang Zhou , Yanchi Su , Zhanshan Li

分类：计算机视觉

2021-11-13

最近，从图像中提取的不同组件的低秩属性已经考虑在MAN Hypspectral图像去噪方法中。然而，这些方法通常将3D矩阵或1D向量展开，以利用现有信息，例如非识别空间自相似性（NSS）和全局光谱相关（GSC），其破坏了高光谱图像的内在结构相关性（HSI））因此导致恢复质量差。此外，由于在HSI的原始高维空间中的矩阵和张量的矩阵和张量的参与，其中大多数受到重大计算负担问题。我们使用子空间表示和加权低级张量正则化（SWLRTR）进入模型中以消除高光谱图像中的混合噪声。具体地，为了在光谱频带中使用GSC，将噪声HSI投影到简化计算的低维子空间中。之后，引入加权的低级张量正则化术语以表征缩减图像子空间中的前导。此外，我们设计了一种基于交替最小化的算法来解决非耦合问题。模拟和实时数据集的实验表明，SWLRTR方法比定量和视觉上的其他高光谱去噪方法更好。

translated by 谷歌翻译

Multi-mode Core Tensor Factorization based Low-Rankness and Its Applications to Tensor Completion

Haijin Zeng

分类：计算机视觉

2020-12-03

低级张力完成已广泛用于计算机视觉和机器学习。本文开发了一种新型多模态核心张量分解（MCTF）方法，与张量低秩测量和该措施的更好的非凸弛豫形式（NC-MCTF）。所提出的模型编码由Tucker和T-SVD提供的一般张量的低秩见解，因此预计将在多个方向上同时模拟光谱低秩率，并准确地恢复基于几个观察到的条目的内在低秩结构的数据。此外，我们研究了MCTF和NC-MCTF正则化最小化问题，并设计了一个有效的块连续上限最小化（BSUM）算法来解决它们。该高效的求解器可以将MCTF扩展到各种任务，例如张量完成。一系列实验，包括高光谱图像（HSI），视频和MRI完成，确认了所提出的方法的卓越性能。

translated by 谷歌翻译

DTVNet+: A High-Resolution Scenic Dataset for Dynamic Time-lapse Video Generation

Jiangning Zhang , Chao Xu , Yong Liu , Yunliang Jiang

分类：计算机视觉

2020-08-11

本文介绍了一个名为DTVNet的新型端到端动态时间流逝视频生成框架，以从归一化运动向量上的单个景观图像生成多样化的延期视频。所提出的DTVNET由两个子模块组成：\ EMPH {光学流编码器}（OFE）和\ EMPH {动态视频生成器}（DVG）。 OFE将一系列光学流程图映射到编码所生成视频的运动信息的\ Emph {归一化运动向量}。 DVG包含来自运动矢量和单个景观图像的运动和内容流。此外，它包含一个编码器，用于学习共享内容特征和解码器，以构造具有相应运动的视频帧。具体地，\ EMPH {运动流}介绍多个\ EMPH {自适应实例归一化}（Adain）层，以集成用于控制对象运动的多级运动信息。在测试阶段，基于仅一个输入图像，可以产生具有相同内容但具有相同运动信息但各种运动信息的视频。此外，我们提出了一个高分辨率的景区时间流逝视频数据集，命名为快速天空时间，以评估不同的方法，可以被视为高质量景观图像和视频生成任务的新基准。我们进一步对天空延时，海滩和快速天空数据集进行实验。结果证明了我们对最先进的方法产生高质量和各种动态视频的方法的优越性。

translated by 谷歌翻译