智能论文笔记

Residual Dense Network for Image Super-Resolution

Yulun Zhang , Yapeng Tian , Yu Kong , Bineng Zhong , Yun Fu

分类：

2018-02-24

A very deep convolutional neural network (CNN) has recently achieved great success for image super-resolution (SR) and offered hierarchical features as well. However, most deep CNN based SR models do not make full use of the hierarchical features from the original low-resolution (LR) images, thereby achieving relatively-low performance. In this paper, we propose a novel residual dense network (RDN) to address this problem in image SR. We fully exploit the hierarchical features from all the convolutional layers. Specifically, we propose residual dense block (RDB) to extract abundant local features via dense connected convolutional layers. RDB further allows direct connections from the state of preceding RDB to all the layers of current RDB, leading to a contiguous memory (CM) mechanism. Local feature fusion in RDB is then used to adaptively learn more effective features from preceding and current local features and stabilizes the training of wider network. After fully obtaining dense local features, we use global feature fusion to jointly and adaptively learn global hierarchical features in a holistic way. Experiments on benchmark datasets with different degradation models show that our RDN achieves favorable performance against state-of-the-art methods.

translated by 谷歌翻译

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

Yulun Zhang , Kunpeng Li , Kai Li , Lichen Wang , Bineng Zhong , Yun Fu

分类：

2018-07-08

Convolutional neural network (CNN) depth is of crucial importance for image super-resolution (SR). However, we observe that deeper networks for image SR are more difficult to train. The lowresolution inputs and features contain abundant low-frequency information, which is treated equally across channels, hence hindering the representational ability of CNNs. To solve these problems, we propose the very deep residual channel attention networks (RCAN). Specifically, we propose a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections. Each residual group contains some residual blocks with short skip connections. Meanwhile, RIR allows abundant low-frequency information to be bypassed through multiple skip connections, making the main network focus on learning high-frequency information. Furthermore, we propose a channel attention mechanism to adaptively rescale channel-wise features by considering interdependencies among channels. Extensive experiments show that our RCAN achieves better accuracy and visual improvements against state-of-the-art methods.

translated by 谷歌翻译

Iterative Network for Image Super-Resolution

Yuqing Liu , Shiqi Wang , Jian Zhang , Shanshe Wang , Siwei Ma , Wen Gao

分类：计算机视觉

2020-05-20

单像超分辨率（SISR），作为传统的不良反对问题，通过最近的卷积神经网络（CNN）的发展得到了极大的振兴。这些基于CNN的方法通常将低分辨率图像映射到其相应的高分辨率版本，具有复杂的网络结构和损耗功能，显示出令人印象深刻的性能。本文对传统的SISR算法提供了新的洞察力，并提出了一种基本上不同的方法，依赖于迭代优化。提出了一种新颖的迭代超分辨率网络（ISRN），顶部是迭代优化。我们首先分析图像SR问题的观察模型，通过以更一般和有效的方式模仿和融合每次迭代来激发可行的解决方案。考虑到批量归一化的缺点，我们提出了一种特征归一化（F-NOM，FN）方法来调节网络中的功能。此外，开发了一种具有FN的新颖块以改善作为FNB称为FNB的网络表示。剩余剩余结构被提出形成一个非常深的网络，其中FNBS与长时间跳过连接，以获得更好的信息传递和稳定训练阶段。对BICUBIC（BI）降解的测试基准的广泛实验结果表明我们的ISRN不仅可以恢复更多的结构信息，而且还可以获得竞争或更好的PSNR / SSIM结果，与其他作品相比，参数更少。除BI之外，我们除了模拟模糊（BD）和低级噪声（DN）的实际降级。 ISRN及其延伸ISRN +两者都比使用BD和DN降级模型的其他产品更好。

translated by 谷歌翻译

Second-order attention network for single image super-resolution

分类：

Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and obtained remarkable performance. However, most of the existing CNN-based SISR methods mainly focus on wider or deeper architecture design, neglecting to explore the feature correlations of intermediate layers, hence hindering the representational power of CNNs. To address this issue, in this paper, we propose a second-order attention network (SAN) for more powerful feature expression and feature correlation learning. Specifically, a novel trainable second-order channel attention (SOCA) module is developed to adaptively rescale the channel-wise features by using second-order feature statistics for more discriminative representations. Furthermore, we present a non-locally enhanced residual group (NLRG) structure, which not only incorporates non-local operations to capture long-distance spatial contextual information, but also contains repeated local-source residual attention groups (LSRAG) to learn increasingly abstract feature representations. Experimental results demonstrate the superiority of our SAN network over state-of-the-art SISR methods in terms of both quantitative metrics and visual quality.

translated by 谷歌翻译

MemNet: A Persistent Memory Network for Image Restoration

Ying Tai , Jian Yang , Xiaoming Liu , Chunyan Xu

分类：

2017-08-07

Recently, very deep convolutional neural networks (CNNs) have been attracting considerable attention in image restoration. However, as the depth grows, the long-term dependency problem is rarely realized for these very deep models, which results in the prior states/layers having little influence on the subsequent ones. Motivated by the fact that human thoughts have persistency, we propose a very deep persistent memory network (MemNet) that introduces a memory block, consisting of a recursive unit and a gate unit, to explicitly mine persistent memory through an adaptive learning process. The recursive unit learns multi-level representations of the current state under different receptive fields. The representations and the outputs from the previous memory blocks are concatenated and sent to the gate unit, which adaptively controls how much of the previous states should be reserved, and decides how much of the current state should be stored. We apply MemNet to three image restoration tasks, i.e., image denosing, superresolution and JPEG deblocking. Comprehensive experiments demonstrate the necessity of the MemNet and its unanimous superiority on all three tasks over the state of the arts. Code is available at https://github.com/ tyshiwo/MemNet.

translated by 谷歌翻译

Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning

Bin-Cheng Yang , Gangshan Wu

分类：计算机视觉

2021-12-31

已经证明了深度卷积神经网络近年来对SISR有效。一方面，已经广泛使用了残余连接和密集连接，以便于前向信息和向后梯度流动以提高性能。然而，当前方法以次优的方式在大多数网络层中单独使用残留连接和密集连接。另一方面，虽然各种网络和方法旨在改善计算效率，节省参数或利用彼此的多种比例因子的训练数据来提升性能，但它可以在人力资源空间中进行超级分辨率来具有高计算成本或不能在不同尺度因子的模型之间共享参数以节省参数和推理时间。为了解决这些挑战，我们提出了一种使用双路径连接的高效单图像超分辨率网络，其多种规模学习命名为EMSRDPN。通过将双路径的双路径连接引入EMSRDPN，它在大多数网络层中以综合方式使用残差连接和密集连接。双路径连接具有重用残余连接的共同特征和探索密集连接的新功能，以了解SISR的良好代表。要利用多种比例因子的特征相关性，EMSRDPN在不同缩放因子之间共享LR空间中的所有网络单元，以学习共享功能，并且仅使用单独的重建单元进行每个比例因子，这可以利用多种规模因子的培训数据来帮助各个规模因素另外提高性能，同时可以节省参数并支持共享推理，以提高效率的多种规模因素。实验显示EMSRDPN通过SOTA方法实现更好的性能和比较或更好的参数和推理效率。

translated by 谷歌翻译

Image super-resolution via deep recursive residual network

分类：

Recently, Convolutional Neural Network (CNN) based models have achieved great success in Single Image Super-Resolution (SISR). Owing to the strength of deep networks, these CNN models learn an effective nonlinear mapping from the low-resolution input image to the high-resolution target image, at the cost of requiring enormous parameters. This paper proposes a very deep CNN model (up to 52 convolutional layers) named Deep Recursive Residual Network (DRRN) that strives for deep yet concise networks. Specifically, residual learning is adopted, both in global and local manners, to mitigate the difficulty of training very deep net-works; recursive learning is used to control the model parameters while increasing the depth. Extensive benchmark evaluation shows that DRRN significantly outperforms state of the art in SISR, while utilizing far fewer parameters. Code is available at https://github.com/tyshiwo /DRRN CVPR17.

translated by 谷歌翻译

Single Image Super-Resolution via a Holistic Attention Network

Ben Niu , Weilei Wen , Wenqi Ren , Xiangde Zhang , Lianping Yang , Shuzhen Wang , Kaihao Zhang , Xiaochun Cao , Haifeng Shen

分类：

2020-08-20

Informative features play a crucial role in the single image super-resolution task. Channel attention has been demonstrated to be effective for preserving information-rich features in each layer. However, channel attention treats each convolution layer as a separate process that misses the correlation among different layers. To address this problem, we propose a new holistic attention network (HAN), which consists of a layer attention module (LAM) and a channel-spatial attention module (CSAM), to model the holistic interdependencies among layers, channels, and positions. Specifically, the proposed LAM adaptively emphasizes hierarchical features by considering correlations among layers. Meanwhile, CSAM learns the confidence at all the positions of each channel to selectively capture more informative features. Extensive experiments demonstrate that the proposed HAN performs favorably against the state-ofthe-art single image super-resolution approaches.

translated by 谷歌翻译

Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution

Yuqing Liu , Qi Jia , Xin Fan , Shanshe Wang , Siwei Ma , Wen Gao

分类：计算机视觉

2022-01-05

将低分辨率（LR）图像恢复到超分辨率（SR）图像具有正确和清晰的细节是挑战。现有的深度学习工作几乎忽略了图像的固有结构信息，这是对SR结果的视觉感知的重要作用。在本文中，我们将分层特征开发网络设计为探测并以多尺度特征融合方式保持结构信息。首先，我们提出了在传统边缘探测器上的交叉卷积，以定位和代表边缘特征。然后，交叉卷积块（CCBS）设计有功能归一化和渠道注意，以考虑特征的固有相关性。最后，我们利用多尺度特征融合组（MFFG）来嵌入交叉卷积块，并在层次的层次上开发不同尺度的结构特征的关系，调用名为Cross-SRN的轻量级结构保护网络。实验结果表明，交叉SRN通过准确且清晰的结构细节实现了对最先进的方法的竞争或卓越的恢复性能。此外，我们设置了一个标准，以选择具有丰富的结构纹理的图像。所提出的跨SRN优于所选择的基准测试的最先进的方法，这表明我们的网络在保存边缘具有显着的优势。

translated by 谷歌翻译

Bridging Component Learning with Degradation Modelling for Blind Image Super-Resolution

Yixuan Wu , Feng Li , Huihui Bai , Weisi Lin , Runmin Cong , Yao Zhao

分类：计算机视觉

2022-12-03

Convolutional Neural Network (CNN)-based image super-resolution (SR) has exhibited impressive success on known degraded low-resolution (LR) images. However, this type of approach is hard to hold its performance in practical scenarios when the degradation process is unknown. Despite existing blind SR methods proposed to solve this problem using blur kernel estimation, the perceptual quality and reconstruction accuracy are still unsatisfactory. In this paper, we analyze the degradation of a high-resolution (HR) image from image intrinsic components according to a degradation-based formulation model. We propose a components decomposition and co-optimization network (CDCN) for blind SR. Firstly, CDCN decomposes the input LR image into structure and detail components in feature space. Then, the mutual collaboration block (MCB) is presented to exploit the relationship between both two components. In this way, the detail component can provide informative features to enrich the structural context and the structure component can carry structural context for better detail revealing via a mutual complementary manner. After that, we present a degradation-driven learning strategy to jointly supervise the HR image detail and structure restoration process. Finally, a multi-scale fusion module followed by an upsampling layer is designed to fuse the structure and detail features and perform SR reconstruction. Empowered by such degradation-based components decomposition, collaboration, and mutual optimization, we can bridge the correlation between component learning and degradation modelling for blind SR, thereby producing SR results with more accurate textures. Extensive experiments on both synthetic SR datasets and real-world images show that the proposed method achieves the state-of-the-art performance compared to existing methods.

translated by 谷歌翻译

Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network

Namhyuk Ahn , Byungkon Kang , Kyung-Ah Sohn

分类：

2018-03-23

In recent years, deep learning methods have been successfully applied to single-image super-resolution tasks. Despite their great performances, deep learning methods cannot be easily applied to realworld applications due to the requirement of heavy computation. In this paper, we address this issue by proposing an accurate and lightweight deep network for image super-resolution. In detail, we design an architecture that implements a cascading mechanism upon a residual network. We also present variant models of the proposed cascading residual network to further improve efficiency. Our extensive experiments show that even with much fewer parameters and operations, our models achieve performance comparable to that of state-of-the-art methods.

translated by 谷歌翻译

Efficient Image Super-Resolution with Feature Interaction Weighted Hybrid Network

Wenjie Li , Juncheng Li , Guangwei Gao , Weihong Deng , Jian Yang , Guo-Jun Qi , Chia-Wen Lin

分类：计算机视觉

2022-12-29

Recently, great progress has been made in single-image super-resolution (SISR) based on deep learning technology. However, the existing methods usually require a large computational cost. Meanwhile, the activation function will cause some features of the intermediate layer to be lost. Therefore, it is a challenge to make the model lightweight while reducing the impact of intermediate feature loss on the reconstruction quality. In this paper, we propose a Feature Interaction Weighted Hybrid Network (FIWHN) to alleviate the above problem. Specifically, FIWHN consists of a series of novel Wide-residual Distillation Interaction Blocks (WDIB) as the backbone, where every third WDIBs form a Feature shuffle Weighted Group (FSWG) by mutual information mixing and fusion. In addition, to mitigate the adverse effects of intermediate feature loss on the reconstruction results, we introduced a well-designed Wide Convolutional Residual Weighting (WCRW) and Wide Identical Residual Weighting (WIRW) units in WDIB, and effectively cross-fused features of different finenesses through a Wide-residual Distillation Connection (WRDC) framework and a Self-Calibrating Fusion (SCF) unit. Finally, to complement the global features lacking in the CNN model, we introduced the Transformer into our model and explored a new way of combining the CNN and Transformer. Extensive quantitative and qualitative experiments on low-level and high-level tasks show that our proposed FIWHN can achieve a good balance between performance and efficiency, and is more conducive to downstream tasks to solve problems in low-pixel scenarios.

translated by 谷歌翻译

Multi-scale Attention Network for Image Super-Resolution

Yan Wang , Yusen Li , Gang Wang , Xiaoguang Liu

分类：计算机视觉

2022-09-28

通过利用大型内核分解和注意机制，卷积神经网络（CNN）可以在许多高级计算机视觉任务中与基于变压器的方法竞争。但是，由于远程建模的优势，具有自我注意力的变压器仍然主导着低级视野，包括超分辨率任务。在本文中，我们提出了一个基于CNN的多尺度注意网络（MAN），该网络由多尺度的大内核注意力（MLKA）和一个封闭式的空间注意单元（GSAU）组成，以提高卷积SR网络的性能。在我们的MLKA中，我们使用多尺度和栅极方案纠正LKA，以在各种粒度水平上获得丰富的注意图，从而共同汇总了全局和局部信息，并避免了潜在的阻塞伪像。在GSAU中，我们集成了栅极机制和空间注意力，以消除不必要的线性层和汇总信息丰富的空间环境。为了确认我们的设计的有效性，我们通过简单地堆叠不同数量的MLKA和GSAU来评估具有多种复杂性的人。实验结果表明，我们的人可以在最先进的绩效和计算之间实现各种权衡。代码可从https://github.com/icandle/man获得。

translated by 谷歌翻译

FreqNet: A Frequency-domain Image Super-Resolution Network with Dicrete Cosine Transform

Runyuan Cai , Yue Ding , Hongtao Lu

分类：计算机视觉

2021-11-21

单个图像超分辨率（SISR）是一个不良问题，旨在获得从低分辨率（LR）输入的高分辨率（HR）输出，在此期间应该添加额外的高频信息以改善感知质量。现有的SISR工作主要通过最小化平均平方重建误差来在空间域中运行。尽管高峰峰值信噪比（PSNR）结果，但难以确定模型是否正确地添加所需的高频细节。提出了一些基于基于残余的结构，以指导模型暗示高频率特征。然而，由于空间域度量的解释是有限的，如何验证这些人为细节的保真度仍然是一个问题。在本文中，我们提出了频率域视角来的直观管道，解决了这个问题。由现有频域的工作启发，我们将图像转换为离散余弦变换（DCT）块，然后改革它们以获取DCT功能映射，它用作我们模型的输入和目标。设计了专门的管道，我们进一步提出了符合频域任务的性质的频率损失功能。我们的SISR方法在频域中可以明确地学习高频信息，为SR图像提供保真度和良好的感知质量。我们进一步观察到我们的模型可以与其他空间超分辨率模型合并，以提高原始SR输出的质量。

translated by 谷歌翻译

Deep back-projection networks for super-resolution

分类：

The feed-forward architectures of recently proposed deep super-resolution networks learn representations of low-resolution inputs, and the non-linear mapping from those to high-resolution output. However, this approach does not fully address the mutual dependencies of low-and high-resolution images. We propose Deep Back-Projection Networks (DBPN), that exploit iterative up-and downsampling layers, providing an error feedback mechanism for projection errors at each stage. We construct mutuallyconnected up-and down-sampling stages each of which represents different types of image degradation and highresolution components. We show that extending this idea to allow concatenation of features across up-and downsampling stages (Dense DBPN) allows us to reconstruct further improve super-resolution, yielding superior results and in particular establishing new state of the art results for large scaling factors such as 8× across multiple data sets.

translated by 谷歌翻译

Image Super-resolution with An Enhanced Group Convolutional Neural Network

Chunwei Tian , Yixuan Yuan , Shichao Zhang , Chia-Wen Lin , Wangmeng Zuo , David Zhang

分类：计算机视觉

2022-05-29

具有强大学习能力的CNN被广泛选择以解决超分辨率问题。但是，CNN依靠更深的网络体系结构来提高图像超分辨率的性能，这可能会增加计算成本。在本文中，我们提出了一个增强的超分辨率组CNN（ESRGCNN），具有浅层架构，通过完全融合了深层和宽的通道特征，以在单图超级分辨率中的不同通道的相关性提取更准确的低频信息（ SISR）。同样，ESRGCNN中的信号增强操作对于继承更长途上下文信息以解决长期依赖性也很有用。将自适应上采样操作收集到CNN中，以获得具有不同大小的低分辨率图像的图像超分辨率模型。广泛的实验报告说，我们的ESRGCNN在SISR中的SISR性能，复杂性，执行速度，图像质量评估和SISR的视觉效果方面超过了最先进的实验。代码可在https://github.com/hellloxiaotian/esrgcnn上找到。

translated by 谷歌翻译

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

Yuqing Liu , Qi Jia , Jian Zhang , Xin Fan , Shanshe Wang , Siwei Ma , Wen Gao

分类：计算机视觉

2022-06-07

作为一个严重的问题，近年来已经广泛研究了单图超分辨率（SISR）。 SISR的主要任务是恢复由退化程序引起的信息损失。根据Nyquist抽样理论，降解会导致混叠效应，并使低分辨率（LR）图像的正确纹理很难恢复。实际上，自然图像中相邻斑块之间存在相关性和自相似性。本文考虑了自相似性，并提出了一个分层图像超分辨率网络（HSRNET）来抑制混叠的影响。我们从优化的角度考虑SISR问题，并根据半季节分裂（HQS）方法提出了迭代解决方案模式。为了先验探索本地图像的质地，我们设计了一个分层探索块（HEB）并进行性增加了接受场。此外，设计多级空间注意力（MSA）是为了获得相邻特征的关系并增强了高频信息，这是视觉体验的关键作用。实验结果表明，与其他作品相比，HSRNET实现了更好的定量和视觉性能，并更有效地释放了别名。

translated by 谷歌翻译

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

Wei-Sheng Lai , Jia-Bin Huang , Narendra Ahuja , Ming-Hsuan Yang

分类：

2017-04-12

Convolutional neural networks have recently demonstrated high-quality reconstruction for single-image superresolution. In this paper, we propose the Laplacian Pyramid Super-Resolution Network (LapSRN) to progressively reconstruct the sub-band residuals of high-resolution images. At each pyramid level, our model takes coarse-resolution feature maps as input, predicts the high-frequency residuals, and uses transposed convolutions for upsampling to the finer level. Our method does not require the bicubic interpolation as the pre-processing step and thus dramatically reduces the computational complexity. We train the proposed LapSRN with deep supervision using a robust Charbonnier loss function and achieve high-quality reconstruction. Furthermore, our network generates multi-scale predictions in one feed-forward pass through the progressive reconstruction, thereby facilitates resource-aware applications. Extensive quantitative and qualitative evaluations on benchmark datasets show that the proposed algorithm performs favorably against the state-of-the-art methods in terms of speed and accuracy.

translated by 谷歌翻译

Learning Detail-Structure Alternative Optimization for Blind Super-Resolution

Feng Li , Yixuan Wu , Huihui Bai , Weisi Lin , Runmin Cong , Yao Zhao

分类：计算机视觉

2022-12-03

Existing convolutional neural networks (CNN) based image super-resolution (SR) methods have achieved impressive performance on bicubic kernel, which is not valid to handle unknown degradations in real-world applications. Recent blind SR methods suggest to reconstruct SR images relying on blur kernel estimation. However, their results still remain visible artifacts and detail distortion due to the estimation errors. To alleviate these problems, in this paper, we propose an effective and kernel-free network, namely DSSR, which enables recurrent detail-structure alternative optimization without blur kernel prior incorporation for blind SR. Specifically, in our DSSR, a detail-structure modulation module (DSMM) is built to exploit the interaction and collaboration of image details and structures. The DSMM consists of two components: a detail restoration unit (DRU) and a structure modulation unit (SMU). The former aims at regressing the intermediate HR detail reconstruction from LR structural contexts, and the latter performs structural contexts modulation conditioned on the learned detail maps at both HR and LR spaces. Besides, we use the output of DSMM as the hidden state and design our DSSR architecture from a recurrent convolutional neural network (RCNN) view. In this way, the network can alternatively optimize the image details and structural contexts, achieving co-optimization across time. Moreover, equipped with the recurrent connection, our DSSR allows low- and high-level feature representations complementary by observing previous HR details and contexts at every unrolling time. Extensive experiments on synthetic datasets and real-world images demonstrate that our method achieves the state-of-the-art against existing methods. The source code can be found at https://github.com/Arcananana/DSSR.

translated by 谷歌翻译

Image-specific Convolutional Kernel Modulation for Single Image Super-resolution

Yuanfei Huang , Jie Li , Yanting Hu , Xinbo Gao , Hua Huang

分类：计算机视觉

2021-11-16

最近，基于深度学习的超分辨率方法取得了良好的性能，但主要关注通过喂养许多样品来训练单个广义的深网络。但是直观地，每个图像都具有其表示，并且预计将获得自适应模型。对于此问题，我们通过利用图像或特征的全局上下文信息来提出一种新颖的图像特异性卷积核调制（IKM），以产生适当地调制卷积核的注意重量，这越优于Vanilla卷积和几个现有的注意机制在没有任何其他参数的情况下嵌入最先进的架构。特别是，为了优化我们在迷你批量培训中的IKM，我们引入了一种特定于图像的优化（ISO）算法，比传统的迷你批量SGD优化更有效。此外，我们调查IKM对最先进的架构的影响，并利用一个带有U风格的残差学习和沙漏密集的块学习的新骨干，术语U-HOLGLASS密集网络（U-HDN），这是一个理论上和实验，最大限度地提高IKM的效力。单图像超分辨率的广泛实验表明，该方法实现了优异的现有方法性能。代码可在github.com/yuanfeihuang/ikm获得。

translated by 谷歌翻译