Smart Paper Notes

Fine-tuned Generative Adversarial Network-based Model for Medical Images Super-Resolution

Alireza Aghelan , Modjtaba Rouhani

Category: Computer Vision

2022-11-01

In medical image analysis, low-resolution images degrade the performance of medical image interpretation and may cause misdiagnosis. Single image super-resolution (SISR) methods can improve the resolution and quality of medical images. Currently, super-resolution models based on Generative Adversarial Networks (GANs) show very good performance. Real-Enhanced Super-Resolution Generative Adversarial Network (Real-ESRGAN) is a practical GAN-based model that is widely used for general image super-resolution. One of the challenges in medical image super-resolution is that, unlike natural images, medical images do not have high spatial resolution. To solve this problem, we can use transfer learning and fine-tune a model that has been trained on external datasets (often natural image datasets). In our proposed approach, the pre-trained generator and discriminator networks of the Real-ESRGAN model are fine-tuned using medical image datasets. In this paper, we worked on chest X-ray and retinal images, using the STARE dataset of retinal images and the Tuberculosis Chest X-rays (Shenzhen) dataset for fine-tuning. The proposed model produces more accurate and natural textures, and its outputs have better detail and resolution than the original Real-ESRGAN outputs.

Translated by Google Translate

Underwater Images Super-Resolution Using Generative Adversarial Network-based Model

Alireza Aghelan

Category: Computer Vision

2022-11-07

Single image super-resolution (SISR) methods can enhance the resolution and quality of underwater images. Enhancing the resolution of underwater images leads to better performance of autonomous underwater vehicles. In this work, we fine-tune the Real-Enhanced Super-Resolution Generative Adversarial Network (Real-ESRGAN) model to increase the resolution of underwater images. In our proposed approach, the pre-trained generator and discriminator networks of the Real-ESRGAN model are fine-tuned using underwater image datasets. We used the USR-248 and UFO-120 datasets to fine-tune the Real-ESRGAN model. Our fine-tuned model produces images with better resolution and quality compared to the original model.


Single MR Image Super-Resolution using Generative Adversarial Network

Shawkh Ibne Rashid , Elham Shakibapour , Mehran Ebrahimi

Category: Computer Vision | Machine Learning

2022-07-16

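The spatial resolution of medical images can be improved using super-resolution methods. Real-Enhanced Super-Resolution Generative Adversarial Network (Real-ESRGAN) is one of the recent effective approaches for producing higher-resolution images from lower-resolution inputs. In this paper, we apply this method to enhance the spatial resolution of 2D MR images. In our proposed approach, we slightly modify the structure of Real-ESRGAN to train on 2D magnetic resonance (MR) images taken from the Brain Tumor Segmentation Challenge (BraTS) 2018 dataset. The results are validated qualitatively and quantitatively by computing SSIM (Structural Similarity Index Measure), NRMSE (Normalized Root Mean Square Error), MAE (Mean Absolute Error), and VIF (Visual Information Fidelity) values.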
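
Two of the metrics named above, MAE and NRMSE, are simple enough to sketch directly. The following is a minimal illustration, not the authors' evaluation code: the function names are hypothetical, images are assumed to be flattened into equal-length lists of pixel values, and NRMSE is normalized by the reference's value range, which is only one of several conventions in use.

```python
import math

def mae(reference, estimate):
    """Mean absolute error between two equally sized flat pixel sequences."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    return sum(abs(r - e) for r, e in zip(reference, estimate)) / len(reference)

def nrmse(reference, estimate):
    """Root mean square error divided by the reference's value range.

    Range normalization is an assumption; other conventions divide by the
    mean or the Euclidean norm of the reference instead.
    """
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    value_range = max(reference) - min(reference)
    if value_range == 0:
        raise ValueError("reference has zero value range")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    return math.sqrt(mse) / value_range

# Toy example: one of four pixels is off by 0.4.
reference = [0.0, 0.5, 1.0, 0.5]
estimate = [0.0, 0.5, 1.0, 0.1]
print(mae(reference, estimate), nrmse(reference, estimate))
```

Lower values are better for both metrics; SSIM and VIF need windowed or multi-scale statistics and are better taken from an established image-quality library.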


Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images

Weizhi Du , Harvery Tian

Category: Computer Vision

2022-12-26

Because of the need to obtain high-quality images with minimal radiation doses, such as in low-field magnetic resonance imaging (MRI), super-resolution reconstruction in medical imaging has become more popular. However, due to the complexity and high aesthetic requirements of medical imaging, image super-resolution reconstruction remains a difficult challenge. In this paper, we offer a deep learning-based strategy for reconstructing medical images from low resolutions using Transformer and Generative Adversarial Networks (T-GAN). By embedding a Transformer into the generative adversarial network for image reconstruction, the integrated system can extract more precise texture information and focus more on important locations through global image matching. Furthermore, we use a weighted combination of content loss, adversarial loss, and adversarial feature loss as the final multi-task loss function when training our proposed T-GAN model. Measured by established metrics such as PSNR and SSIM, our T-GAN achieves optimal performance and recovers more texture features in super-resolution reconstruction of MRI scans of the knees and belly.


A-ESRGAN: Training Real-World Blind Super-Resolution with Attention U-Net Discriminators

Zihao Wei , Yidong Huang , Yuang Chen , Chenhao Zheng , Jinnan Gao

Category: Computer Vision | Machine Learning

2021-12-19

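Blind image super-resolution (SR) is a long-standing task in computer vision that aims to restore low-resolution images suffering from unknown and complex distortions. Recent work has largely focused on adopting more complicated degradation models to emulate real-world degradations. The resulting models have made breakthroughs in perceptual loss and yield perceptually convincing results. However, the limitation imposed by current generative adversarial network structures is still significant: treating all pixels equally ignores the structural features of an image and leads to performance drawbacks such as twisted lines and background over-sharpening or blurring. In this paper, we present A-ESRGAN, a GAN model for blind SR tasks featuring an attention U-Net based, multi-scale discriminator that can be seamlessly integrated with other generators. To our knowledge, this is the first work to introduce the attention U-Net structure as the discriminator of a GAN to solve blind SR problems. The paper also gives an interpretation of the mechanism by which the multi-scale attention U-Net brings a performance breakthrough to the model. In comparison experiments with prior work, our model achieves state-of-the-art performance on the no-reference Natural Image Quality Evaluator (NIQE) metric. Our ablation studies show that, with our discriminator, the RRDB-based generator can leverage the structural features of an image at multiple scales and consequently yields more perceptually realistic high-resolution images than prior work.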


ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Xintao Wang , Ke Yu , Shixiang Wu , Jinjin Gu , Yihao Liu , Chao Dong , Chen Change Loy , Yu Qiao , Xiaoou Tang

Category:

2018-09-01

The Super-Resolution Generative Adversarial Network (SRGAN) [1] is a seminal work that is capable of generating realistic textures during single image super-resolution. However, the hallucinated details are often accompanied by unpleasant artifacts. To further enhance the visual quality, we thoroughly study three key components of SRGAN (network architecture, adversarial loss, and perceptual loss) and improve each of them to derive an Enhanced SRGAN (ESRGAN). In particular, we introduce the Residual-in-Residual Dense Block (RRDB) without batch normalization as the basic network building unit. Moreover, we borrow the idea from relativistic GAN [2] to let the discriminator predict relative realness instead of the absolute value. Finally, we improve the perceptual loss by using the features before activation, which could provide stronger supervision for brightness consistency and texture recovery. Benefiting from these improvements, the proposed ESRGAN achieves consistently better visual quality with more realistic and natural textures than SRGAN and won first place in the PIRM2018-SR Challenge [3]. The code is available at https://github.com/xinntao/ESRGAN.
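
The idea of a discriminator that predicts relative rather than absolute realness can be sketched as follows. This is a minimal illustration assuming the relativistic average formulation, in which each real logit is compared with the mean fake logit and vice versa; the function names and the use of plain Python lists of raw discriminator outputs (logits) are assumptions made for the example, not the released ESRGAN code.

```python
import math

def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def relativistic_discriminator_loss(real_logits, fake_logits):
    """Relativistic average discriminator loss on raw (pre-sigmoid) outputs.

    Real samples are scored relative to the average fake sample, and fake
    samples relative to the average real sample.
    """
    mean_real = sum(real_logits) / len(real_logits)
    mean_fake = sum(fake_logits) / len(fake_logits)
    # -log(sigmoid(z)) == softplus(-z): real should look more real than fakes.
    real_term = sum(softplus(-(r - mean_fake)) for r in real_logits) / len(real_logits)
    # -log(1 - sigmoid(z)) == softplus(z): fake should look less real than reals.
    fake_term = sum(softplus(f - mean_real) for f in fake_logits) / len(fake_logits)
    return real_term + fake_term

# A discriminator that separates real from fake well gets a much lower loss
# than one that scores everything the same.
print(relativistic_discriminator_loss([3.0, 3.0], [-3.0, -3.0]))
print(relativistic_discriminator_loss([0.0, 0.0], [0.0, 0.0]))
```

Only the difference between real and fake scores matters here: adding a constant to every logit leaves the loss unchanged, which is the sense in which the prediction is relative.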


A Comprehensive Review of Deep Learning-based Single Image Super-resolution

Syed Muhammad Arsalan Bashir , Yi Wang , Mahrukh Khan , Yilong Niu

Category: Computer Vision | Machine Learning

2021-02-18

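Image super-resolution (SR) is one of the vital image processing methods for improving image resolution in the field of computer vision. Over the last two decades, significant progress has been made in super-resolution, especially through deep learning methods. This survey provides a detailed review of recent progress in single-image super-resolution from the perspective of deep learning, while also covering the initial classical methods used for image super-resolution. The survey classifies image SR methods into four categories: classical methods, supervised learning-based methods, unsupervised learning-based methods, and domain-specific SR methods. We also introduce the SR problem to provide intuition about image quality metrics, available reference datasets, and SR challenges. Deep learning-based approaches are evaluated using a reference dataset. The reviewed state-of-the-art image SR methods include the enhanced deep SR network (EDSR), cycle-in-cycle GAN (CinCGAN), multi-scale residual network (MSRN), meta residual dense network (Meta-RDN), recurrent back-projection network (RBPN), second-order attention network (SAN), SR feedback network (SRFBN), and the wavelet-based residual attention network (WRAN). Finally, the survey concludes with future directions, trends, and open problems in SR for researchers to address.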


SwiftSRGAN -- Rethinking Super-Resolution for Efficient and Real-time Inference

Koushik Sivarama Krishnan , Karthik Sivarama Krishnan

Category: Computer Vision

2021-11-29

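In recent years, there have been several advances in image super-resolution using state-of-the-art deep learning-based architectures. Many previously published super-resolution techniques require high-end, top-of-the-line graphics processing units (GPUs) to perform image super-resolution. As deep learning approaches advance, neural networks have become more and more compute-hungry. We took a step back and focused on creating an efficient real-time solution. We present an architecture that is faster and has a smaller memory footprint. The proposed architecture uses depth-wise separable convolutions to extract features, and it performs on par with other super-resolution GANs (Generative Adversarial Networks) while maintaining real-time inference and a low memory footprint. Real-time super-resolution enables streaming of high-resolution media content even under poor bandwidth conditions. While maintaining an efficient trade-off between accuracy and latency, we are able to produce a model with comparable performance that is one-eighth (1/8) the size of super-resolution GANs and computes 74 times faster.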
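
To see why depth-wise separable convolutions shrink a model, it helps to count weights. The sketch below compares a standard convolution with a depth-wise convolution followed by a 1x1 point-wise convolution; the function names and the 64-channel, 3x3 layer are illustrative assumptions (not taken from the paper), and biases are ignored.

```python
def standard_conv_params(in_channels, out_channels, kernel_size):
    """Weights in a standard convolution: one k x k filter per (input, output) channel pair."""
    return kernel_size * kernel_size * in_channels * out_channels

def depthwise_separable_params(in_channels, out_channels, kernel_size):
    """Weights in a depth-wise (k x k per input channel) plus point-wise (1 x 1) convolution."""
    depthwise = kernel_size * kernel_size * in_channels
    pointwise = in_channels * out_channels
    return depthwise + pointwise

# Hypothetical layer: 64 input channels, 64 output channels, 3x3 kernel.
standard = standard_conv_params(64, 64, 3)
separable = depthwise_separable_params(64, 64, 3)
print(standard, separable, standard / separable)
```

For this hypothetical layer the separable variant needs roughly eight times fewer weights. The overall size and speed figures reported in the abstract depend on the full architecture, so this per-layer count is only an intuition for where the savings come from.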


Astronomical Image Colorization and upscaling with Generative Adversarial Networks

Shreyas Kalvankar , Hrushikesh Pandit , Pranav Parwate , Atharva Patil , Snehal Kamalapur

Category: Computer Vision | Machine Learning

2021-12-27

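Automatic colorization of images without human intervention has been a subject of interest in the machine learning community for a brief period of time. Assigning color to an image is a highly ill-posed problem because of its inherently high degrees of freedom; given an image, there is often no single color combination that is correct. Besides colorization, another problem in image reconstruction is single image super-resolution, which aims to transform low-resolution images to a higher resolution. This research aims to provide an automated approach to both problems by focusing on a very specific domain of images, namely astronomical images, and processing them with Generative Adversarial Networks (GANs). We explore the use of various models in two different color spaces, RGB and L*a*b*. Because the dataset is small, we use transfer learning, taking a pre-trained ResNet-18 as the backbone, i.e. the encoder of the U-Net, and fine-tuning it further. The model produces visually appealing images that hallucinate high-resolution, colorized data not present in the original image. We present our results by evaluating the GANs quantitatively with distance metrics such as L1 distance and L2 distance in each color space across all channels to provide a comparative analysis. We use the Fréchet Inception Distance (FID) to compare the distribution of the generated images with the distribution of real images and assess the model's performance.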
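
The Fréchet distance behind FID compares two Gaussian fits, one per set of feature vectors. The sketch below is a deliberately simplified version that assumes diagonal covariances, so no matrix square root is needed; an actual FID computation uses features from a pre-trained Inception network and full covariance matrices. The function names and the toy feature vectors are hypothetical.

```python
import math

def mean_and_variance(features):
    """Per-dimension mean and (population) variance of a list of feature vectors."""
    n = len(features)
    dim = len(features[0])
    means = [sum(f[i] for f in features) / n for i in range(dim)]
    variances = [sum((f[i] - means[i]) ** 2 for f in features) / n for i in range(dim)]
    return means, variances

def frechet_distance_diagonal(features_a, features_b):
    """Squared Fréchet distance between two Gaussians with diagonal covariance.

    With diagonal covariances the trace term reduces to the squared
    difference of per-dimension standard deviations.
    """
    mu_a, var_a = mean_and_variance(features_a)
    mu_b, var_b = mean_and_variance(features_b)
    return sum(
        (ma - mb) ** 2 + (math.sqrt(va) - math.sqrt(vb)) ** 2
        for ma, mb, va, vb in zip(mu_a, mu_b, var_a, var_b)
    )

# Toy one-dimensional "features": same spread, means one unit apart.
real = [[0.0], [2.0]]
generated = [[1.0], [3.0]]
print(frechet_distance_diagonal(real, generated))
```

A distance of zero means the two fitted Gaussians coincide; larger values mean the generated feature distribution has drifted from the real one in mean, spread, or both.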


EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis

Mehdi S. M. Sajjadi , Bernhard Schölkopf , Michael Hirsch

Category:

2016-12-23

Single image super-resolution is the task of inferring a high-resolution image from a single low-resolution input. Traditionally, the performance of algorithms for this task is measured using pixel-wise reconstruction measures such as peak signal-to-noise ratio (PSNR) which have been shown to correlate poorly with the human perception of image quality. As a result, algorithms minimizing these metrics tend to produce over-smoothed images that lack high-frequency textures and do not look natural despite yielding high PSNR values. We propose a novel application of automated texture synthesis in combination with a perceptual loss focusing on creating realistic textures rather than optimizing for a pixel-accurate reproduction of ground truth images during training. By using feed-forward fully convolutional neural networks in an adversarial training setting, we achieve a significant boost in image quality at high magnification ratios. Extensive experiments on a number of datasets show the effectiveness of our approach, yielding state-of-the-art results in both quantitative and qualitative benchmarks.
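
The point about PSNR favoring over-smoothed outputs can be made concrete with a tiny example. The sketch below is a minimal PSNR implementation on flattened pixel lists (the function name and the 8-bit peak value of 255 are assumptions), followed by a toy case in which a flat gray estimate scores higher than a correctly textured estimate that is shifted by one pixel.

```python
import math

def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB for two equally sized flat pixel sequences."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)

# Ground truth: a high-frequency stripe pattern.
truth = [0.0, 255.0, 0.0, 255.0]
# Over-smoothed estimate: texture replaced by its mean.
smooth = [127.5, 127.5, 127.5, 127.5]
# Textured estimate: the right pattern, shifted by one pixel.
shifted = [255.0, 0.0, 255.0, 0.0]

print(psnr(truth, smooth), psnr(truth, shifted))
```

The flat estimate wins on PSNR even though the shifted one contains exactly the right texture, which illustrates why pixel-wise metrics push models toward blurry results.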


Infrared Image Super-Resolution: Systematic Review, and Future Trends

Yongsong Huang , Tomo Miyazaki , Xiaofeng Liu , Shinichiro Omachi

Category: Computer Vision | Machine Learning

2022-12-22

Image Super-Resolution (SR) is essential for a wide range of computer vision and image processing tasks. Investigating infrared (IR) image (or thermal image) super-resolution is a continuing concern within the development of deep learning. This survey aims to provide a comprehensive perspective of IR image super-resolution, including its applications, hardware imaging system dilemmas, and taxonomy of image processing methodologies. In addition, the datasets and evaluation metrics in IR image super-resolution tasks are also discussed. Furthermore, the deficiencies in current technologies and possible promising directions for the community to explore are highlighted. To cope with the rapid development in this field, we intend to regularly update the relevant excellent work at https://github.com/yongsongH/Infrared_Image_SR_Survey


Exploiting Digital Surface Models for Inferring Super-Resolution for Remotely Sensed Images

Savvas Karatsiolis , Chirag Padubidri , Andreas Kamilaris

Category: Computer Vision | Machine Learning

2022-05-09

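Despite the plethora of successful super-resolution reconstruction (SRR) models applied to natural images, their application to remote sensing imagery tends to produce poor results. Remote sensing imagery is often more complicated than natural images and has its own peculiarities: it is of lower resolution, it contains noise, and it often depicts large textured surfaces. As a result, applying non-specialized SRR models to remote sensing imagery results in artifacts and poor reconstructions. To address these problems, this paper proposes an architecture inspired by previous research work, introducing a novel approach for forcing an SRR model to output realistic remote sensing images: instead of relying on feature-space similarities as a perceptual loss, the model considers pixel-level information inferred from the normalized Digital Surface Model (nDSM) of the image. This strategy allows better-informed updates during training, sourced from a task (elevation map inference) that is closely related to remote sensing. Nonetheless, the nDSM auxiliary information is not required during production, so the model infers a super-resolution image without any additional data besides its low-resolution input. We assess our model on two remotely sensed datasets of different spatial resolutions that also contain the DSM pairs of the images: the DFC2018 dataset and a dataset containing the national Lidar fly-by of Luxembourg. Based on visual inspection, the inferred super-resolution images exhibit particularly superior quality. In particular, the results for the high-resolution DFC2018 dataset are realistic and almost indistinguishable from the ground truth images.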


Sparse-based Domain Adaptation Network for OCTA Image Super-Resolution Reconstruction

Huaying Hao , Cong Xu , Dan Zhang , Qifeng Yan , Jiong Zhang , Yue Liu , Yitian Zhao

Category: Computer Vision

2022-07-25

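High-resolution retinal optical coherence tomography angiography (OCTA) is important for the quantification and analysis of the retinal vasculature. However, the resolution of OCTA images is inversely proportional to the field of view at the same sampling frequency, which makes it harder for clinicians to analyze larger vascular areas. In this paper, we propose a novel sparse-based domain adaptation super-resolution network (SASR) for reconstructing realistic 6x6 mm2 low-resolution (LR) OCTA images into high-resolution (HR) representations. More specifically, we first apply a simple degradation to the 3x3 mm2 high-resolution (HR) image to obtain a synthetic LR image. An efficient registration method is then used to register the synthetic LR image with its corresponding 3x3 mm2 region within the 6x6 mm2 image, yielding a cropped realistic LR image. We then propose a multi-level super-resolution model for fully supervised reconstruction of the synthetic data, which guides the reconstruction of the realistic LR images through a generative adversarial strategy that allows the synthetic and realistic LR images to be unified in the feature domain. Finally, a novel sparse edge-aware loss is designed to dynamically optimize the vessel edge structure. Extensive experiments on two OCTA datasets show that our method outperforms state-of-the-art super-resolution reconstruction methods. In addition, we investigate the performance of the reconstruction results on retinal structure segmentation, which further validates the effectiveness of our approach.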


FREDSR: Fourier Residual Efficient Diffusive GAN for Single Image Super Resolution

Kyoungwan Woo , Achyuta Rajaram

Category: Computer Vision

2022-11-30

FREDSR is a GAN variant that aims to outperform traditional GAN models in specific tasks such as Single Image Super Resolution with extreme parameter efficiency at the cost of per-dataset generalizability. FREDSR integrates fast Fourier transformation, residual prediction, diffusive discriminators, etc. to achieve strong performance in comparison to other models on the UHDSR4K dataset for Single Image 3x Super Resolution from 360p and 720p with only 37000 parameters. The model follows the characteristics of the given dataset, resulting in lower generalizability but higher performance on tasks such as real-time upscaling.


Real-World Single Image Super-Resolution Under Rainy Condition

Mohammad Shahab Uddin

Category: Computer Vision

2022-06-16

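Image super-resolution is an important research area in computer vision with a wide variety of applications, including surveillance and medical imaging. Real-world single image super-resolution has become very popular because of its real-time applications. There is still a lot of room to improve real-world single image super-resolution, especially in challenging weather conditions. In this paper, we propose a new algorithm for real-world single image super-resolution under rainy conditions. Our proposed method can mitigate the influence of rain during image super-resolution. Our experimental results show that the proposed algorithm can perform image super-resolution while reducing the negative effects of rain.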


Enhancing Quality of Pose-varied Face Restoration with Local Weak Feature Sensing and GAN Prior

Kai Hu , Yu Liu , Renhe Liu , Wei Lu , Gang Yu , Bin Fu

Category: Computer Vision

2022-05-28

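Facial semantic guidance (including facial landmarks, facial heatmaps, and facial parsing maps) and facial generative adversarial network (GAN) priors have been widely used in blind face restoration (BFR) in recent years. Although existing BFR methods achieve good performance in ordinary cases, these solutions have limited resilience when applied to severely degraded and pose-varied face images in real-world scenarios (e.g., looking right, looking left, or laughing). In this work, we propose a well-designed blind face restoration network with a generative facial prior. The proposed network consists mainly of an asymmetric codec and a StyleGAN2 prior network. In the asymmetric codec, we adopt a mixed multi-path residual block (MMRB) to gradually extract weak texture features from input images, which better preserves the original facial features and avoids excessive hallucination. The MMRB can also be used as a plug-and-play module in other networks. Furthermore, thanks to the rich and diverse facial priors of the StyleGAN2 model, we adopt a fine-tuning approach to flexibly restore natural and realistic facial details. In addition, a novel self-supervised training strategy is specially designed for the face restoration task to fit the distribution closer to the target and maintain training stability. Extensive experiments on synthetic and real-world datasets show that our model achieves superior performance on face restoration and face super-resolution tasks.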


Real-World Image Super Resolution via Unsupervised Bi-directional Cycle Domain Transfer Learning based Generative Adversarial Network

Xiang Wang , Yimin Yang , Zhichang Guo , Zhili Zhou , Yu Liu , Qixiang Pang , Shan Du

Category: Computer Vision | Machine Learning

2022-11-19

Deep Convolutional Neural Networks (DCNNs) have exhibited impressive performance on image super-resolution tasks. However, these deep learning-based super-resolution methods perform poorly in real-world super-resolution tasks, where the paired high-resolution and low-resolution images are unavailable and the low-resolution images are degraded by complicated and unknown kernels. To break these limitations, we propose the Unsupervised Bi-directional Cycle Domain Transfer Learning-based Generative Adversarial Network (UBCDTL-GAN), which consists of an Unsupervised Bi-directional Cycle Domain Transfer Network (UBCDTN) and the Semantic Encoder guided Super Resolution Network (SESRN). First, the UBCDTN is able to produce an approximated real-like LR image through transferring the LR image from an artificially degraded domain to the real-world LR image domain. Second, the SESRN has the ability to super-resolve the approximated real-like LR image to a photo-realistic HR image. Extensive experiments on unpaired real-world image benchmark datasets demonstrate that the proposed method achieves superior performance compared to state-of-the-art methods.


Kernel Adversarial Learning for Real-world Image Super-resolution

Hu Wang , Congbo Ma , Jianpeng Zhang , Gustavo Carneiro

Category: Computer Vision

2021-04-19

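Current deep image super-resolution (SR) approaches attempt to restore high-resolution images from down-sampled images, or assume that the degradation comes from simple Gaussian kernels and additive noise. However, such simple image processing techniques are only crude approximations of the real-world process that lowers image resolution. In this paper, we propose a more realistic process for lowering image resolution by introducing a new Kernel Adversarial Learning Super-resolution (KASR) framework to deal with the real-world image SR problem. In the proposed framework, degradation kernels and noise are adaptively modeled rather than explicitly specified. Moreover, we propose an iterative supervision process and a high-frequency selective objective to further boost the SR reconstruction accuracy of the model. Extensive experiments validate the effectiveness of the proposed framework on real-world datasets.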
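
For reference, the simple degradation model that this abstract argues against (Gaussian blur, down-sampling, additive noise) can be sketched in a few lines. This is a minimal illustration of that baseline only, not of the KASR framework; the function names, the default parameters, and the choice of edge replication at the borders are assumptions made for the example.

```python
import math
import random

def gaussian_kernel(size, sigma):
    """Return a normalized size x size Gaussian kernel (size should be odd)."""
    half = size // 2
    weights = [
        [math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
         for dx in range(-half, half + 1)]
        for dy in range(-half, half + 1)
    ]
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]

def degrade(image, scale=2, kernel_size=3, sigma=1.0, noise_std=0.0, seed=0):
    """Blur a 2D grayscale image, keep every scale-th pixel, add Gaussian noise.

    Borders are handled by replicating the nearest edge pixel.
    """
    rng = random.Random(seed)
    kernel = gaussian_kernel(kernel_size, sigma)
    half = kernel_size // 2
    height, width = len(image), len(image[0])
    low_res = []
    for y in range(0, height, scale):
        row = []
        for x in range(0, width, scale):
            blurred = 0.0
            for ky in range(kernel_size):
                for kx in range(kernel_size):
                    yy = min(max(y + ky - half, 0), height - 1)
                    xx = min(max(x + kx - half, 0), width - 1)
                    blurred += kernel[ky][kx] * image[yy][xx]
            row.append(blurred + rng.gauss(0.0, noise_std))
        low_res.append(row)
    return low_res

# A 4x4 image with a bright right half becomes a 2x2 low-resolution image.
high_res = [[0.0, 0.0, 100.0, 100.0] for _ in range(4)]
print(degrade(high_res, scale=2, noise_std=1.0))
```

Here the kernel and noise level are fixed by hand, which is exactly the explicit specification that the abstract proposes to replace with adaptively modeled kernels and noise.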


Edge-Enhanced Dual Discriminator Generative Adversarial Network for Fast MRI with Parallel Imaging Using Multi-view Information

Jiahao Huang , Weiping Ding , Jun Lv , Jingwen Yang , Hao Dong , Javier Del Ser , Jun Xia , Tiaojuan Ren , Stephen Wong , Guang Yang

Category: Artificial Intelligence | Computer Vision | Machine Learning

2021-12-10

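In clinical medicine, magnetic resonance imaging (MRI) is one of the most important tools for diagnosis, triage, prognosis, and treatment planning. However, MRI suffers from an inherently slow data acquisition process because data is collected sequentially in k-space. In recent years, most MRI reconstruction methods proposed in the literature have focused on holistic image reconstruction rather than on enhancing edge information. This work departs from that general trend by elaborating on the enhancement of edge information. Specifically, we introduce a novel parallel imaging coupled dual discriminator generative adversarial network (PIDD-GAN) for fast multi-channel MRI reconstruction that incorporates multi-view information. The dual discriminator design aims to improve the edge information in MRI reconstruction: one discriminator is used for holistic image reconstruction, whereas the other is responsible for enhancing edge information. An improved U-Net with local and global residual learning is proposed for the generator. Frequency channel attention blocks (FCA blocks) are embedded in the generator to incorporate attention mechanisms. A content loss is introduced to train the generator for better reconstruction quality. We performed comprehensive experiments on the Calgary-Campinas public brain MR dataset and compared our method with state-of-the-art MRI reconstruction methods. Ablation studies of residual learning were conducted on the MICCAI13 dataset to validate the proposed modules. Results show that our PIDD-GAN provides high-quality reconstructed MR images with well-preserved edge information. Single-image reconstruction takes less than 5 ms, which meets the demand for faster processing.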


Photo-realistic single image super-resolution using a generative adversarial network

Category:

Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. Recent work has largely focused on minimizing the mean squared reconstruction error. The resulting estimates have high peak signal-to-noise ratios, but they are often lacking high-frequency details and are perceptually unsatisfying in the sense that they fail to match the fidelity expected at the higher resolution. In this paper, we present SRGAN, a generative adversarial network (GAN) for image super-resolution (SR). To our knowledge, it is the first framework capable of inferring photo-realistic natural images for 4× upscaling factors. To achieve this, we propose a perceptual loss function which consists of an adversarial loss and a content loss. The adversarial loss pushes our solution to the natural image manifold using a discriminator network that is trained to differentiate between the super-resolved images and original photo-realistic images. In addition, we use a content loss motivated by perceptual similarity instead of similarity in pixel space. Our deep residual network is able to recover photo-realistic textures from heavily downsampled images on public benchmarks. An extensive mean-opinion-score (MOS) test shows hugely significant gains in perceptual quality using SRGAN. The MOS scores obtained with SRGAN are closer to those of the original high-resolution images than to those obtained with any state-of-the-art method.
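
The perceptual loss described here, a content loss plus an adversarial loss, can be sketched as a weighted sum. The abstract does not say how the two terms are weighted, so the default weight of 1e-3 below is an illustrative assumption, as are the function names; the content loss is taken as an already computed number (for example a feature-space distance), and the discriminator outputs are assumed to be probabilities that a generated image is real.

```python
import math

def adversarial_loss(disc_probs_on_generated):
    """Generator-side adversarial loss: mean of -log D(G(x)).

    It is small when the discriminator believes the generated images are real.
    """
    eps = 1e-12  # guards against log(0)
    return sum(-math.log(max(p, eps)) for p in disc_probs_on_generated) / len(disc_probs_on_generated)

def perceptual_loss(content_loss, disc_probs_on_generated, adversarial_weight=1e-3):
    """Content loss plus a down-weighted adversarial term (the weight is an assumption)."""
    return content_loss + adversarial_weight * adversarial_loss(disc_probs_on_generated)

# Same content loss, but the second batch fools the discriminator more often.
print(perceptual_loss(0.5, [0.1, 0.2]))
print(perceptual_loss(0.5, [0.9, 0.8]))
```

With the content term held fixed, the total loss drops as the discriminator's confidence in the generated images rises, which is how the adversarial term pulls outputs toward the natural image manifold.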
