智能论文笔记

Re-using Adversarial Mask Discriminators for Test-time Training under Distribution Shifts

Gabriele Valvano , Andrea Leo , Sotirios A. Tsaftaris

分类：计算机视觉

2021-08-26

由于他们学习灵活的数据驱动损失能力，生成的对抗性网络（GANS）是许多半和弱监督医学图像分割方法的一个组成部分。 Gans在一组培训数据上共同优化发电机和对抗性鉴别者。在训练完成之后，通常丢弃鉴别器，并且只使用发电机推断。但我们应该丢弃鉴别者吗？在这项工作中，我们认为训练稳定的鉴别者产生了表现性的损失函数，我们可以在推理中重复使用，以检测和\ yringit {正确}分割错误。首先，我们确定关键挑战，并提出可能的解决方案，使鉴别者在推理中重新使用。然后，我们表明我们可以将具有图像重建成本（通过解码器）的判别器结合在一起以赋予测试时间训练的因果角，并进一步改进模型。我们的方法简单，提高了预先培训的GAN的测试时间性能。此外，我们表明它与标准的后处理技术兼容，它有可能用于在线持续学习。通过我们的工作，我们开设新的研究途径，以便在推理中重新使用对抗性鉴别器。我们的代码可以在https://vios-s.github.io/addersarial-test-time -Time-Tome-Torion。

translated by 谷歌翻译

Learning Disentangled Representations in the Imaging Domain

Xiao Liu , Pedro Sanchez , Spyridon Thermos , Alison Q. O'Neil , Sotirios A. Tsaftaris

分类：计算机视觉 | 机器学习

2021-08-26

甚至在没有受限，监督的情况下，也提出了甚至在没有受限或有限的情况下学习普遍陈述的方法。使用适度数量的数据可以微调新的目标任务，或者直接在相应任务中实现显着性能的无奈域中使用的良好普遍表示。这种缓解数据和注释要求为计算机愿景和医疗保健的应用提供了诱人的前景。在本辅导纸上，我们激励了对解散的陈述，目前关键理论和详细的实际构建块和学习此类表示的标准的需求。我们讨论医学成像和计算机视觉中的应用，强调了在示例钥匙作品中进行的选择。我们通过呈现剩下的挑战和机会来结束。

translated by 谷歌翻译

Generative appearance replay for continual unsupervised domain adaptation

Boqi Chen , Kevin Thandiackal , Pushpak Pati , Orcun Goksel

分类：计算机视觉 | 人工智能

2023-01-03

Deep learning models can achieve high accuracy when trained on large amounts of labeled data. However, real-world scenarios often involve several challenges: Training data may become available in installments, may originate from multiple different domains, and may not contain labels for training. Certain settings, for instance medical applications, often involve further restrictions that prohibit retention of previously seen data due to privacy regulations. In this work, to address such challenges, we study unsupervised segmentation in continual learning scenarios that involve domain shift. To that end, we introduce GarDA (Generative Appearance Replay for continual Domain Adaptation), a generative-replay based approach that can adapt a segmentation model sequentially to new domains with unlabeled data. In contrast to single-step unsupervised domain adaptation (UDA), continual adaptation to a sequence of domains enables leveraging and consolidation of information from multiple domains. Unlike previous approaches in incremental UDA, our method does not require access to previously seen data, making it applicable in many practical scenarios. We evaluate GarDA on two datasets with different organs and modalities, where it substantially outperforms existing techniques.

translated by 谷歌翻译

Unsupervised Domain Adaptation for Semantic Image Segmentation: a Comprehensive Survey

Gabriela Csurka , Riccardo Volpi , Boris Chidlovskii

分类：计算机视觉 | 人工智能

2021-12-06

语义分割在广泛的计算机视觉应用中起着基本作用，提供了全球对图像的理解的关键信息。然而，最先进的模型依赖于大量的注释样本，其比在诸如图像分类的任务中获得更昂贵的昂贵的样本。由于未标记的数据替代地获得更便宜，因此无监督的域适应达到了语义分割社区的广泛成功并不令人惊讶。本调查致力于总结这一令人难以置信的快速增长的领域的五年，这包含了语义细分本身的重要性，以及将分段模型适应新环境的关键需求。我们提出了最重要的语义分割方法;我们对语义分割的域适应技术提供了全面的调查;我们揭示了多域学习，域泛化，测试时间适应或无源域适应等较新的趋势;我们通过描述在语义细分研究中最广泛使用的数据集和基准测试来结束本调查。我们希望本调查将在学术界和工业中提供具有全面参考指导的研究人员，并有助于他们培养现场的新研究方向。

translated by 谷歌翻译

DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading

Hasan Md Tusfiqur , Duy M. H. Nguyen , Mai T. N. Truong , Triet A. Nguyen , Binh T. Nguyen , Michael Barz , Hans-Juergen Profitlich , Ngoc T. T. Than , Ngan Le , Pengtao Xie

分类：计算机视觉

2022-12-30

Diabetic Retinopathy (DR) is a leading cause of vision loss in the world, and early DR detection is necessary to prevent vision loss and support an appropriate treatment. In this work, we leverage interactive machine learning and introduce a joint learning framework, termed DRG-Net, to effectively learn both disease grading and multi-lesion segmentation. Our DRG-Net consists of two modules: (i) DRG-AI-System to classify DR Grading, localize lesion areas, and provide visual explanations; (ii) DRG-Expert-Interaction to receive feedback from user-expert and improve the DRG-AI-System. To deal with sparse data, we utilize transfer learning mechanisms to extract invariant feature representations by using Wasserstein distance and adversarial learning-based entropy minimization. Besides, we propose a novel attention strategy at both low- and high-level features to automatically select the most significant lesion information and provide explainable properties. In terms of human interaction, we further develop DRG-Net as a tool that enables expert users to correct the system's predictions, which may then be used to update the system as a whole. Moreover, thanks to the attention mechanism and loss functions constraint between lesion features and classification features, our approach can be robust given a certain level of noise in the feedback of users. We have benchmarked DRG-Net on the two largest DR datasets, i.e., IDRID and FGADR, and compared it to various state-of-the-art deep learning networks. In addition to outperforming other SOTA approaches, DRG-Net is effectively updated using user feedback, even in a weakly-supervised manner.

translated by 谷歌翻译

A Survey of Unsupervised Deep Domain Adaptation

Garrett Wilson , Diane J. Cook

分类：

2018-12-06

Deep learning has produced state-of-the-art results for a variety of tasks. While such approaches for supervised learning have performed well, they assume that training and testing data are drawn from the same distribution, which may not always be the case. As a complement to this challenge, single-source unsupervised domain adaptation can handle situations where a network is trained on labeled data from a source domain and unlabeled data from a related but different target domain with the goal of performing well at test-time on the target domain. Many single-source and typically homogeneous unsupervised deep domain adaptation approaches have thus been developed, combining the powerful, hierarchical representations from deep learning with domain adaptation to reduce reliance on potentially-costly target data labels. This survey will compare these approaches by examining alternative methods, the unique and common elements, results, and theoretical insights. We follow this with a look at application areas and open research directions.

translated by 谷歌翻译

Multi-structure bone segmentation in pediatric MR images with combined regularization from shape priors and adversarial network

Arnaud Boutillon , Bhushan Borotikar , Valérie Burdin , Pierre-Henri Conze

分类：计算机视觉

2020-09-15

儿科肌肉骨骼系统的形态学和诊断评价在临床实践中至关重要。但是，大多数分段模型在稀缺的儿科成像数据上都不好。我们提出了一种新的预训练的正则化卷积编码器 - 解码器，用于分割异质儿科磁共振（MR）图像的具有挑战性的任务。在这方面，我们采用转移学习方法以及正规化策略来改善分段模型的概括。为此，我们已经构思了用于分割网络的新颖优化方案，其包括丢失函数的额外正则化术语。为了获得全局一致的预测，我们纳入了基于形状的正则化，从自动编码器学习的非线性形状表示来源。另外，通过鉴别器计算的对抗正规化是集成的，以鼓励合理的描绘。评估来自脚踝和肩部关节的两个稀缺的小儿摄像数据集的多骨分割任务的方法，包括病理和健康检查。所提出的方法与先前提出的骰子，灵敏度，特异性，最大对称表面距离，平均对称表面距离和相对绝对体积差异度量的方法更好或以前的方法进行更好或以前的方法进行比例。我们说明所提出的方法可以很容易地集成到各种骨骼分割策略中，并且可以提高在大型非医学图像数据库上预先培训的模型的预测准确性。获得的结果为小儿肌肉骨骼障碍的管理带来了新的视角。

translated by 谷歌翻译

CSG0: Continual Urban Scene Generation with Zero Forgetting

Himalaya Jain , Tuan-Hung Vu , Patrick Pérez , Matthieu Cord

分类：计算机视觉

2021-12-06

随着生成对冲网络（GANS）的快速进步，综合场景的视觉质量不断改进，包括复杂的城市场景，其中包含自动驾驶的应用。我们在这项工作中解决了一个持续的场景生成设置，其中GAN在不同的域流上培训;理想情况下，学习的模型最终应该能够在所有看到的域中生成新场景。此设置反映了现实生活场景，其中数据在不同时间的不同地方不断获取。在这种持续的设置中，我们的目标是学习零遗忘，即，由于灾难性的遗忘，在早期域内没有综合质量下降。为此，我们介绍了一种新颖的框架，不仅（i）可以在持续培训中实现无缝知识转移，而且（ii）还能以小的开销成本保证零遗忘。虽然更加内存有效，但由于继续学习，我们的模型比较每个域为一个完整模型的蛮力解决方案比较了更好的合成质量。特别是，在极端的低数据制度下，我们的方法通过大幅度大幅优于蛮力。

translated by 谷歌翻译

Single-domain Generalization in Medical Image Segmentation via Test-time Adaptation from Shape Dictionary

Quande Liu , Cheng Chen , Qi Dou , Pheng-Ann Heng

分类：计算机视觉

2022-06-29

域的概括通常需要来自多个源域的数据才能进行模型学习。但是，这种强大的假设可能并不总是在实践中成立，尤其是在数据共享高度关注，有时由于隐私问题而高度刺激的医学领域。本文研究了重要但具有挑战性的单个领域概括问题，其中在最坏情况下仅具有一个源域，可以直接概括到不同看不见的目标域。我们提出了一种在医学图像分割中解决此问题的新方法，该方法可以提取并集成了跨域不变的分割的语义形状的先验信息，即使是从单个域数据中也可以很好地捕捉，以促进分布偏移下的分割。此外，进一步设计了具有双偶然性正则化的测试时间适应策略，以促进每个看不见的域下这些形状先验的动态融合，以提高模型的通用性。对两个医学图像分割任务进行的广泛实验证明了我们在各种看不见的领域中的方法的一致改进，以及在最坏情况下，它比最先进的方法相比，它优于最先进的方法。

translated by 谷歌翻译

Dive into Self-Supervised Learning for Medical Image Analysis: Data, Models and Tasks

Chuyan Zhang , Yun Gu

分类：计算机视觉

2022-09-25

自我监督的学习（SSL）通过大量未标记的数据的先知，在各种医学成像任务上取得了出色的性能。但是，对于特定的下游任务，仍然缺乏有关如何选择合适的借口任务和实现细节的指令书。在这项工作中，我们首先回顾了医学成像分析领域中自我监督方法的最新应用。然后，我们进行了广泛的实验，以探索SSL中的四个重要问题用于医学成像，包括（1）自我监督预处理对不平衡数据集的影响，（2）网络体系结构，（3）上游任务对下游任务和下游任务和下游任务的适用性（4）SSL和常用政策用于深度学习的堆叠效果，包括数据重新采样和增强。根据实验结果，提出了潜在的指南，以在医学成像中进行自我监督预处理。最后，我们讨论未来的研究方向并提出问题，以了解新的SSL方法和范式时要注意。

translated by 谷歌翻译

Contrastive Image Synthesis and Self-supervised Feature Adaptation for Cross-Modality Biomedical Image Segmentation

Xinrong Hu , Corey Wang , Yiyu Shi

分类：计算机视觉

2022-07-27

这项工作提出了一个新颖的框架CISFA（对比图像合成和自我监督的特征适应），该框架建立在图像域翻译和无监督的特征适应性上，以进行跨模式生物医学图像分割。与现有作品不同，我们使用单方面的生成模型，并在输入图像的采样贴片和相应的合成图像之间添加加权贴片对比度损失，该图像用作形状约束。此外，我们注意到生成的图像和输入图像共享相似的结构信息，但具有不同的方式。因此，我们在生成的图像和输入图像上强制实施对比损失，以训练分割模型的编码器，以最大程度地减少学到的嵌入空间中成对图像之间的差异。与依靠对抗性学习进行特征适应的现有作品相比，这种方法使编码器能够以更明确的方式学习独立于域的功能。我们对包含腹腔和全心的CT和MRI图像的分割任务进行了广泛评估。实验结果表明，所提出的框架不仅输出了较小的器官形状变形的合成图像，而且还超过了最先进的域适应方法的较大边缘。

translated by 谷歌翻译

Medical Image Segmentation Using Deep Learning: A Survey

Risheng Wang , Tao Lei , Ruixia Cui , Bingtao Zhang , Hongying Meng , Asoke K. Nandi

分类：计算机视觉

2020-09-28

深度学习已被广泛用于医学图像分割，并且录制了录制了该领域深度学习的成功的大量论文。在本文中，我们使用深层学习技术对医学图像分割的全面主题调查。本文进行了两个原创贡献。首先，与传统调查相比，直接将深度学习的文献分成医学图像分割的文学，并为每组详细介绍了文献，我们根据从粗略到精细的多级结构分类目前流行的文献。其次，本文侧重于监督和弱监督的学习方法，而不包括无监督的方法，因为它们在许多旧调查中引入而且他们目前不受欢迎。对于监督学习方法，我们分析了三个方面的文献：骨干网络的选择，网络块的设计，以及损耗功能的改进。对于虚弱的学习方法，我们根据数据增强，转移学习和交互式分割进行调查文献。与现有调查相比，本调查将文献分类为比例不同，更方便读者了解相关理由，并将引导他们基于深度学习方法思考医学图像分割的适当改进。

translated by 谷歌翻译

CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwnannoma and Cochlea Segmentation

Reuben Dorent , Aaron Kujawa , Marina Ivory , Spyridon Bakas , Nicola Rieke , Samuel Joutard , Ben Glocker , Jorge Cardoso , Marc Modat , Kayhan Batmanghelich

分类：计算机视觉

2022-01-08

域适应（DA）最近在医学影像社区提出了强烈的兴趣。虽然已经提出了大量DA技术进行了用于图像分割，但大多数这些技术已经在私有数据集或小公共可用数据集上验证。此外，这些数据集主要解决了单级问题。为了解决这些限制，与第24届医学图像计算和计算机辅助干预（Miccai 2021）结合第24届国际会议组织交叉模态域适应（Crossmoda）挑战。 Crossmoda是无监督跨型号DA的第一个大型和多级基准。挑战的目标是分割参与前庭施瓦新瘤（VS）的后续和治疗规划的两个关键脑结构：VS和Cochleas。目前，使用对比度增强的T1（CET1）MRI进行VS患者的诊断和监测。然而，使用诸如高分辨率T2（HRT2）MRI的非对比度序列越来越感兴趣。因此，我们创建了一个无人监督的跨模型分段基准。训练集提供注释CET1（n = 105）和未配对的非注释的HRT2（n = 105）。目的是在测试集中提供的HRT2上自动对HRT2进行单侧VS和双侧耳蜗分割（n = 137）。共有16支球队提交了评估阶段的算法。顶级履行团队达成的表现水平非常高（最佳中位数骰子 - vs：88.4％; Cochleas：85.7％）并接近完全监督（中位数骰子 - vs：92.5％;耳蜗：87.7％）。所有顶级执行方法都使用图像到图像转换方法将源域图像转换为伪目标域图像。然后使用这些生成的图像和为源图像提供的手动注释进行培训分割网络。

translated by 谷歌翻译

LE-UDA: Label-efficient unsupervised domain adaptation for medical image segmentation

Ziyuan Zhao , Fangcheng Zhou , Kaixin Xu , Zeng Zeng , Cuntai Guan , S. Kevin Zhou

分类：人工智能 | 计算机视觉

2022-12-05

While deep learning methods hitherto have achieved considerable success in medical image segmentation, they are still hampered by two limitations: (i) reliance on large-scale well-labeled datasets, which are difficult to curate due to the expert-driven and time-consuming nature of pixel-level annotations in clinical practices, and (ii) failure to generalize from one domain to another, especially when the target domain is a different modality with severe domain shifts. Recent unsupervised domain adaptation~(UDA) techniques leverage abundant labeled source data together with unlabeled target data to reduce the domain gap, but these methods degrade significantly with limited source annotations. In this study, we address this underexplored UDA problem, investigating a challenging but valuable realistic scenario, where the source domain not only exhibits domain shift~w.r.t. the target domain but also suffers from label scarcity. In this regard, we propose a novel and generic framework called ``Label-Efficient Unsupervised Domain Adaptation"~(LE-UDA). In LE-UDA, we construct self-ensembling consistency for knowledge transfer between both domains, as well as a self-ensembling adversarial learning module to achieve better feature alignment for UDA. To assess the effectiveness of our method, we conduct extensive experiments on two different tasks for cross-modality segmentation between MRI and CT images. Experimental results demonstrate that the proposed LE-UDA can efficiently leverage limited source labels to improve cross-domain segmentation performance, outperforming state-of-the-art UDA approaches in the literature. Code is available at: https://github.com/jacobzhaoziyuan/LE-UDA.

translated by 谷歌翻译

Overcoming challenges in leveraging GANs for few-shot data augmentation

Christopher Beckham , Issam Laradji , Pau Rodriguez , David Vazquez , Derek Nowrouzezahrai , Christopher Pal

分类： (统计)机器学习 | 机器学习

2022-03-30

在本文中，我们探讨了基于GAN的少量数据增强用作改善少量分类性能的方法。我们对如何对这样的任务进行微调（其中一项是以课堂开采方式）进行微调的探索，以及对这些模型如何在改善几次分类的情况下进行严格的经验研究。我们确定了与纯粹有监督的制度训练此类生成模型的困难有关的问题，几乎没有例子，以及有关现有作品的评估协议的问题。我们还发现，在这种制度中，分类精度对数据集的类别随机分配方式高度敏感。因此，我们提出了一种半监督的微调方法，作为解决这些问题的更务实的方向。

translated by 谷歌翻译

Test-time training with self-supervision for generalization under distribution shifts

分类：

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple approach leads to improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts.

translated by 谷歌翻译

Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation Framework

Wei Lou , Haofeng Li , Guanbin Li , Xiaoguang Han , Xiang Wan

分类：计算机视觉

2022-12-20

Recently deep neural networks, which require a large amount of annotated samples, have been widely applied in nuclei instance segmentation of H\&E stained pathology images. However, it is inefficient and unnecessary to label all pixels for a dataset of nuclei images which usually contain similar and redundant patterns. Although unsupervised and semi-supervised learning methods have been studied for nuclei segmentation, very few works have delved into the selective labeling of samples to reduce the workload of annotation. Thus, in this paper, we propose a novel full nuclei segmentation framework that chooses only a few image patches to be annotated, augments the training set from the selected samples, and achieves nuclei segmentation in a semi-supervised manner. In the proposed framework, we first develop a novel consistency-based patch selection method to determine which image patches are the most beneficial to the training. Then we introduce a conditional single-image GAN with a component-wise discriminator, to synthesize more training samples. Lastly, our proposed framework trains an existing segmentation model with the above augmented samples. The experimental results show that our proposed method could obtain the same-level performance as a fully-supervised baseline by annotating less than 5% pixels on some benchmarks.

translated by 谷歌翻译

Domain Generalization: A Survey

Kaiyang Zhou , Ziwei Liu , Yu Qiao , Tao Xiang , Chen Change Loy

分类：机器学习 | 人工智能 | 计算机视觉

2021-03-03

对分布（OOD）数据的概括是人类自然的能力，但对于机器而言挑战。这是因为大多数学习算法强烈依赖于i.i.d.〜对源/目标数据的假设，这在域转移导致的实践中通常会违反。域的概括（DG）旨在通过仅使用源数据进行模型学习来实现OOD的概括。在过去的十年中，DG的研究取得了长足的进步，导致了广泛的方法论，例如，基于域的一致性，元学习，数据增强或合奏学习的方法，仅举几例；还在各个应用领域进行了研究，包括计算机视觉，语音识别，自然语言处理，医学成像和强化学习。在本文中，首次提供了DG中的全面文献综述，以总结过去十年来的发展。具体而言，我们首先通过正式定义DG并将其与其他相关领域（如域适应和转移学习）联系起来来涵盖背景。然后，我们对现有方法和理论进行了彻底的审查。最后，我们通过有关未来研究方向的见解和讨论来总结这项调查。

translated by 谷歌翻译

Semantic Self-adaptation: Enhancing Generalization with a Single Sample

Sherwin Bahmani , Oliver Hahn , Eduard Zamfir , Nikita Araslanov , Daniel Cremers , Stefan Roth

分类：计算机视觉

2022-08-10

尽管进行了多年的研究，但跨域的概括仍然是深层网络的语义分割的关键弱点。先前的研究取决于静态模型的假设，即训练过程完成后，模型参数在测试时间保持固定。在这项工作中，我们通过一种自适应方法来挑战这一前提，用于语义分割，将推理过程调整为每个输入样本。自我适应在两个级别上运行。首先，它采用了自我监督的损失，该损失将网络中卷积层的参数定制为输入图像。其次，在批准层中，自适应近似于整个测试数据的平均值和方差，这是不可用的。它通过在训练和从单个测试样本得出的参考分布之间进行插值来实现这一目标。为了凭经验分析我们的自适应推理策略，我们制定并遵循严格的评估协议，以解决先前工作的严重局限性。我们的广泛分析得出了一个令人惊讶的结论：使用标准训练程序，自我适应大大优于强大的基准，并在多域基准测试方面设定了新的最先进的准确性。我们的研究表明，自适应推断可以补充培训时间的既定模型正规化实践，以改善深度网络的概括到异域数据。

translated by 谷歌翻译

Generalizable multi-task, multi-domain deep segmentation of sparse pediatric imaging datasets via multi-scale contrastive regularization and multi-joint anatomical priors

Arnaud Boutillon , Pierre-Henri Conze , Christelle Pons , Valérie Burdin , Bhushan Borotikar

分类：计算机视觉

2022-07-27

小儿肌肉骨骼系统的临床诊断依赖于医学成像检查的分析。在医学图像处理管道中，使用深度学习算法的语义分割使人可以自动生成患者特定的三维解剖模型，这对于形态学评估至关重要。但是，小儿成像资源的稀缺性可能导致单个深层分割模型的准确性和泛化性能降低。在这项研究中，我们建议设计一个新型的多任务多任务多域学习框架，在该框架中，单个分割网络对由解剖学的不同部分产生的多个数据集进行了优化。与以前的方法不同，我们同时考虑多个强度域和分割任务来克服小儿数据的固有稀缺性，同时利用成像数据集之间的共享特征。为了进一步提高概括能力，我们从自然图像分类中采用了转移学习方案，以及旨在在共享表示中促进域特异性群集的多尺度对比正则化，以及多连接解剖学先验来执行解剖学上一致的预测。我们评估了使用脚踝，膝盖和肩关节的三个稀缺和小儿成像数据集进行骨分割的贡献。我们的结果表明，所提出的方法在骰子指标中的表现优于个人，转移和共享分割方案，并具有统计学上足够的利润。拟议的模型为智能使用成像资源和更好地管理小儿肌肉骨骼疾病提供了新的观点。

translated by 谷歌翻译