智能论文笔记

DomainATM: Domain Adaptation Toolbox for Medical Data Analysis

Hao Guan , Mingxia Liu

分类：计算机视觉

2022-09-24

域适应性（DA）是基于现代机器学习的医学数据分析的重要技术，旨在减少不同医疗数据集之间的分布差异。适当的域适应方法可以通过从多个站点/中心获取的数据来显着增强统计能力。为此，我们开发了用于医疗数据分析（DomainAtm）的域适应工具箱 - 一个开放式软件包，旨在快速促进和轻松自定义用于医疗数据分析的域适应方法。该域名在MATLAB中实现，并具有用户友好的图形接口，它由一系列流行的数据适应算法组成，这些算法已广泛应用于医学图像分析和计算机视觉。借助域名，研究人员能够促进对医学数据分析的不同适应方法的快速特征级别和图像级适应，可视化和性能评估。更重要的是，域名使用户能够通过脚本编写，大大增强其效用和可扩展性来开发和测试自己的适应方法。三个示例实验表明了概述和域的概述特征和用法，并证明了其有效性，简单性和灵活性。该软件，源代码和手册可在线获得。

translated by 谷歌翻译

Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI

Hao Guan , Ling Yue , Pew-Thian Yap , Andrea Bozoki , Mingxia Liu

分类：计算机视觉

2022-06-24

主观认知下降（SCD）是阿尔茨海默氏病（AD）的临床前阶段，甚至在轻度认知障碍（MCI）之前就发生。渐进式SCD将转换为MCI，并有可能进一步发展为AD。因此，通过神经成像技术（例如，结构MRI）对进行性SCD的早期鉴定对于AD的早期干预具有巨大的临床价值。但是，现有的基于MRI的机器/深度学习方法通常会遇到小样本大小的问题，这对相关的神经影像学分析构成了巨大挑战。我们旨在解决本文的主要问题是如何利用相关领域（例如AD/NC）协助SCD的进展预测。同时，我们担心哪些大脑区域与进行性SCD的识别更加紧密相关。为此，我们提出了一个注意引导自动编码器模型，以进行有效的跨域适应，以促进知识转移从AD到SCD。所提出的模型由四个关键组成部分组成：1）用于学习不同域的共享子空间表示的功能编码模块，2）用于自动定义大脑中定义的兴趣障碍区域的注意模块，3）用于重构的解码模块原始输入，4）用于鉴定脑疾病的分类模块。通过对这四个模块的联合培训，可以学习域不变功能。同时，注意机制可以强调与脑部疾病相关的区域。公开可用的ADNI数据集和私人CLAS数据集的广泛实验证明了该方法的有效性。提出的模型直接可以在CPU上仅5-10秒进行训练和测试，并且适用于具有小数据集的医疗任务。

translated by 谷歌翻译

Domain Adaptation for Time-Series Classification to Mitigate Covariate Shift

Felix Ott , David Rügamer , Lucas Heublein , Bernd Bischl , Christopher Mutschler

分类：机器学习 | 人工智能

2022-04-07

当机器学习模型将其应用于与最初训练的数据相似但不同的域中的数据时，它的性能会降低。为了减轻此域移位问题，域Adaptation（DA）技术搜索了最佳转换，该转换将（当前）输入数据从源域转换为目标域，以学习域名不变的表示，以减少域差异。本文根据两个步骤提出了一个新颖的监督DA。首先，我们从几个样本中搜索从源到目标域的最佳类依赖性转换。我们考虑了最佳的运输方法，例如地球搬运工的距离，凹痕传输和相关对准。其次，我们使用嵌入相似技术在推理时选择相应的转换。我们使用相关指标和高阶矩匹配技术。我们对具有域移动的时间序列数据集进行了广泛的评估，包括模拟和各种在线手写数据集，以演示性能。

translated by 谷歌翻译

A Survey of Unsupervised Domain Adaptation for Visual Recognition

Youshan Zhang

分类：计算机视觉

2021-12-13

虽然在许多域内生成并提供了大量的未标记数据，但对视觉数据的自动理解的需求高于以往任何时候。大多数现有机器学习模型通常依赖于大量标记的训练数据来实现高性能。不幸的是，在现实世界的应用中，不能满足这种要求。标签的数量有限，手动注释数据昂贵且耗时。通常需要将知识从现有标记域传输到新域。但是，模型性能因域之间的差异（域移位或数据集偏差）而劣化。为了克服注释的负担，域适应（DA）旨在在将知识从一个域转移到另一个类似但不同的域中时减轻域移位问题。无监督的DA（UDA）处理标记的源域和未标记的目标域。 UDA的主要目标是减少标记的源数据和未标记的目标数据之间的域差异，并在培训期间在两个域中学习域不变的表示。在本文中，我们首先定义UDA问题。其次，我们从传统方法和基于深度学习的方法中概述了不同类别的UDA的最先进的方法。最后，我们收集常用的基准数据集和UDA最先进方法的报告结果对视觉识别问题。

translated by 谷歌翻译

Applications of Unsupervised Deep Transfer Learning to Intelligent Fault Diagnosis: A Survey and Comparative Study

Zhibin Zhao , Qiyang Zhang , Xiaolei Yu , Chuang Sun , Shibin Wang , Ruqiang Yan , Xuefeng Chen

分类：机器学习

2019-12-28

最近的智能故障诊断（IFD）的进展大大依赖于深度代表学习和大量标记数据。然而，机器通常以各种工作条件操作，或者目标任务具有不同的分布，其中包含用于训练的收集数据（域移位问题）。此外，目标域中的新收集的测试数据通常是未标记的，导致基于无监督的深度转移学习（基于UDTL为基础的）IFD问题。虽然它已经实现了巨大的发展，但标准和开放的源代码框架以及基于UDTL的IFD的比较研究尚未建立。在本文中，我们根据不同的任务，构建新的分类系统并对基于UDTL的IFD进行全面审查。对一些典型方法和数据集的比较分析显示了基于UDTL的IFD中的一些开放和基本问题，这很少研究，包括特征，骨干，负转移，物理前导等的可转移性，强调UDTL的重要性和再现性 - 基于IFD，整个测试框架将发布给研究界以促进未来的研究。总之，发布的框架和比较研究可以作为扩展界面和基本结果，以便对基于UDTL的IFD进行新的研究。代码框架可用于\ url {https:/github.com/zhaozhibin/udtl}。

translated by 谷歌翻译

Cross-Subject Domain Adaptation for Classifying Working Memory Load with Multi-Frame EEG Images

Junfu Chen , Xiaoyi Jiang , Yang Chen , Bi Wang

分类：机器学习 | 计算机视觉

2021-06-12

工作记忆（WM）表示在脑海中存储的信息，是人类认知领域的一个基本研究主题。可以监测大脑的电活动的脑电图（EEG）已被广泛用于测量WM的水平。但是，关键的挑战之一是个体差异可能会导致无效的结果，尤其是当既定模型符合陌生主题时。在这项工作中，我们提出了一个具有空间注意力（CS-DASA）的跨主题深层适应模型，以概括跨科目的工作负载分类。首先，我们将EEG时间序列转换为包含空间，光谱和时间信息的多帧EEG图像。首先，CS-DASA中的主题共享模块从源和目标主题中接收多帧的EEG图像数据，并学习了共同的特征表示。然后，在特定于主题的模块中，实现了最大平均差异，以测量重现的内核希尔伯特空间中的域分布差异，这可以为域适应增加有效的罚款损失。此外，采用主题对象的空间注意机制专注于目标图像数据的判别空间特征。在包含13个受试者的公共WM EEG数据集上进行的实验表明，所提出的模型能够达到比现有最新方法更好的性能。

translated by 谷歌翻译

A Comprehensive Survey on Transfer Learning

Fuzhen Zhuang , Zhiyuan Qi , Keyu Duan , Dongbo Xi , Yongchun Zhu , Hengshu Zhu , Hui Xiong , Qing He

分类：

2019-11-07

Transfer learning aims at improving the performance of target learners on target domains by transferring the knowledge contained in different but related source domains. In this way, the dependence on a large number of target domain data can be reduced for constructing target learners. Due to the wide application prospects, transfer learning has become a popular and promising area in machine learning. Although there are already some valuable and impressive surveys on transfer learning, these surveys introduce approaches in a relatively isolated way and lack the recent advances in transfer learning. Due to the rapid expansion of the transfer learning area, it is both necessary and challenging to comprehensively review the relevant studies. This survey attempts to connect and systematize the existing transfer learning researches, as well as to summarize and interpret the mechanisms and the strategies of transfer learning in a comprehensive way, which may help readers have a better understanding of the current research status and ideas. Unlike previous surveys, this survey paper reviews more than forty representative transfer learning approaches, especially homogeneous transfer learning approaches, from the perspectives of data and model. The applications of transfer learning are also briefly introduced. In order to show the performance of different transfer learning models, over twenty representative transfer learning models are used for experiments. The models are performed on three different datasets, i.e., Amazon Reviews, Reuters-21578, and Office-31. And the experimental results demonstrate the importance of selecting appropriate transfer learning models for different applications in practice.

translated by 谷歌翻译

Discriminative Radial Domain Adaptation

Zenan Huang , Jun Wen , Siheng Chen , Linchao Zhu , Nenggan Zheng

分类：机器学习 | 计算机视觉

2023-01-01

Domain adaptation methods reduce domain shift typically by learning domain-invariant features. Most existing methods are built on distribution matching, e.g., adversarial domain adaptation, which tends to corrupt feature discriminability. In this paper, we propose Discriminative Radial Domain Adaptation (DRDR) which bridges source and target domains via a shared radial structure. It's motivated by the observation that as the model is trained to be progressively discriminative, features of different categories expand outwards in different directions, forming a radial structure. We show that transferring such an inherently discriminative structure would enable to enhance feature transferability and discriminability simultaneously. Specifically, we represent each domain with a global anchor and each category a local anchor to form a radial structure and reduce domain shift via structure matching. It consists of two parts, namely isometric transformation to align the structure globally and local refinement to match each category. To enhance the discriminability of the structure, we further encourage samples to cluster close to the corresponding local anchors based on optimal-transport assignment. Extensively experimenting on multiple benchmarks, our method is shown to consistently outperforms state-of-the-art approaches on varied tasks, including the typical unsupervised domain adaptation, multi-source domain adaptation, domain-agnostic learning, and domain generalization.

translated by 谷歌翻译

Deep Visual Domain Adaptation: A Survey

Mei Wang , Weihong Deng

分类：

2018-02-10

Deep domain adaptation has emerged as a new learning technique to address the lack of massive amounts of labeled data. Compared to conventional methods, which learn shared feature subspaces or reuse important source instances with shallow representations, deep domain adaptation methods leverage deep networks to learn more transferable representations by embedding domain adaptation in the pipeline of deep learning. There have been comprehensive surveys for shallow domain adaptation, but few timely reviews the emerging deep learning based methods. In this paper, we provide a comprehensive survey of deep domain adaptation methods for computer vision applications with four major contributions. First, we present a taxonomy of different deep domain adaptation scenarios according to the properties of data that define how two domains are diverged. Second, we summarize deep domain adaptation approaches into several categories based on training loss, and analyze and compare briefly the state-of-the-art methods under these categories. Third, we overview the computer vision applications that go beyond image classification, such as face recognition, semantic segmentation and object detection. Fourth, some potential deficiencies of current methods and several future directions are highlighted.

translated by 谷歌翻译

Deep Domain Adaptation for Pavement Crack Detection

Huijun Liu , Chunhua Yang , Ao Li , Yongxin Ge , Sheng Huang , Xin Feng , Zhimin Ruan

分类：计算机视觉

2021-11-19

基于深度学习的路面裂缝检测方法通常需要大规模标签，具有详细的裂缝位置信息来学习准确的预测。然而，在实践中，由于路面裂缝的各种视觉模式，裂缝位置很难被手动注释。在本文中，我们提出了一种基于深域适应的裂缝检测网络（DDACDN），其学会利用源域知识来预测目标域中的多类别裂缝位置信息，其中仅是图像级标签可用的。具体地，DDACDN首先通过双分支权重共享骨干网络从源和目标域中提取裂缝特征。并且在实现跨域自适应的努力中，通过从每个域的特征空间聚合三尺度特征来构建中间域，以使来自源域的裂缝特征适应目标域。最后，该网络涉及两个域的知识，并接受识别和本地化路面裂缝的培训。为了便于准确的培训和验证域适应，我们使用两个具有挑战性的路面裂缝数据集CQu-BPDD和RDD2020。此外，我们构建了一个名为CQu-BPMDD的新型大型沥青路面多标签疾病数据集，其中包含38994个高分辨率路面疾病图像，以进一步评估模型的稳健性。广泛的实验表明，DDACDN优于最先进的路面裂纹检测方法，以预测目标结构域的裂缝位置。

translated by 谷歌翻译

Multi-Representation Adaptation Network for Cross-domain Image Classification

Yongchun Zhu , Fuzhen Zhuang , Jindong Wang , Jingwu Chen , Zhiping Shi , Wenjuan Wu , Qing He

分类：计算机视觉 | 人工智能 | 机器学习

2022-01-04

在图像分类中，获得足够的标签通常昂贵且耗时。为了解决这个问题，域适应通常提供有吸引力的选择，给出了来自类似性质但不同域的大量标记数据。现有方法主要对准单个结构提取的表示的分布，并且表示可以仅包含部分信息，例如，仅包含部分饱和度，亮度和色调信息。在这一行中，我们提出了多代表性适应，这可以大大提高跨域图像分类的分类精度，并且特别旨在对准由名为Inception Adaption Adationation模块（IAM）提取的多个表示的分布。基于此，我们呈现多色自适应网络（MRAN）来通过多表示对准完成跨域图像分类任务，该任向性可以捕获来自不同方面的信息。此外，我们扩展了最大的平均差异（MMD）来计算适应损耗。我们的方法可以通过扩展具有IAM的大多数前进模型来轻松实现，并且网络可以通过反向传播有效地培训。在三个基准图像数据集上进行的实验证明了备的有效性。代码已在https://github.com/easezyc/deep-transfer -learning上获得。

translated by 谷歌翻译

Deep Unsupervised Domain Adaptation: A Review of Recent Advances and Perspectives

Xiaofeng Liu , Chaehwa Yoo , Fangxu Xing , Hyejin Oh , Georges El Fakhri , Je-Won Kang , Jonghye Woo

分类：计算机视觉 | 人工智能 | 机器学习

2022-08-15

深度学习已成为解决不同领域中现实世界中问题的首选方法，部分原因是它能够从数据中学习并在广泛的应用程序上实现令人印象深刻的性能。但是，它的成功通常取决于两个假设：（i）精确模型拟合需要大量标记的数据集，并且（ii）培训和测试数据是独立的且分布相同的。因此，不能保证它在看不见的目标域上的性能，尤其是在适应阶段遇到分布数据的数据时。目标域中数据的性能下降是部署深层神经网络的关键问题，这些网络已成功地在源域中的数据训练。通过利用标记的源域数据和未标记的目标域数据来执行目标域中的各种任务，提出了无监督的域适应（UDA）来对抗这一点。 UDA在自然图像处理，视频分析，自然语言处理，时间序列数据分析，医学图像分析等方面取得了令人鼓舞的结果。在本综述中，作为一个快速发展的主题，我们对其方法和应用程序进行了系统的比较。此外，还讨论了UDA与其紧密相关的任务的联系，例如域的概括和分布外检测。此外，突出显示了当前方法和可能有希望的方向的缺陷。

translated by 谷歌翻译

Return of Frustratingly Easy Domain Adaptation

Baochen Sun , Jiashi Feng , Kate Saenko

分类：

2015-11-17

Unlike human learning, machine learning often fails to handle changes between training (source) and test (target) input distributions. Such domain shifts, common in practical scenarios, severely damage the performance of conventional machine learning methods. Supervised domain adaptation methods have been proposed for the case when the target data have labels, including some that perform very well despite being "frustratingly easy" to implement. However, in practice, the target domain is often unlabeled, requiring unsupervised adaptation. We propose a simple, effective, and efficient method for unsupervised domain adaptation called CORrelation ALignment (CORAL). CORAL minimizes domain shift by aligning the second-order statistics of source and target distributions, without requiring any target labels. Even though it is extraordinarily simple-it can be implemented in four lines of Matlab code-CORAL performs remarkably well in extensive evaluations on standard benchmark datasets."Everything should be made as simple as possible, but not simpler."

translated by 谷歌翻译

Domain Adaptation Principal Component Analysis: base linear method for learning with out-of-distribution data

Evgeny M Mirkes , Jonathan Bac , Aziz Fouché , Sergey V. Stasenko , Andrei Zinovyev , Alexander N. Gorban

分类：机器学习

2022-08-28

域适应性是现代机器学习中的一种流行范式，旨在解决培训或验证数据集之间具有用于学习和测试分类器（源域）和潜在的大型未标记数据集的培训或验证数据集之间的分歧问题，其中利用了模型（目标域）（目标域）（目标域）。任务是找到源数据集的源和目标数据集的这种常见表示，其中源数据集提供了培训的信息，因此可以最大程度地减少来源和目标之间的差异。目前，最流行的领域适应性解决方案是基于训练神经网络，这些神经网络结合了分类和对抗性学习模块，这些模块是饥饿的，通常很难训练。我们提出了一种称为域适应性主成分分析（DAPCA）的方法，该方法发现线性减少的数据表示有助于解决域适应任务。 DAPCA基于数据点对之间引入正权重，并概括了主成分分析的监督扩展。 DAPCA代表一种迭代算法，因此在每次迭代中都解决了一个简单的二次优化问题。保证算法的收敛性，并且在实践中的迭代次数很少。我们验证了先前提出的用于解决域适应任务的基准的建议算法，还显示了在生物医学应用中对单细胞法数据集进行分析中使用DAPCA的好处。总体而言，考虑到源域和目标域之间可能的差异，DAPCA可以作为许多机器学习应用程序中有用的预处理步骤。

translated by 谷歌翻译

Semi-supervised Domain Adaptive Structure Learning

Can Qin , Lichen Wang , Qianqian Ma , Yu Yin , Huan Wang , Yun Fu

分类：计算机视觉 | 人工智能 | 机器学习

2021-12-12

半监督域适应（SSDA）是一种具有挑战性的问题，需要克服1）以朝向域的较差的数据和2）分布换档的方法。不幸的是，由于培训数据偏差朝标标样本训练，域适应（DA）和半监督学习（SSL）方法的简单组合通常无法解决这两个目的。在本文中，我们介绍了一种自适应结构学习方法，以规范SSL和DA的合作。灵感来自多视图学习，我们建议的框架由共享特征编码器网络和两个分类器网络组成，用于涉及矛盾的目的。其中，其中一个分类器被应用于组目标特征以提高级别的密度，扩大了鲁棒代表学习的分类集群的间隙。同时，其他分类器作为符号器，试图散射源功能以增强决策边界的平滑度。目标聚类和源扩展的迭代使目标特征成为相应源点的扩张边界内的封闭良好。对于跨域特征对齐和部分标记的数据学习的联合地址，我们应用最大平均差异（MMD）距离最小化和自培训（ST）将矛盾结构投影成共享视图以进行可靠的最终决定。对标准SSDA基准的实验结果包括Domainnet和Office-Home，展示了我们对最先进的方法的方法的准确性和稳健性。

translated by 谷歌翻译

Domain-Adaptive 3D Medical Image Synthesis: An Efficient Unsupervised Approach

Qingqiao Hu , Hongwei Li , Jianguo Zhang

分类：计算机视觉

2022-07-02

医疗图像合成引起了人们的关注，因为它可能会产生缺失的图像数据，改善诊断并受益于许多下游任务。但是，到目前为止，开发的合成模型并不适应显示域移位的看不见的数据分布，从而限制了其在临床常规中的适用性。这项工作着重于探索3D图像到图像合成模型的域适应性（DA）。首先，我们强调了分类，分割和合成模型之间DA的技术差异。其次，我们提出了一种基于近似3D分布的2D变异自动编码器的新型有效适应方法。第三，我们介绍了有关适应数据量和关键超参数量的影响的经验研究。我们的结果表明，所提出的方法可以显着提高3D设置中未见域的合成精度。该代码可在https://github.com/winstonhutiger/2d_vae_uda_for_3d_sythesis上公开获得。

translated by 谷歌翻译

Partial Domain Adaptation without Domain Alignment

Weikai Li , Songcan Chen

分类：计算机视觉 | 机器学习

2021-08-29

Unsupervised domain adaptation (UDA) aims to transfer knowledge from a well-labeled source domain to a different but related unlabeled target domain with identical label space. Currently, the main workhorse for solving UDA is domain alignment, which has proven successful. However, it is often difficult to find an appropriate source domain with identical label space. A more practical scenario is so-called partial domain adaptation (PDA) in which the source label set or space subsumes the target one. Unfortunately, in PDA, due to the existence of the irrelevant categories in the source domain, it is quite hard to obtain a perfect alignment, thus resulting in mode collapse and negative transfer. Although several efforts have been made by down-weighting the irrelevant source categories, the strategies used tend to be burdensome and risky since exactly which irrelevant categories are unknown. These challenges motivate us to find a relatively simpler alternative to solve PDA. To achieve this, we first provide a thorough theoretical analysis, which illustrates that the target risk is bounded by both model smoothness and between-domain discrepancy. Considering the difficulty of perfect alignment in solving PDA, we turn to focus on the model smoothness while discard the riskier domain alignment to enhance the adaptability of the model. Specifically, we instantiate the model smoothness as a quite simple intra-domain structure preserving (IDSP). To our best knowledge, this is the first naive attempt to address the PDA without domain alignment. Finally, our empirical results on multiple benchmark datasets demonstrate that IDSP is not only superior to the PDA SOTAs by a significant margin on some benchmarks (e.g., +10% on Cl->Rw and +8% on Ar->Rw ), but also complementary to domain alignment in the standard UDA

translated by 谷歌翻译

Subtype-Aware Dynamic Unsupervised Domain Adaptation

Xiaofeng Liu , Fangxu Xing , Jia You , Jun Lu , C. -C. Jay Kuo , Georges El Fakhri , Jonghye Woo

分类：计算机视觉 | 人工智能 | 机器学习

2022-08-16

无监督的域适应性（UDA）已成功地应用于没有标签的标记源域转移到目标域的知识。最近引入了可转移的原型网络（TPN），进一步解决了班级条件比对。在TPN中，虽然在潜在空间中明确执行了源和目标域之间的类中心的接近度，但尚未完全研究基础的细颗粒亚型结构和跨域紧凑性。为了解决这个问题，我们提出了一种新方法，以适应性地执行细粒度的亚型意识对准，以提高目标域的性能，而无需两个域中的子类型标签。我们方法的见解是，由于不同的条件和标签变化，同类中未标记的亚型在亚型内具有局部接近性，同时表现出不同的特征。具体而言，我们建议通过使用中间伪标签同时执行亚型的紧凑度和阶级分离。此外，我们系统地研究了有或不具有亚型数字的各种情况，并建议利用基本的亚型结构。此外，开发了一个动态队列框架，以使用替代处理方案稳步地进化亚型簇质心。与最先进的UDA方法相比，使用多视图的先天性心脏病数据和VISDA和域进行了实验结果，显示了我们的亚型意识UDA的有效性和有效性。

translated by 谷歌翻译

Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources

Yongchun Zhu , Fuzhen Zhuang , Deqing Wang

分类：机器学习 | 人工智能 | 计算机视觉

2022-01-04

虽然无监督的域适应（UDA）算法，即，近年来只有来自源域的标记数据，大多数算法和理论结果侧重于单源无监督域适应（SUDA）。然而，在实际情况下，标记的数据通常可以从多个不同的源收集，并且它们可能不仅不同于目标域而且彼此不同。因此，来自多个源的域适配器不应以相同的方式进行建模。最近基于深度学习的多源无监督域适应（Muda）算法专注于通过在通用特征空间中的所有源极和目标域的分布对齐来提取所有域的公共域不变表示。但是，往往很难提取Muda中所有域的相同域不变表示。此外，这些方法匹配分布而不考虑类之间的域特定的决策边界。为了解决这些问题，我们提出了一个新的框架，具有两个对准阶段的Muda，它不仅将每对源和目标域的分布对齐，而且还通过利用域特定的分类器的输出对准决策边界。广泛的实验表明，我们的方法可以对图像分类的流行基准数据集实现显着的结果。

translated by 谷歌翻译

I2F: A Unified Image-to-Feature Approach for Domain Adaptive Semantic Segmentation

Haoyu Ma , Xiangru Lin , Yizhou Yu

分类：计算机视觉

2023-01-03

Unsupervised domain adaptation (UDA) for semantic segmentation is a promising task freeing people from heavy annotation work. However, domain discrepancies in low-level image statistics and high-level contexts compromise the segmentation performance over the target domain. A key idea to tackle this problem is to perform both image-level and feature-level adaptation jointly. Unfortunately, there is a lack of such unified approaches for UDA tasks in the existing literature. This paper proposes a novel UDA pipeline for semantic segmentation that unifies image-level and feature-level adaptation. Concretely, for image-level domain shifts, we propose a global photometric alignment module and a global texture alignment module that align images in the source and target domains in terms of image-level properties. For feature-level domain shifts, we perform global manifold alignment by projecting pixel features from both domains onto the feature manifold of the source domain; and we further regularize category centers in the source domain through a category-oriented triplet loss and perform target domain consistency regularization over augmented target domain images. Experimental results demonstrate that our pipeline significantly outperforms previous methods. In the commonly tested GTA5$\rightarrow$Cityscapes task, our proposed method using Deeplab V3+ as the backbone surpasses previous SOTA by 8%, achieving 58.2% in mIoU.

translated by 谷歌翻译