智能论文笔记

Incremental Cross-Domain Adaptation for Robust Retinopathy Screening via Bayesian Deep Learning

Taimur Hassan , Bilal Hassan , Muhammad Usman Akram , Shahrukh Hashmi , Abdel Hakim Taguri , Naoufel Werghi

分类：计算机视觉

2021-10-18

视网膜疗法代表一组视网膜疾病，如果不及时治疗，可能会导致严重的视觉障碍甚至失明。许多研究人员已经开发了自治系统，通过眼底和光学相干断层扫描（OCT）图像识别视网膜病变。然而，大多数这些框架采用传统的转移学习和微调方法，需要一种非常良好的注释训练数据来产生准确的诊断性能。本文提出了一种新型增量跨域适应仪，可以通过几次拍摄培训允许任何深度分类模型逐步学习OCT和眼底图像的异常视网膜病理。此外，与其竞争对手不同，所提出的仪器通过贝叶斯多目标函数驱动，不仅执行候选分类网络，不仅在增量培训期间保留其先前学到的知识，而且还确保网络了解先前学习的结构和语义关系病理学和新增的疾病类别在推理阶段有效地识别它们。所提出的框架，在用三种不同的扫描仪获得的六个公共数据集中评估，以筛选十三个视网膜病理，优于最先进的竞争对手，通过分别实现了0.9826和0.9846的整体准确性和F1得分。

translated by 谷歌翻译

Pixel Relationships-based Regularizer for Retinal Vessel Image Segmentation

Lukman Hakim , Takio Kurita

分类：计算机视觉 | 机器学习

2022-12-28

The task of image segmentation is to classify each pixel in the image based on the appropriate label. Various deep learning approaches have been proposed for image segmentation that offers high accuracy and deep architecture. However, the deep learning technique uses a pixel-wise loss function for the training process. Using pixel-wise loss neglected the pixel neighbor relationships in the network learning process. The neighboring relationship of the pixels is essential information in the image. Utilizing neighboring pixel information provides an advantage over using only pixel-to-pixel information. This study presents regularizers to give the pixel neighbor relationship information to the learning process. The regularizers are constructed by the graph theory approach and topology approach: By graph theory approach, graph Laplacian is used to utilize the smoothness of segmented images based on output images and ground-truth images. By topology approach, Euler characteristic is used to identify and minimize the number of isolated objects on segmented images. Experiments show that our scheme successfully captures pixel neighbor relations and improves the performance of the convolutional neural network better than the baseline without a regularization term.

translated by 谷歌翻译

Single-Image Super-Resolution Reconstruction based on the Differences of Neighboring Pixels

Huipeng Zheng , Lukman Hakim , Takio Kurita , Junichi Miyao

分类：计算机视觉 | 机器学习

2022-12-28

The deep learning technique was used to increase the performance of single image super-resolution (SISR). However, most existing CNN-based SISR approaches primarily focus on establishing deeper or larger networks to extract more significant high-level features. Usually, the pixel-level loss between the target high-resolution image and the estimated image is used, but the neighbor relations between pixels in the image are seldom used. On the other hand, according to observations, a pixel's neighbor relationship contains rich information about the spatial structure, local context, and structural knowledge. Based on this fact, in this paper, we utilize pixel's neighbor relationships in a different perspective, and we propose the differences of neighboring pixels to regularize the CNN by constructing a graph from the estimated image and the ground-truth image. The proposed method outperforms the state-of-the-art methods in terms of quantitative and qualitative evaluation of the benchmark datasets. Keywords: Super-resolution, Convolutional Neural Networks, Deep Learning

translated by 谷歌翻译

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

Andrey Ignatov , Radu Timofte , Maurizio Denna , Abdel Younes , Ganzorig Gankhuyag , Jingang Huh , Myeong Kyun Kim , Kihwan Yoon , Hyeon-Cheol Moon , Seungho Lee

分类：计算机视觉

2022-11-07

Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose the participants to design an efficient quantized image super-resolution solution that can demonstrate a real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to do a high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 60 FPS rate when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper.

translated by 谷歌翻译

NAAP-440 Dataset and Baseline for Network Architecture Accuracy Prediction

Tal Hakim

分类：计算机视觉 | 机器学习 | 神经与进化计算

2022-09-14

网络体系结构搜索（NAS）已成为开发和发现针对不同目标平台和目的的新神经体系结构的常见方法。但是，扫描搜索空间由许多候选体系结构的长期培训过程组成，这在计算资源和时间方面是昂贵的。回归算法是预测候选体系结构准确性的常见工具，可以极大地加速搜索过程。我们旨在提出一个新的基准，该基线将支持回归算法的开发，该算法可以通过其计划来预测建筑的准确性，或者仅通过对其进行训练以最少的时代进行训练。因此，我们介绍了440个神经体系结构的NAAP-440数据集，这些数据集使用固定配方在CIFAR10进行培训。我们的实验表明，通过使用现成的回归算法并运行多达10％的培训过程，不仅可以准确地预测体系结构的准确性，而且还可以预测架构预测的值，还可以保持其准确性顺序违反单调性的数量最少。这种方法可以作为加速基于NAS的研究的强大工具，从而大大提高其效率。研究中使用的数据集和代码已公开。

translated by 谷歌翻译

CustOmics: A versatile deep-learning based strategy for multi-omics integration

Hakim Benkirane , Yoann Pradat , Stefan Michiels , Paul-Henry Cournède

分类：机器学习

2022-09-12

高通量测序技术的最新进展使得可以提取多个特征，这些特征描绘了以不同和互补分子水平的患者样本。此类数据的产生导致了计算生物学方面的新挑战，这些挑战涉及捕获多个基因及其功能之间相互关系的高维和异质数据集的整合。由于它们的多功能性和学习复杂数据的合成潜在表示的能力，深度学习方法为整合多词数据提供了有希望的观点。这些方法导致了许多主要基于自动编码器模型的许多原始体系结构的概念。但是，由于任务的困难，集成策略是基本的，而不是失去全球趋势而充分利用来源的特殊性。本文提出了一种新型策略，以构建可自定义的自动编码器模型，该模型适应高维多源集成而言使用的数据集。我们将评估整合策略对潜在代表的影响，并结合提出一种新方法的最佳策略（https://github.com/hakimbenkirane/customics）。我们在这里关注来自多个OMIC来源的数据的集成，并证明了针对多个任务（例如分类和生存分析）的测试用例的拟议方法的性能。

translated by 谷歌翻译

Show Me Your Face, And I'll Tell You How You Speak

Christen Millerdurai , Lotfy Abdel Khaliq , Timon Ulrich

分类：计算机视觉

2022-06-28

当我们讲话时，可以从嘴唇的运动中推断出演讲的韵律和内容。在这项工作中，我们探讨了唇部综合的唇部任务，即，仅考虑说话者的唇部运动，我们将学习言语的唇部运动，我们专注于学习准确的唇部，以在不受限制的大型词汇环境中为多个说话者提供语音映射。我们通过其面部特征，即年龄，性别，种族和嘴唇动作来捕捉说话者的声音身份，即产生说话者身份的言语。为此，我们提出了一种新颖的方法“ lip2speech”，并采用关键设计选择，以实现无约束场景中语音合成的准确唇部。我们还使用定量，定性指标和人类评估进行了各种实验和广泛的评估。

translated by 谷歌翻译

Convolutional Layers Are Not Translation Equivariant

Nick McGreivy , Ammar Hakim

分类：计算机视觉 | 机器学习

2022-06-10

本文的目的是纠正对卷积神经网络（CNN）的误解。CNN由卷积层组成，由于重量共享，这些卷积层是偏移的。但是，与普遍的看法相反，即使忽略边界效应以及汇集和亚采样不存在，卷积层也不是翻译等效的。这是因为移位均值是一种离散的对称性，而翻译等效性是连续的对称性。这种离散的系统一般不会继承持续的等效性是对模棱两可的深度学习的基本限制。我们讨论了这一事实的两个含义。首先，尽管没有继承其模型的物理系统的翻译等效性，但CNN在图像处理方面取得了成功。其次，使用CNN求解偏微分方程（PDE）不会导致翻译模化求解器。

translated by 谷歌翻译

Metrics reloaded: Pitfalls and recommendations for image analysis validation

Lena Maier-Hein , Annika Reinke , Evangelia Christodoulou , Ben Glocker , Patrick Godau , Fabian Isensee , Jens Kleesiek , Michal Kozubek , Mauricio Reyes , Michael A. Riegler

分类：计算机视觉

2022-06-03

自动生物医学图像分析的领域至关重要地取决于算法验证的可靠和有意义的性能指标。但是，当前的度量使用通常是不明智的，并且不能反映基本的域名。在这里，我们提出了一个全面的框架，该框架指导研究人员以问题意识的方式选择绩效指标。具体而言，我们专注于生物医学图像分析问题，这些问题可以解释为图像，对象或像素级别的分类任务。该框架首先编译域兴趣 - 目标结构 - ，数据集和算法与输出问题相关的属性的属性与问题指纹相关，同时还将其映射到适当的问题类别，即图像级分类，语义分段，实例，实例细分或对象检测。然后，它指导用户选择和应用一组适当的验证指标的过程，同时使他们意识到与个人选择相关的潜在陷阱。在本文中，我们描述了指标重新加载推荐框架的当前状态，目的是从图像分析社区获得建设性的反馈。当前版本是在由60多个图像分析专家的国际联盟中开发的，将在社区驱动的优化之后公开作为用户友好的工具包提供。

translated by 谷歌翻译

Service Discovery in Social Internet of Things using Graph Neural Networks

Aymen Hamrouni , Hakim Ghazzai , Yehia Massoud

分类：机器学习

2022-05-25

The-Internet（IoT）网络智能地连接了数千个物理实体，为社区提供各种服务。它目睹了指数扩展，这使发现网络中存在的IoT设备并请求相应的服务的过程变得复杂。随着物联网环境的高度动态性质阻碍了传统的服务发现解决方案的使用，我们在本文中，通过提出一个可扩展的资源分配神经模型来解决此问题，足以适合异构的大型IoT网络。我们设计了一种图形神经网络（GNN）方法，该方法利用IoT网络中设备之间形成的社会关系来减少任何实体查找的搜索空间，并从网络中的另一个设备中获取服务。这种提出的资源分配方法超过了标准化问题，并通过GNNS的方式嵌入了社会物联网图的结构和特征，以最终的聚类分析过程。对现实世界数据集的仿真结果说明了该解决方案的性能及其在大规模IoT网络上运行的显着效率。

translated by 谷歌翻译