智能论文笔记

Segmentation of Weakly Visible Environmental Microorganism Images Using Pair-wise Deep Learning Features

Frank Kulwa , Chen Li , Marcin Grzegorzek , Md Mamunur Rahaman , Kimiaki Shirahama , Sergey Kosov

分类：计算机视觉

2022-08-31

环境微生物（EMS）的使用通过监测和分解污染物提供了高效，低成本和无害的环境污染补救措施。这取决于如何正确分段和确定EMS。为了增强透明，嘈杂且对比度较低的弱可见EM图像的分割，在本研究中提出了成对深度学习功能网络（PDLF-NET）。 PDLFS的使用使网络通过将每个图像的成对深度学习特征与基本模型Segnet的不同块相连，从而使网络更加关注前景（EMS）。利用shi和tomas描述符，我们在贴片上提取每个图像的深度特征，这些图像使用VGG-16模型以每个描述符为中心。然后，为了学习描述符之间的中间特征，基于Delaunay三角定理进行功能的配对以形成成对的深度学习特征。在该实验中，PDLF-NET可实现89.24％，63.20％，77.27％，35.15％，89.72％，91.44％和89.30％的出色分割结果，分别为IOU，DICE，DICE，VOE，灵敏度，精确性和特定性，精确性和特定性，精确性和特定性，精确性和特定性。

translated by 谷歌翻译

HTML版本

IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach

Haoyuan Chen , Chen Li , Xiaoyan Li , Md Mamunur Rahaman , Weiming Hu , Yixin Li , Wanli Liu , Changhao Sun , Hongzan Sun , Xinyu Huang

分类：计算机视觉

2022-06-07

近年来，大肠癌已成为危害人类健康最重要的疾病之一。深度学习方法对于结直肠组织病理学图像的分类越来越重要。但是，现有方法更多地集中在使用计算机而不是人类计算机交互的端到端自动分类。在本文中，我们提出了一个IL-MCAM框架。它基于注意机制和互动学习。提出的IL-MCAM框架包括两个阶段：自动学习（AL）和交互性学习（IL）。在AL阶段，使用包含三种不同注意机制通道和卷积神经网络的多通道注意机制模型用于提取多通道特征进行分类。在IL阶段，提出的IL-MCAM框架不断地将错误分类的图像添加到交互式方法中，从而提高了MCAM模型的分类能力。我们对数据集进行了比较实验，并在HE-NCT-CRC-100K数据集上进行了扩展实验，以验证拟议的IL-MCAM框架的性能，分别达到98.98％和99.77％的分类精度。此外，我们进行了消融实验和互换性实验，以验证三个通道的能力和互换性。实验结果表明，所提出的IL-MCAM框架在结直肠组织病理学图像分类任务中具有出色的性能。

translated by 谷歌翻译

An application of Pixel Interval Down-sampling (PID) for dense tiny microorganism counting on environmental microorganism images

Jiawei Zhang , Ning Xu , Chen Li , Md Mamunur Rahaman , Yu-Dong Yao , Yu-Hao Lin , Jinghua Zhang , Tao Jiang , Wenjun Qin , Marcin Grzegorzek

分类：计算机视觉 | 人工智能

2022-04-04

本文提出了一个新颖的像素间隔下采样网络（PID-NET），以较高的精度计算任务，以更高的精度计数任务。 PID-NET是具有编码器架构的端到端卷积神经网络（CNN）模型。像素间隔向下采样操作与最大功能操作相连，以结合稀疏和密集的特征。这解决了计数时茂密物体的轮廓凝结的局限性。使用经典分割指标（骰子，Jaccard和Hausdorff距离）以及计数指标进行评估。实验结果表明，所提出的PID-NET具有最佳的性能和潜力，可以实现密集的微小对象计数任务，该任务在数据集上具有2448个酵母单元图像在数据集上达到96.97 \％的计数精度。通过与最新的方法进行比较，例如注意U-NET，SWIN U-NET和TRANS U-NET，提出的PID-NET可以分割具有更清晰边界和较少不正确的碎屑的密集的微小物体，这表明PID网络在准确计数的任务中的巨大潜力。

translated by 谷歌翻译

EMDS-6: Environmental Microorganism Image Dataset Sixth Version for Image Denoising, Segmentation, Feature Extraction, Classification and Detection Methods Evaluation

Peng Zhao , Chen Li , Md Mamunur Rahaman , Hao Xu , Pingli Ma , Hechen Yang , Hongzan Sun , Tao Jiang , Ning Xu , Marcin Grzegorzek

分类：计算机视觉

2021-12-14

环境微生物（EMS）在我们周围普遍存在，对人类社会的生存和发展产生了重要影响。然而，对环境微生物（EM）数据的高标准和严格要求导致现有相关数据库的不足，更不用说具有GT图像的数据库。这个问题严重影响了相关实验的进展。因此，本研究开发了环境微生物数据集第六版（EMDS-6），其中包含21种EMS。每种类型的EM包含40个原件和40 GT图像，总共1680个EM图像。在这项研究中，为了测试EMDS-6的有效性。我们选择图像处理方法的经典算法，例如图像去噪，图像分割和目标检测。实验结果表明，EMDS-6可用于评估图像去噪，图像分割，图像特征提取，图像分类和对象检测方法的性能。

translated by 谷歌翻译

GasHisSDB: A New Gastric Histopathology Image Dataset for Computer Aided Diagnosis of Gastric Cancer

Weiming Hu , Chen Li , Xiaoyan Li , Md Mamunur Rahaman , Jiquan Ma , Yong Zhang , Haoyuan Chen , Wanli Liu , Changhao Sun , Yudong Yao

分类：计算机视觉

2021-06-04

背景和目的：胃癌已经成为全球第五次常见的癌症，早期检测胃癌对于拯救生命至关重要。胃癌的组织病理学检查是诊断胃癌的金标准。然而，计算机辅助诊断技术是挑战，以评估由于公开胃组织病理学图像数据集的稀缺而评估。方法：在本文中，公布了一种贵族公共胃组织病理学子尺寸图像数据库（GashissdB）以识别分类器的性能。具体地，包括两种类型的数据：正常和异常，总共245,196个组织案例图像。为了证明图像分类领域的不同时期的方法在GashissdB上具有差异，我们选择各种分类器进行评估。选择七种古典机器学习分类器，三个卷积神经网络分类器和新颖的基于变压器的分类器进行测试，用于测试图像分类任务。结果：本研究采用传统机器学习和深入学习方法进行了广泛的实验，以证明不同时期的方法对GashissdB具有差异。传统的机器学习实现了86.08％的最佳精度率，最低仅为41.12％。深度学习的最佳准确性达到96.47％，最低为86.21％。分类器的精度率显着变化。结论：据我们所知，它是第一个公开的胃癌组织病理学数据集，包含大量的弱监督学习的图像。我们认为Gashissdb可以吸引研究人员来探索胃癌自动诊断的新算法，这可以帮助医生和临床环境中的患者。

translated by 谷歌翻译

Is the aspect ratio of cells important in deep learning? A robust comparison of deep learning methods for multi-scale cytopathology cell image classification: from convolutional neural networks to visual transformers

Wanli Liu , Chen Li , Md Mamunur Rahamana , Tao Jiang , Hongzan Sun , Xiangchen Wu , Weiming Hu , Haoyuan Chen , Changhao Sun , Yudong Yao

分类：计算机视觉

2021-05-16

宫颈癌是女性中一种非常常见和致命的癌症类型。细胞病理学图像通常用于筛选这种癌症。鉴于在手动筛查期间可能发生许多错误，已经开发了一种基于深度学习的计算机辅助诊断系统。深度学习方法需要输入图像的固定维度，但临床医学图像的尺寸不一致。图像的纵横比在直接调整它们的同时受到影响。临床上，细胞病理学图像内的细胞的纵横比为医生诊断癌症提供重要信息。因此，很难直接调整大小。然而，许多现有研究直接调整了图像的大小，并获得了高度稳健的分类结果。为了确定合理的解释，我们进行了一系列比较实验。首先，预处理SipakMed数据集的原始数据以获得标准和缩放数据集。然后，将数据集调整为224 x 224像素。最后，22种深度学习模型用于分类标准和缩放数据集。该研究的结果表明，深度学习模型对宫颈细胞病理学图像中细胞的纵横比变化是鲁棒的。此结论也通过Herlev DataSet验证。

translated by 谷歌翻译

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation

Dipika Singhania , Rahul Rahaman , Angela Yao

分类：计算机视觉

2022-12-20

Temporal action segmentation tags action labels for every frame in an input untrimmed video containing multiple actions in a sequence. For the task of temporal action segmentation, we propose an encoder-decoder-style architecture named C2F-TCN featuring a "coarse-to-fine" ensemble of decoder outputs. The C2F-TCN framework is enhanced with a novel model agnostic temporal feature augmentation strategy formed by the computationally inexpensive strategy of the stochastic max-pooling of segments. It produces more accurate and well-calibrated supervised results on three benchmark action segmentation datasets. We show that the architecture is flexible for both supervised and representation learning. In line with this, we present a novel unsupervised way to learn frame-wise representation from C2F-TCN. Our unsupervised learning approach hinges on the clustering capabilities of the input features and the formation of multi-resolution features from the decoder's implicit structure. Further, we provide the first semi-supervised temporal action segmentation results by merging representation learning with conventional supervised learning. Our semi-supervised learning scheme, called ``Iterative-Contrastive-Classify (ICC)'', progressively improves in performance with more labeled data. The ICC semi-supervised learning in C2F-TCN, with 40% labeled videos, performs similar to fully supervised counterparts.

translated by 谷歌翻译

A General Purpose Neural Architecture for Geospatial Systems

Nasim Rahaman , Martin Weiss , Frederik Träuble , Francesco Locatello , Alexandre Lacoste , Yoshua Bengio , Chris Pal , Li Erran Li , Bernhard Schölkopf

分类：机器学习 | 人工智能

2022-11-04

Geospatial Information Systems are used by researchers and Humanitarian Assistance and Disaster Response (HADR) practitioners to support a wide variety of important applications. However, collaboration between these actors is difficult due to the heterogeneous nature of geospatial data modalities (e.g., multi-spectral images of various resolutions, timeseries, weather data) and diversity of tasks (e.g., regression of human activity indicators or detecting forest fires). In this work, we present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias, pre-trained on large amounts of unlabelled earth observation data in a self-supervised manner. We envision how such a model may facilitate cooperation between members of the community. We show preliminary results on the first step of the roadmap, where we instantiate an architecture that can process a wide variety of geospatial data modalities and demonstrate that it can achieve competitive performance with domain-specific architectures on tasks relating to the U.N.'s Sustainable Development Goals.

translated by 谷歌翻译

Industry-Scale Orchestrated Federated Learning for Drug Discovery

Martijn Oldenhof , Gergely Ács , Balázs Pejó , Ansgar Schuffenhauer , Nicholas Holway , Noé Sturm , Arne Dieckmann , Oliver Fortmeier , Eric Boniface , Clément Mayer

分类：机器学习 | (统计)机器学习

2022-10-17

To apply federated learning to drug discovery we developed a novel platform in the context of European Innovative Medicines Initiative (IMI) project MELLODDY (grant n{\deg}831472), which was comprised of 10 pharmaceutical companies, academic research labs, large industrial companies and startups. The MELLODDY platform was the first industry-scale platform to enable the creation of a global federated model for drug discovery without sharing the confidential data sets of the individual partners. The federated model was trained on the platform by aggregating the gradients of all contributing partners in a cryptographic, secure way following each training iteration. The platform was deployed on an Amazon Web Services (AWS) multi-account architecture running Kubernetes clusters in private subnets. Organisationally, the roles of the different partners were codified as different rights and permissions on the platform and administrated in a decentralized way. The MELLODDY platform generated new scientific discoveries which are described in a companion paper.

translated by 谷歌翻译

Discrete Key-Value Bottleneck

Frederik Träuble , Anirudh Goyal , Nasim Rahaman , Michael Mozer , Kenji Kawaguchi , Yoshua Bengio , Bernhard Schölkopf

分类：机器学习 | 人工智能

2022-07-22

深度神经网络在数据流是I.I.D的规范环境中的预测和分类任务上表现良好，标记的数据很丰富，并且类标签平衡。随着分配变化的挑战，包括非平稳或不平衡数据流。解决了这一挑战的一种强大方法是在大量未标记的数据上对大型编码器进行自我监督的预处理，然后进行特定于任务的调整。鉴于一项新任务，更新这些编码器的权重是具有挑战性的，因为需要微调大量权重，因此，他们忘记了有关先前任务的信息。在目前的工作中，我们提出了一个模型体系结构来解决此问题，以一个离散的瓶颈为基础，其中包含成对的单独和可学习的（键，价值）代码。在此设置中，我们遵循编码；通过离散瓶颈处理表示形式；和解码范式，其中输入被馈送到预处理的编码器中，编码器的输出用于选择最近的键，并将相应的值馈送到解码器以求解当前任务。该模型只能在推理过程中获取和重复使用有限数量的这些（密钥，值）对，从而启用本地化和上下文依赖的模型更新。从理论上讲，我们研究了所提出的模型最小化分布的影响的能力，并表明与（键，值）配对的这种离散瓶颈降低了假设类别的复杂性。我们经验验证了提出的方法在各种基准数据集的挑战性分配转移方案下的好处，并表明所提出的模型将共同的脆弱性降低到非i.i.d。与其他各种基线相比，非平稳培训分布。

translated by 谷歌翻译