智能论文笔记

Lung nodules segmentation from CT with DeepHealth toolkit

Hafiza Ayesha Hoor Chaudhry , Riccardo Renzulli , Daniele Perlo , Francesca Santinelli , Stefano Tibaldi , Carmen Cristiano , Marco Grosso , Attilio Fiandrotti , Maurizio Lucenteforte , Davide Cavagnino

分类：计算机视觉

2022-08-01

准确且一致的边界分割在肿瘤体积估计及其在医学图像分割领域中的处理中起着重要作用。在全球范围内，肺癌是死亡的主要原因之一，肺结节的早期发现对于早期癌症诊断和患者的存活率至关重要。这项研究的目的是证明DeepHealth Toolkit的可行性，包括PYECVL和PYEDDL库（包括精确的肺结节）。使用PYECVL和PYEDDL在UnitoChest上进行了肺结节分割的实验，以进行数据预处理以及神经网络训练。结果描述了在较宽的直径范围内对肺结节的准确分割，并且在传统检测方法上的准确性更好。本文中使用的数据集和代码可作为基线参考公开提供。

translated by 谷歌翻译

Improving Low-Resource Question Answering using Active Learning in Multiple Stages

Maximilian Schmidt , Andrea Bartezzaghi , Jasmina Bogojeska , A. Cristiano I. Malossi , Thang Vu

分类：自然语言处理 | 人工智能

2022-11-27

Neural approaches have become very popular in the domain of Question Answering, however they require a large amount of annotated data. Furthermore, they often yield very good performance but only in the domain they were trained on. In this work we propose a novel approach that combines data augmentation via question-answer generation with Active Learning to improve performance in low resource settings, where the target domains are diverse in terms of difficulty and similarity to the source domain. We also investigate Active Learning for question answering in different stages, overall reducing the annotation effort of humans. For this purpose, we consider target domains in realistic settings, with an extremely low amount of annotated samples but with many unlabeled documents, which we assume can be obtained with little effort. Additionally, we assume sufficient amount of labeled data from the source domain is available. We perform extensive experiments to find the best setup for incorporating domain experts. Our findings show that our novel approach, where humans are incorporated as early as possible in the process, boosts performance in the low-resource, domain-specific setting, allowing for low-labeling-effort question answering systems in new, specialized domains. They further demonstrate how human annotation affects the performance of QA depending on the stage it is performed.

translated by 谷歌翻译

Model-Assisted Labeling via Explainability for Visual Inspection of Civil Infrastructures

Klara Janouskova , Mattia Rigotti , Ioana Giurgiu , Cristiano Malossi

分类：计算机视觉

2022-09-22

为视觉细分标记图像是一项耗时的任务，尤其是在必须由专业专家注释者（例如土木工程）提供标签的应用领域中。在本文中，我们建议使用归因方法来利用专家注释者和在缺陷细分的情况下进行注释的专家注释之间的有价值的相互作用，以视觉检查民用基础架构。具体而言，对分类器进行了训练，可以检测缺陷，并与基于归因的方法和对抗性攀爬相结合，以生成和完善与分类输出相对应的分割掩码。这些用于在辅助标签框架中使用，在该框架中，注释者可以通过决定接受，拒绝或修改它们作为建议分割掩码与它们进行交互，并将交互记录为弱标签以进一步完善分类器。与对缺陷的手动注释相比，我们所提出的方法应用于自动视觉检查桥梁产生的现实数据集。

translated by 谷歌翻译

Survey on Deep Fuzzy Systems in regression applications: a view on interpretability

Jorge S. S. Júnior , Jérôme Mendes , Francisco Souza , Cristiano Premebida

分类：机器学习

2022-09-09

深度学习（DL）技术被回归问题所接受。最近在该领域发表的论文数量越来越多，包括调查和评论，表明，由于效率和具有高维数据的系统的良好精度，深层回归引起了社区的关注。但是，许多DL方法具有复杂的结构，这些结构对人类用户不易透明。访问这些模型的可解释性是解决敏感领域问题（例如网络安全系统，医疗，金融监视和工业过程）的重要因素。模糊逻辑系统（FLS）是可解释的模型，在文献中众所周知，能够通过具有成员资格学位的语言术语对复杂系统使用非线性表示，模仿了人类的思想。在可解释的人工智能的气氛中，有必要考虑开发智能模型的准确性和可解释性之间的权衡。本文旨在调查结合DL和FL的现有方法的最新方法，即深度模糊系统，以解决回归问题，配置当前在文献中尚不充分探索的主题，因此应进行全面调查。

translated by 谷歌翻译

LUCID: Exposing Algorithmic Bias through Inverse Design

Carmen Mazijn , Carina Prunkl , Andres Algaba , Jan Danckaert , Vincent Ginis

分类：机器学习

2022-08-26

AI系统可以在决策过程中创建，传播，支持和自动化偏见。为了减轻偏见的决策，我们俩都需要了解偏见的起源，并定义算法做出公平决定的含义。大多数群体公平概念通过计算输出上的统计指标来评估模型的结果平等。我们认为，这些输出指标会遇到内在障碍，并提出了一种互补的方法，该方法与对治疗平等的关注度的越来越多。通过通过规范逆设计（Lucid）找到不公平性，我们生成一个规范集，该集合显示了给定优选输出的模型所需的输入。该规范集揭示了模型的内部逻辑，并通过反复询问决策过程来暴露潜在的不道德偏见。我们评估了UCI成人和Compas数据集的LUCID，发现规范集检测到的一些偏见与输出指标的偏见不同。结果表明，通过将重点转移到治疗平等并研究算法的内部工作原理中，规范集是对算法公平评估工具箱的宝贵补充。

translated by 谷歌翻译

HTML版本

CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation

Cristiano Saltori , Fabio Galasso , Giuseppe Fiameni , Nicu Sebe , Elisa Ricci , Fabio Poiesi

分类：计算机视觉 | 人工智能 | 机器学习

2022-07-20

3D激光雷达语义细分对于自动驾驶是基础。最近已经提出了几种用于点云数据的无监督域适应性（UDA）方法，以改善不同传感器和环境的模型概括。研究图像域中研究UDA问题的研究人员表明，样品混合可以减轻域的转移。我们提出了一种针对点云UDA的样品混合的新方法，即组成语义混合（Cosmix），这是基于样品混合的第一种UDA方法。 Cosmix由一个两分支对称网络组成，该网络可以同时处理标记的合成数据（源）和现实世界中未标记的点云（目标）。每个分支通过从另一个域中混合选定的数据来在一个域上运行，并使用源标签和目标伪标签的语义信息。我们在两个大规模数据集上评估Cosmix，表明它的表现要优于最先进的方法。我们的代码可在https://github.com/saltoricristiano/cosmix-uda上找到。

translated by 谷歌翻译

GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation

Cristiano Saltori , Evgeny Krivosheev , Stéphane Lathuilière , Nicu Sebe , Fabio Galasso , Giuseppe Fiameni , Elisa Ricci , Fabio Poiesi

分类：计算机视觉 | 人工智能 | 机器学习

2022-07-20

3D点云语义细分对于自动驾驶至关重要。文献中的大多数方法都忽略了一个重要方面，即在处理动态场景时如何处理域转移。这可能会极大地阻碍自动驾驶车辆的导航能力。本文推进了该研究领域的最新技术。我们的第一个贡献包括分析点云细分中的新的未开发的方案，即无源的在线无监督域改编（SF-OUDA）。我们在实验上表明，最新的方法具有相当有限的能力，可以使预训练的深网模型以在线方式看不到域。我们的第二个贡献是一种依赖于自适应自我训练和几何传播的方法，以在线调整预训练的源模型，而无需源数据或目标标签。我们的第三个贡献是在一个充满挑战的设置中研究sf-ouda，其中源数据是合成的，目标数据是现实世界中捕获的点云。我们将最近的Synlidar数据集用作合成源，并引入了两个新的合成（源）数据集，这些数据集可以刺激未来的综合自动驾驶研究。我们的实验显示了我们分割方法对数千个现实点云的有效性。代码和合成数据集可在https://github.com/saltoricristiano/gipso-sfouda上找到。

translated by 谷歌翻译

High-Order Conditional Mutual Information Maximization for dealing with High-Order Dependencies in Feature Selection

Francisco Souza , Cristiano Premebida , Rui Araújo

分类：机器学习

2022-07-18

本文提出了一种基于条件互信息（CMI）的新型特征选择方法。提出的高阶条件互信息最大化（HOCMIM）将高阶依赖性纳入特征选择过程中，并且由于其自下而上的推导而具有直接的解释。HOCMIM源自CMI的链膨胀，并表示为最大化优化问题。最大化问题是使用贪婪的搜索过程解决的，该过程加快了整个功能选择过程。实验是在一组基准数据集上运行的（总共20个）。将HOCMIM与两个有监督的学习分类器（支持向量机和K-Nearest邻居）的结果进行比较。HOCMIM在准确性方面取得了最佳效果，并且表明要比高级特征选择的速度快。

translated by 谷歌翻译

Towards biologically plausible Dreaming and Planning in recurrent spiking networks

Cristiano Capone , Pier Stanislao Paolucci

分类：机器学习

2022-05-20

Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing the number of necessary interactions with the environment to learn a desirable policy. However, these methods require biological implausible ingredients, such as the detailed storage of older experiences, and long periods of offline learning. The optimal way to learn and exploit word-models is still an open question. Taking inspiration from biology, we suggest that dreaming might be an efficient expedient to use an inner model. We propose a two-module (agent and model) spiking neural network in which "dreaming" (living new experiences in a model-based simulated environment) significantly boosts learning. We also explore "planning", an online alternative to dreaming, that shows comparable performances. Importantly, our model does not require the detailed storage of experiences, and learns online the world-model and the policy. Moreover, we stress that our network is composed of spiking neurons, further increasing the biological plausibility and implementability in neuromorphic hardware.

translated by 谷歌翻译

Explainable Deep Learning Methods in Medical Imaging Diagnosis: A Survey

Cristiano Patrício , João C. Neves , Luís F. Teixeira

分类：人工智能 | 计算机视觉 | 机器学习

2022-05-10

深度学习的显着成功引起了人们对医学成像诊断的应用的兴趣。尽管最新的深度学习模型在分类不同类型的医学数据方面已经达到了人类水平的准确性，但这些模型在临床工作流程中几乎不采用，这主要是由于缺乏解释性。深度学习模型的黑盒子性提出了制定策略来解释这些模型的决策过程的必要性，从而导致了可解释的人工智能（XAI）主题的创建。在这种情况下，我们对应用于医学成像诊断的XAI进行了详尽的调查，包括视觉，基于示例和基于概念的解释方法。此外，这项工作回顾了现有的医学成像数据集和现有的指标，以评估解释的质量。此外，我们还包括一组基于报告生成的方法的性能比较。最后，还讨论了将XAI应用于医学成像以及有关该主题的未来研究指示的主要挑战。

translated by 谷歌翻译