智能论文笔记

Object-based active inference

Ruben S. van Bergen , Pablo L. Lanillos

分类：人工智能

2022-09-02

世界由对象组成：具有独立属性和动态的不同实体。为了使代理人聪明地与世界互动，他们必须将感觉输入转化为描述每个对象的边界特征。这些基于对象的表示形成了计划行为的自然基础。主动推断（AIF）是对感知和行动的影响力的统一说明，但是现有的AIF模型并未利用这种重要的归纳偏见。为了解决这个问题，我们介绍了“基于对象的主动推理”（OBAI），将AIF与最近基于对象的神经网络结合在一起。 Obai代表具有不同变异信念的不同对象，并使用选择性注意来将输入输入到相应的对象插槽中。对象表示具有独立的基于动作的动态。动力学和生成模型是从简单环境（主动的多-DSPRITES）的经验中学到的。我们表明，奥贝（Obai）学会了从视频输入中正确分割动作扰动的对象，并将这些对象操纵到任意目标。

translated by 谷歌翻译

3-D PET Image Generation with tumour masks using TGAN

Robert V Bergen , Jean-Francois Rajotte , Fereshteh Yousefirizi , Ivan S Klyuzhin , Arman Rahmim , Raymond T. Ng

分类：计算机视觉 | 机器学习

2021-11-02

对疾病的诊断或图像分割医学图像训练计算机视觉相关算法是缺乏训练数据，标记的样品，和隐私问题的困难所致。出于这个原因，一个强大的生成方法来创建合成数据后高度寻求。然而，大多数三维图像生成器需要额外的图像输入或者是非常占用大量内存。为了解决这些问题，我们建议调整视频生成技术3-d图像生成。使用时间GAN（TGAN）架构，我们将展示我们能够产生逼真的头部和颈部PET图像。我们还表明，通过调节肿瘤口罩发电机，我们能够控制肿瘤的几何形状和位置，在生成的图像。为了测试合成影像的用途，我们使用合成的图像训练分割模型。空调真实肿瘤掩模合成图像被自动分割，和对应的真实图像也分割。我们评估使用的骰子得分的分割，并找到两个数据集（0.65合成数据，0.70的真实数据）同样的分割算法执行。然后，各种radionomic特征在分割的肿瘤体积为每个数据集来计算。真实的和合成的特征分布的比较显示，8七个特征分布有统计学不显着差异（p> 0.05）。还计算所有radionomic特征之间的相关系数，它是示出了所有在真实数据组中的强统计相关的在合成数据集被保留。

translated by 谷歌翻译

Asymptotics of representation learning in finite Bayesian neural networks

Jacob A. Zavatone-Veth , Abdulkadir Canatar , Benjamin S. Ruben , Cengiz Pehlevan

分类：机器学习 | (统计)机器学习

2021-06-01

最近的作品表明，有限的贝叶斯神经网络有时可能会越优于其无限堂兄弟，因为有限网络可以灵活地调整其内部表示。然而，我们对有限网络的学习隐藏层表示如何与无限网络的固定表示不同的理论理解仍然不完整。研究了对网络的扰动有限宽度校正，但已经研究过的网络，但学习特征的渐近学尚未完全表征。在这里，我们认为具有线性读数和高斯可能性的任何贝叶斯网络的平均特征内核的领先有限宽度校正具有很大程度上的普遍形式。我们明确地说明了三个易行网络架构：深线性完全连接和卷积网络，以及具有单个非线性隐藏层的网络。我们的结果开始阐明任务相关的学习信号如何塑造宽阔的贝叶斯神经网络的隐藏层表示。

translated by 谷歌翻译

Optimizing ship detection efficiency in SAR images

Arthur Van Meerbeeck , Jordy Van Landeghem , Ruben Cartuyvels , Marie-Francine Moens

分类：计算机视觉

2022-12-12

The detection and prevention of illegal fishing is critical to maintaining a healthy and functional ecosystem. Recent research on ship detection in satellite imagery has focused exclusively on performance improvements, disregarding detection efficiency. However, the speed and compute cost of vessel detection are essential for a timely intervention to prevent illegal fishing. Therefore, we investigated optimization methods that lower detection time and cost with minimal performance loss. We trained an object detection model based on a convolutional neural network (CNN) using a dataset of satellite images. Then, we designed two efficiency optimizations that can be applied to the base CNN or any other base model. The optimizations consist of a fast, cheap classification model and a statistical algorithm. The integration of the optimizations with the object detection model leads to a trade-off between speed and performance. We studied the trade-off using metrics that give different weight to execution time and performance. We show that by using a classification model the average precision of the detection model can be approximated to 99.5% in 44% of the time or to 92.7% in 25% of the time.

translated by 谷歌翻译

Online Real-time Learning of Dynamical Systems from Noisy Streaming Data

S. Sinha , Sai P. Nandanoori , David Barajas-Solano

分类：机器学习

2022-12-10

Recent advancements in sensing and communication facilitate obtaining high-frequency real-time data from various physical systems like power networks, climate systems, biological networks, etc. However, since the data are recorded by physical sensors, it is natural that the obtained data is corrupted by measurement noise. In this paper, we present a novel algorithm for online real-time learning of dynamical systems from noisy time-series data, which employs the Robust Koopman operator framework to mitigate the effect of measurement noise. The proposed algorithm has three main advantages: a) it allows for online real-time monitoring of a dynamical system; b) it obtains a linear representation of the underlying dynamical system, thus enabling the user to use linear systems theory for analysis and control of the system; c) it is computationally fast and less intensive than the popular Extended Dynamic Mode Decomposition (EDMD) algorithm. We illustrate the efficiency of the proposed algorithm by applying it to identify the Van der Pol oscillator, the IEEE 68 bus system, and a ring network of Van der Pol oscillators.

translated by 谷歌翻译

Vibration suppression of a state-of-the-art wafer gripper

M. El Ajjaj , M. B. Kaczmarek , M. A. C. C. van den Hurk , S. H. HosseinNia

分类：机器人

2022-12-04

In this paper the implementation of piezoelectrics to a state-of-the-art wafer gripper is investigated. The objective is to propose and validate a solution method, which includes a mechanical design and control system, to achieve at least 5% damping for two eigenmodes of a wafer gripper. This objective serves as a 'proof of concept' to show the possibilities of implementing a state-of-the-art damping method to an industrial application, which in turn can be used to dampen different thin structures. The coupling relation between the piezoelectrics and their host structure were used to design the placement of the piezoelectric patches, together with modal analysis data of the a state-of-the-art wafer gripper. This data had been measured through an experimental setup. Active damping has been succesfully implemented onto the wafer gripper where positive position feedback (PPF) is used as a control algorithm to dampen two eigenmodes.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

R. Abbasi , M. Ackermann , J. Adams , N. Aggarwal , J. A. Aguilar , M. Ahlers , M. Ahrens , J. M. Alameddine , A. A. Alves Jr. , N. M. Amin

分类：机器学习

2022-09-07

ICECUBE是一种用于检测1 GEV和1 PEV之间大气和天体中微子的光学传感器的立方公斤阵列，该阵列已部署1.45 km至2.45 km的南极的冰盖表面以下1.45 km至2.45 km。来自ICE探测器的事件的分类和重建在ICeCube数据分析中起着核心作用。重建和分类事件是一个挑战，这是由于探测器的几何形状，不均匀的散射和冰中光的吸收，并且低于100 GEV的光，每个事件产生的信号光子数量相对较少。为了应对这一挑战，可以将ICECUBE事件表示为点云图形，并将图形神经网络（GNN）作为分类和重建方法。 GNN能够将中微子事件与宇宙射线背景区分开，对不同的中微子事件类型进行分类，并重建沉积的能量，方向和相互作用顶点。基于仿真，我们提供了1-100 GEV能量范围的比较与当前ICECUBE分析中使用的当前最新最大似然技术，包括已知系统不确定性的影响。对于中微子事件分类，与当前的IceCube方法相比，GNN以固定的假阳性速率（FPR）提高了信号效率的18％。另外，GNN在固定信号效率下将FPR的降低超过8（低于半百分比）。对于能源，方向和相互作用顶点的重建，与当前最大似然技术相比，分辨率平均提高了13％-20％。当在GPU上运行时，GNN能够以几乎是2.7 kHz的中位数ICECUBE触发速率的速率处理ICECUBE事件，这打开了在在线搜索瞬态事件中使用低能量中微子的可能性。

translated by 谷歌翻译

Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods

Bram van Es , Leon C. Reteig , Sander C. Tan , Marijn Schraagen , Myrthe M. Hemker , Sebastiaan R. S. Arends , Miguel A. R. Rios , Saskia Haitjema

分类：自然语言处理 | 机器学习 | (统计)机器学习

2022-09-01

由于结构化数据通常不足，因此在开发用于临床信息检索和决策支持系统模型时，需要从电子健康记录中的自由文本中提取标签。临床文本中最重要的上下文特性之一是否定，这表明没有发现。我们旨在通过比较荷兰临床注释中的三种否定检测方法来改善标签的大规模提取。我们使用Erasmus医疗中心荷兰临床语料库比较了基于ContextD的基于规则的方法，即使用MEDCAT和（Fineted）基于Roberta的模型的BilstM模型。我们发现，Bilstm和Roberta模型都在F1得分，精度和召回方面始终优于基于规则的模型。此外，我们将每个模型的分类错误系统地分类，这些错误可用于进一步改善特定应用程序的模型性能。在性能方面，将三个模型结合起来并不有益。我们得出的结论是，尤其是基于Bilstm和Roberta的模型在检测临床否定方面非常准确，但是最终，根据手头的用例，这三种方法最终都可以可行。

translated by 谷歌翻译

HTML版本

Hidden Author Bias in Book Recommendation

Savvina Daniil , Mirjam Cuper , Cynthia C. S. Liem , Jacco van Ossenbruggen , Laura Hollink

分类：人工智能

2022-09-01

协作过滤算法的优点是不需要敏感的用户或项目信息来提供建议。但是，他们仍然遭受与公平相关的问题的困扰，例如受欢迎程度偏见。在这项工作中，我们认为，当未向研究人员提供其他用户或项目信息时，受欢迎程度偏差通常会导致其他偏见。我们在书籍中使用书籍评分的常用数据集中的建议案例中检查了我们的假设。我们使用公开可用的外部资源将其丰富了作者信息。我们发现流行的书籍主要是由美国公民在数据集中撰写的，并且与用户的配置文件相比，流行的协作过滤算法往往会过分推荐这些书籍。我们得出的结论是，学者社区应进一步研究受欢迎程度偏见的社会含义。

translated by 谷歌翻译