智能论文笔记

Robust Onboard Localization in Changing Environments Exploiting Text Spotting

Nicky Zimmerman , Louis Wiesmann , Tiziano Guadagnino , Thomas Läbe , Jens Behley , Cyrill Stachniss

分类：机器人

2022-03-23

在给定地图中的强大定位是大多数自主机器人的关键组成部分。在本文中，我们解决了在室内环境中定位的问题，该问题在室内环境中发生了变化，而突出结构在不同时间点构建的地图中没有对应关系的问题。为了克服地图与由于这种变化引起的观察到的环境之间的差异，我们利用了人类可读的本地化提示来协助定位。这些提示很容易在大多数设施中获得，并且可以通过使用文本斑点来使用RGB摄像机图像来检测。我们使用在2D激光扫描和相机数据上运行的粒子过滤器将这些线索集成到蒙特卡洛本地化框架中。这样，我们为人类行走具有结构性变化和动态的环境提供了强大的本地化解决方案。我们在办公室环境中评估了有关多个挑战室内场景的本地化框架。实验表明，我们的方法对结构变化具有鲁棒性，并且可以在板载计算机上运行。我们（按照纸质接受）发布了方法的开源实现，该实现使用了现成的文本斑点，并用ROS包装器编写了C ++。

translated by 谷歌翻译

Future Sight: Dynamic Story Generation with Large Pretrained Language Models

Brian D. Zimmerman , Gaurav Sahu , Olga Vechtomova

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-20

Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a self-attention mechanism to emulate the property of autoregression. This is inherently limiting for tasks such as controllable story generation where it may be necessary to condition on future plot events when writing a story. In this work, we propose Future Sight, a method for finetuning a pretrained generative transformer on the task of future conditioning. Transformer decoders are typically pretrained on the task of completing a context, one token at a time, by means of self-attention. Future Sight additionally enables a decoder to attend to an encoded future plot event. This motivates the decoder to expand on the context in a way that logically concludes with the provided future. During inference, the future plot event can be written by a human author to steer the narrative being generated in a certain direction. We evaluate the efficacy of our approach on a story generation task with human evaluators.

translated by 谷歌翻译

Automatic lesion analysis for increased efficiency in outcome prediction of traumatic brain injury

Margherita Rosnati , Eyal Soreq , Miguel Monteiro , Lucia Li , Neil S. N. Graham , Karl Zimmerman , Carlotta Rossi , Greta Carrara , Guido Bertolini , David J. Sharp

分类：计算机视觉

2022-08-08

对脑外伤（TBI）患者的准确预后很难为治疗，患者管理和长期护理提供信息至关重要。年龄，运动和学生反应性，缺氧和低血压以及计算机断层扫描（CT）的放射学发现等患者特征已被确定为TBI结果预测的重要变量。 CT是临床实践中选择的急性成像方式，因为其获取速度和广泛的可用性。但是，这种方式主要用于定性和半定量评估，例如马歇尔评分系统，该系统容易受到主观性和人为错误。这项工作探讨了使用最先进的，深度学习的TBI病变分割方法从常规获得的医院入院CT扫描中提取的成像生物标志物的预测能力。我们使用病变体积和相应的病变统计作为扩展TBI结果预测模型的输入。我们将我们提出的功能的预测能力与马歇尔分数进行比较，并与经典的TBI生物标志物配对。我们发现，在预测不利的TBI结果时，自动提取的定量CT功能的性能与Marshall分数相似或更好。利用自动地图集对齐，我们还确定额叶外病变是不良预后的重要指标。我们的工作可能有助于更好地理解TBI，并提供有关如何使用自动化神经影像分析来改善TBI后预测的新见解。

translated by 谷歌翻译

Deep learning based non-contact physiological monitoring in Neonatal Intensive Care Unit

Nicky Nirlipta Sahoo , Balamurali Murugesan , Ayantika Das , Srinivasa Karthik , Keerthi Ram , Steffen Leonhardt , Jayaraj Joseph , Mohanasankar Sivaprakasam

分类：计算机视觉

2022-07-25

新生儿重症监护病房（NICU）中的早产婴儿必须不断监测其心脏健康。常规的监测方法是基于接触的，使新生儿容易受到各种医院感染。基于视频的监视方法为非接触式测量开辟了潜在的途径。这项工作提供了一条管道，用于远程对NICU设置视频的心肺信号进行远程估算。我们提出了一个端到端深度学习（DL）模型，该模型集成了一种基于基于学习的方法来生成替代地面真理（SGT）标签以进行监督，从而避免了直接依赖对真实地面真相标签的依赖。我们进行了扩展的定性和定量分析，以检查我们提出的基于DL的管道的功效，并在估计的心率中达到了总平均平均绝对误差为4.6 BEATS（BPM）（BPM）和均方根均方根误差为6.2 bpm。

translated by 谷歌翻译

Adaptive Sampling for Discovery

Ziping Xu , Eunjae Shim , Ambuj Tewari , Paul Zimmerman

分类： (统计)机器学习 | 机器学习

2022-05-30

In this paper, we study a sequential decision-making problem, called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal to maximize the sum of responses. This problem has wide applications to real-world discovery problems, for example drug discovery with the help of machine learning models. ASD algorithms face the well-known exploration-exploitation dilemma. The algorithm needs to choose points that yield information to improve model estimates but it also needs to exploit the model. We rigorously formulate the problem and propose a general information-directed sampling (IDS) algorithm. We provide theoretical guarantees for the performance of IDS in linear, graph and low-rank models. The benefits of IDS are shown in both simulation experiments and real-data experiments for discovering chemical reaction conditions.

translated by 谷歌翻译

Regular Decision Processes for Grid Worlds

Nicky Lenaers , Martijn van Otterlo

分类：人工智能

2021-11-05

马尔可夫决策过程通常用于不确定性下的顺序决策。然而，对于许多方面，从受约束或安全规范到任务和奖励结构中的各种时间（非Markovian）依赖性，需要扩展。为此，近年来，兴趣已经发展成为强化学习和时间逻辑的组合，即灵活的行为学习方法的组合，具有稳健的验证和保证。在本文中，我们描述了最近引入的常规决策过程的实验调查，该过程支持非马洛维亚奖励功能以及过渡职能。特别是，我们为常规决策过程，与在线，增量学习有关的算法扩展，对无模型和基于模型的解决方案算法的实证评估，以及以常规但非马尔维亚，网格世界的应用程序的算法扩展。

translated by 谷歌翻译

Multi Scale Identity-Preserving Image-to-Image Translation Network for Low-Resolution Face Recognition

Vahid Reza Khazaie , Nicky Bayat , Yalda Mohsenzadeh

分类：计算机视觉

2020-10-23

最新的深度神经网络模型已在受控的高分辨率面部图像上达到了几乎完美的面部识别精度。但是，当他们使用非常低分辨率的面部图像测试时，它们的性能会大大降低。这在监视系统中尤其重要，在监视系统中，低分辨率探测图像应与高分辨率图库图像匹配。超分辨率技术旨在从低分辨率对应物中产生高分辨率的面部图像。尽管它们能够重建视觉上吸引人的图像，但与身份相关的信息尚未保留。在这里，我们提出了一个具有身份的端到端图像到图像翻译的深度神经网络，该网络能够使其高分辨率的高分辨率面孔超级解决方案，同时保留与身份相关的信息。我们通过训练一个非常深的卷积编码器网络来实现这一目标，并在相应层之间具有对称收缩路径。该网络在多尺度的低分辨率条件下训练了重建和具有身份损失的结合。对我们提出的模型的广泛定量评估表明，它在自然和人工低分辨率的面部数据集甚至看不见的身份方面优于竞争超分辨率和低分辨率的面部识别方法。

translated by 谷歌翻译