智能论文笔记

ALOHA: Artificial Learning of Human Attributes for Dialogue Agents

Aaron W. Li , Veronica Jiang , Steven Y. Feng , Julia Sprague , Wei Zhou , Jesse Hoey

分类：自然语言处理 | 人工智能 | 机器学习

2019-10-18

对于谈话的AI和虚拟助手以现实的方式与人类沟通，他们必须表现出人类特征，例如情感和个性的表达。目前对构建人类对话剂的尝试呈现出显着的困难。我们提出基于Tropes的人为水平属性（HLA）作为学习对话代理的方法，这些方法可以模仿虚构人物的个性。 Tropes是由观察员的次要观察和确定的虚构个性的特征。通过将详细的HLA数据与特定字符的对话数据组合，我们提供了一个数据集，HLA-Chat，模型字符配置文件，并提供对话代理通过HLA学习角色语言样式的能力。然后，我们介绍了一个三组件系统，Aloha（代表人工学习人为学习），它结合了字符空间映射，角色社区检测和语言样式检索，以构建特定字符（或个性）特定语言模型。我们的初步实验表明Aloha的两种变化与我们提出的数据集相结合，可以在识别所选择的目标字符的正确对话响应时占据基线模型，并且无论字符的身份，节目类型如何，都是稳定的对话。

translated by 谷歌翻译

Learning Disturbances Online for Risk-Aware Control: Risk-Aware Flight with Less Than One Minute of Data

Prithvi Akella , Skylar X. Wei , Joel W. Burdick , Aaron D. Ames

分类：机器学习 | 机器人

2022-12-12

Recent advances in safety-critical risk-aware control are predicated on apriori knowledge of the disturbances a system might face. This paper proposes a method to efficiently learn these disturbances online, in a risk-aware context. First, we introduce the concept of a Surface-at-Risk, a risk measure for stochastic processes that extends Value-at-Risk -- a commonly utilized risk measure in the risk-aware controls community. Second, we model the norm of the state discrepancy between the model and the true system evolution as a scalar-valued stochastic process and determine an upper bound to its Surface-at-Risk via Gaussian Process Regression. Third, we provide theoretical results on the accuracy of our fitted surface subject to mild assumptions that are verifiable with respect to the data sets collected during system operation. Finally, we experimentally verify our procedure by augmenting a drone's controller and highlight performance increases achieved via our risk-aware approach after collecting less than a minute of operating data.

translated by 谷歌翻译

Deep filter bank regression for super-resolution of anisotropic MR brain images

Samuel W. Remedios , Shuo Han , Yuan Xue , Aaron Carass , Trac D. Tran , Dzung L. Pham , Jerry L. Prince

分类：计算机视觉

2022-09-06

在2D多板磁共振（MR）采集中，平面信号通常比面内信号较低。尽管当代超分辨率（SR）方法旨在恢复基本的高分辨率量，但估计的高频信息是通过端到端数据驱动的培训隐含的，而不是明确说明和寻求。为了解决这个问题，我们根据完美的重建过滤库重新构架SR问题声明，使我们能够识别并直接估计缺失的信息。在这项工作中，我们提出了一种两阶段的方法，以近似于与特定扫描的各向异性采集相对应的完美重建过滤库。在第1阶段，我们使用梯度下降估算缺失的过滤器，在第2阶段，我们使用深网来学习从粗系数到细节系数的映射。此外，提出的公式不依赖外部训练数据，从而规避了对域移位校正的需求。在我们的方法下，特别是在“切片差距”方案中提高了SR性能，这可能是由于框架施加的解决方案空间的限制。

translated by 谷歌翻译

Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images

Zhenzhen Wang , Carla Saoud , Sintawat Wangsiricharoen , Aaron W. James , Aleksander S. Popel , Jeremias Sulam

分类：计算机视觉 | 机器学习

2021-09-22

在病理样本的全坡度图像（WSI）中注释癌区域在临床诊断，生物医学研究和机器学习算法开发中起着至关重要的作用。但是，产生详尽而准确的注释是劳动密集型，具有挑战性和昂贵的。仅绘制粗略和近似注释是一项容易得多的任务，成本较小，并且可以减轻病理学家的工作量。在本文中，我们研究了在数字病理学中完善这些近似注释以获得更准确的问题的问题。以前的一些作品探索了从这些不准确的注释中获得机器学习模型，但是很少有人解决改进问题，在这些问题中，应该明确识别和纠正错误标签的区域，并且所有这些都需要大量的培训样本（通常很大）。我们提出了一种名为标签清洁多个实例学习（LC-MIL）标签的方法，可在不需要外部培训数据的情况下对单个WSI进行粗略注释。从WSI裁剪的带有不准确标签的贴片在多个实例学习框架内共同处理，从而减轻了它们对预测模型的影响并完善分割。我们对具有乳腺癌淋巴结转移，肝癌和结直肠癌样品的异质WSI进行的实验表明，LC-MIL显着完善了粗糙的注释，即使从单个幻灯片中学习，LC-MIL也优于最先进的替代方案。此外，我们证明了拟议方法如何有效地完善和改进病理学家绘制的真实注释。所有这些结果表明，LC-MIL是一种有前途的，轻巧的工具，可提供从粗糙注释的病理组中提供细粒的注释。

translated by 谷歌翻译

Model-Free Safety-Critical Control for Robotic Systems

Tamas G. Molnar , Ryan K. Cosner , Andrew W. Singletary , Wyatt Ubellacker , Aaron D. Ames

分类：机器人

2021-09-19

本文介绍了机器人系统的安全关键控制的框架，当配置空间中的安全区域上定义了安全区域时。为了保持安全性，我们基于控制屏障函数理论综合安全速度而不依赖于机器人的A可能复杂的高保真动态模型。然后，我们跟踪跟踪控制器的安全速度。这使得在无模型安全关键控制中。我们证明了拟议方法的理论安全保障。最后，我们证明这种方法是适用于棘手的。我们在高保真仿真中使用SEGWAY执行障碍避免任务，以及在硬件实验中的无人机和Quadruped。

translated by 谷歌翻译

Control Barrier Function Based Quadratic Programs for Safety Critical Systems

Aaron D. Ames , Xiangru Xu , Jessy W. Grizzle , Paulo Tabuada

分类：

2016-09-21

Safety critical systems involve the tight coupling between potentially conflicting control objectives and safety constraints. As a means of creating a formal framework for controlling systems of this form, and with a view toward automotive applications, this paper develops a methodology that allows safety conditions-expressed as control barrier functionsto be unified with performance objectives-expressed as control Lyapunov functions-in the context of real-time optimizationbased controllers. Safety conditions are specified in terms of forward invariance of a set, and are verified via two novel generalizations of barrier functions; in each case, the existence of a barrier function satisfying Lyapunov-like conditions implies forward invariance of the set, and the relationship between these two classes of barrier functions is characterized. In addition, each of these formulations yields a notion of control barrier function (CBF), providing inequality constraints in the control input that, when satisfied, again imply forward invariance of the set. Through these constructions, CBFs can naturally be unified with control Lyapunov functions (CLFs) in the context of a quadratic program (QP); this allows for the achievement of control objectives (represented by CLFs) subject to conditions on the admissible states of the system (represented by CBFs). The mediation of safety and performance through a QP is demonstrated on adaptive cruise control and lane keeping, two automotive control problems that present both safety and performance considerations coupled with actuator bounds.

translated by 谷歌翻译

Coarse-to-Fine Contrastive Learning on Graphs

Peiyao Zhao , Yuangang Pan , Xin Li , Xu Chen , Ivor W. Tsang , Lejian Liao

分类：机器学习

2022-12-13

Inspired by the impressive success of contrastive learning (CL), a variety of graph augmentation strategies have been employed to learn node representations in a self-supervised manner. Existing methods construct the contrastive samples by adding perturbations to the graph structure or node attributes. Although impressive results are achieved, it is rather blind to the wealth of prior information assumed: with the increase of the perturbation degree applied on the original graph, 1) the similarity between the original graph and the generated augmented graph gradually decreases; 2) the discrimination between all nodes within each augmented view gradually increases. In this paper, we argue that both such prior information can be incorporated (differently) into the contrastive learning paradigm following our general ranking framework. In particular, we first interpret CL as a special case of learning to rank (L2R), which inspires us to leverage the ranking order among positive augmented views. Meanwhile, we introduce a self-ranking paradigm to ensure that the discriminative information among different nodes can be maintained and also be less altered to the perturbations of different degrees. Experiment results on various benchmark datasets verify the effectiveness of our algorithm compared with the supervised and unsupervised models.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Evaluation of Medical Image Segmentation Models for Uncertain, Small or Empty Reference Annotations

Sophie Ostmeier , Brian Axelrod , Jeroen Bertels , Fabian Isensee , Maarten G. Lansberg , Soren Christensen , Gregory W. Albers , Li-Jia Li , Jeremy J. Heit

分类：计算机视觉 | 机器学习

2022-09-26

医学图像分割模型的性能指标用于衡量参考注释和预测之间的一致性。在开发此类模型中，使用了一组通用指标，以使结果更具可比性。但是，公共数据集中的分布与临床实践中遇到的案例之间存在不匹配。许多常见的指标无法衡量这种不匹配的影响，尤其是对于包含不确定，小或空参考注释的临床数据集。因此，可能无法通过此类指标来验证模型在临床上有意义的一致性。评估临床价值的维度包括独立于参考注释量的大小，考虑参考注释的不确定性，体积计和/或位置一致性的奖励以及对空参考注释正确分类的奖励。与普通的公共数据集不同，我们的内部数据集更具代表性。它包含不确定的，小或空的参考注释。我们研究了有关深度学习框架的预测的公开度量指标，以确定哪些设置共同指标可提供有意义的结果。我们将公共基准数据集进行比较而没有不确定，小或空参考注释。该代码将发布。

translated by 谷歌翻译

Text Simplification of College Admissions Instructions: A Professionally Simplified and Verified Corpus

Zachary W. Taylor , Maximus H. Chu , Junyi Jessy Li

分类：自然语言处理

2022-09-09

接受高等教育对于少数族裔和新兴双语学生至关重要。但是，高等教育机构用来与准学生交流的语言通常太复杂了。具体而言，美国的许多机构发布录取申请指令远远高于典型高中毕业生的平均阅读水平，通常接近13年级或14年级。这导致学生之间不必要的障碍和获得高等教育。这项工作旨在通过简化文本来应对这一挑战。我们介绍PSAT（专业简化的录取文本），这是一个数据集，其中有112条从美国的高等教育机构中随机选择的录取说明。然后，这些文本将被专业地简化，并被各个机构招生办公室的专职员工专家进行了验证和接受。此外，PSAT带有1,883个原始简化句子对的手动对齐。结果是在与现有简化资源不同的高风险流派中评估和微调文本简化系统的首个语料库。

translated by 谷歌翻译