智能论文笔记

Optimizing Data Collection in Deep Reinforcement Learning

James Gleeson , Daniel Snider , Yvonne Yang , Moshe Gabel , Eyal de Lara , Gennady Pekhimenko

分类：机器学习

2022-07-15

强化学习（RL）的工作负载需要臭名昭著的时间来训练，因为在运行时间从模拟器收集了大量样本。不幸的是，群集扩展方法仍然很昂贵，并且在GPU计算之间来回切换时，模拟器的常用CPU实现会诱导高空开销。我们探索两种优化，通过增加GPU利用率来提高RL数据收集效率：（1）GPU矢量化：在GPU上平行模拟，以增加硬件并行性，以及（2）模拟器内核融合：融合多个模拟步骤，以在单个GPU内核中运行。启动以减少全局内存带宽要求。我们发现，与常用的CPU模拟器相比，GPU矢量化最多可达到$ 1024 \ times $速度。我们介绍了不同实现的性能，并表明，对于简单的模拟器，GPU矢量化的ML编译器实现（XLA）通过将CPU从重复的Python降低到DL Backend API呼叫来优于DNN Framework（Pytorch）$ 13.4 \ times $。我们证明，带有简单模拟器的模拟器内核融合加速度为$ 11.3 \ times $，并且随着模拟器复杂性在内存带宽要求方面的增加，增加了$ 1024 \ times $。我们表明，来自模拟器内核融合的加速度是正交的，可以与GPU矢量化结合，从而导致乘法加速。

translated by 谷歌翻译

MetaDIP: Accelerating Deep Image Prior with Meta Learning

Kevin Zhang , Mingyang Xie , Maharshi Gor , Yi-Ting Chen , Yvonne Zhou , Christopher A. Metzler

分类：计算机视觉

2022-09-18

深图像先验（DIP）是一种最近提出的技术，用于通过将重建图像拟合到未经训练的卷积神经网络的输出中来解决成像反问题。与预处理的前馈神经网络不同，相同的倾角可以概括为任意逆问题，从降级到阶段检索，同时在每个任务下提供竞争性能。DIP的主要缺点是，虽然前馈神经网络可以在单个通行证中重建图像，但DIP必须以大量的计算成本逐渐更新数百到数千个迭代的权重。在这项工作中，我们使用元学习来大规模加速基于倾斜的重建。通过学习浸入权重的适当初始化，我们证明了在一系列逆成像任务中的运行时间有10倍的改善。此外，我们证明了一个经过训练以快速重建面孔的网络也将其推广以重建自然图像贴片。

translated by 谷歌翻译

The Effect of Robot Posture and Idle Motion on Spontaneous Emotional Contagion during Robot-Human Interactions

Isabel Casso , Bing Li , Tatjana Nazir , Yvonne N. Delevoye-Turrell

分类：机器人

2022-09-02

在接下来的十年中，社交机器人将在许多公共场所中实施，以向人类提供服务。我们质疑这些社交机器人的特性，以提供接受和自发的情感互动。更具体地说，在本研究中，我们报告了机器人在与人类参与者面对面互动任务中情绪传染中空闲运动频率的影响。机器人系统的伙伴被编程为采用悲伤的姿势和面部表情，同时讲述了三个悲伤的故事，并以低，中和高频向上/向下移动头部。每个参与者（n = 15）被邀请坐在好友面前，听故事。使用3D运动捕获系统（质量）记录了人类参与者姿势的无意识变化。结果表明，在高频试验中，肩膀/躯干在高频试验中的倾斜度更大。当Buddy以缓慢的频率移动时，自发运动的数量也更大。当两个人从事社交互动时，这些发现与实验心理学报道的结果相呼应。在Godspeed问卷中获得的分数进一步表明，当Buddy移动缓慢时，可能会发生情绪传染，因为机器人系统被认为是更自然和知识渊博的，例如，以速度与表达的情感相干。我们的工作探讨了机器人系统概念中身体姿势和空闲运动频率的重要性。这样的补充可以提供社交机器人，这些机器人在轻松的机器人人类协作任务中提供情感传染。

translated by 谷歌翻译

HTML版本

Safe Human-Robot Collaborative Transportation via Trust-Driven Role Adaptation

Tony Zheng , Monimoy Bujarbaruah , Yvonne R. Stürz , Francesco Borrelli

分类：机器人

2022-07-13

我们在存在障碍的情况下研究人类机器人协作运输任务。每个代理的任务是将刚性对象携带到共同的目标位置，同时避免障碍并满足其他代理的合规性和驱动约束。人类和机器人不具有对环境的当地观点。当人类政策根据机器人对环境的看法而安全，或者积极领导任务时，可以协助机器人。使用估计的人类输入，机器人通过解决有限的时间最佳控制问题来计划运输对象的轨迹。机器人上的传感器测量人类应用的输入。然后，机器人适当地应用了人类应用的加权组合及其自身计划的投入，其中根据机器人对人类投入的估计的信任价值选择权重。这允许在整个任务中对机器人进行动态领导者的角色调整。此外，在信任的价值较低的情况下，如果机器人遇到了人类可能未知的任何障碍，它会触发安全的停止政策，保持系统的安全性并发出对人类意图所需的更改。通过实验结果，我们证明了所提出的方法的功效。

translated by 谷歌翻译

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

David Ifeoluwa Adelani , Jesujoba Oluwadara Alabi , Angela Fan , Julia Kreutzer , Xiaoyu Shen , Machel Reid , Dana Ruiter , Dietrich Klakow , Peter Nabende , Ernie Chang

分类：自然语言处理

2022-05-04

语言模型预训练的最新进展利用大规模数据集创建多语言模型。但是，这些数据集中大多遗漏了低资源语言。这主要是因为网络上没有很好地表示口语，因此被排除在用于创建数据集的大规模爬网中。此外，这些模型的下游用户仅限于最初选择用于预训练的语言的选择。这项工作调查了如何最佳利用现有的预培训模型来为16种非洲语言创建低资源翻译系统。我们关注两个问题：1）如何将预训练的模型用于初始预培训中未包含的语言？ 2）生成的翻译模型如何有效地转移到新域？为了回答这些问题，我们创建了一个新的非洲新闻语料库，涵盖16种语言，其中8种语言不属于任何现有评估数据集的一部分。我们证明，将两种语言转移到其他语言和其他领域的最有效策略是，以少量的高质量翻译数据微调大型预训练模型。

translated by 谷歌翻译

Safety assurance of an industrial robotic control system using hardware/software co-verification

Yvonne Murray , Martin Sirevåg , Pedro Ribeiro , David A. Anisi , Morten Mossige

分类：机器人

2021-12-22

作为工业机器人的一般趋势，正在开发或重新设计的安全功能越来越多的安全功能，而不是通过安全继电器或互锁电路等物理硬件处理。这一趋势强化了补充基于传统，基于输入的测试和质量手术的重要性，这些测试和质量程序在今天广泛应用于行业，具有正式的验证和模型检查方法。为此，本文侧重于ABB工业涂料机器人中的代表性安全关键系统，即高压静电控制系统（HVC）。 HVC产生的高压的实际收敛性，对于安全操作必不可少，使用新颖的和一般共同验证框架正式验证，其中硬件和软件模型通过平台映射相关。这种方法使得具有高度多样化和专业的工具的务实组合。本文的主要贡献包括有关如何在工具之间传输硬件抽象和验证结果的详细信息，以便验证系统级安全性。值得注意的是，本文中考虑的HVC应用程序具有相当通用的反馈控制器形式。因此，这里报告的共同验证框架和经验对跟踪设定值引用的任何网络物理系统也非常相关。

translated by 谷歌翻译

Robust Data-Driven Predictive Control using Reachability Analysis

Amr Alanwar , Yvonne Stürz , Karl Henrik Johansson

分类：机器学习

2021-03-25

我们为具有有界过程和测量噪声的未知线性系统模型提供了一种强大的数据驱动控制方案。不取决于传统预测控制中的系统模型，提出了利用数据驱动的可达区域的控制器。数据驱动的可到达区域基于矩阵Zonotope递归，并且基于仅系统的轨迹的噪声输入输出数据来计算。我们假设测量和过程噪声包含在有界集中。虽然我们承担了这些界限的知识，但假设了关于噪声的统计特性的知识。在无噪声情况下，我们证明所呈现的纯粹数据驱动的控制方案导致等效的闭环行为到标称模型预测控制方案。在测量和过程噪声的情况下，我们提出的方案保证了强大的约束满足感，这在安全关键型应用中至关重要。数值实验表明了所提出的数据驱动控制器与基于模型的控制方案相比的有效性。

translated by 谷歌翻译

Backdoor Attacks Against Dataset Distillation

Yugeng Liu , Zheng Li , Michael Backes , Yun Shen , Yang Zhang

分类：机器学习

2023-01-03

Dataset distillation has emerged as a prominent technique to improve data efficiency when training machine learning models. It encapsulates the knowledge from a large dataset into a smaller synthetic dataset. A model trained on this smaller distilled dataset can attain comparable performance to a model trained on the original training dataset. However, the existing dataset distillation techniques mainly aim at achieving the best trade-off between resource usage efficiency and model utility. The security risks stemming from them have not been explored. This study performs the first backdoor attack against the models trained on the data distilled by dataset distillation models in the image domain. Concretely, we inject triggers into the synthetic data during the distillation procedure rather than during the model training stage, where all previous attacks are performed. We propose two types of backdoor attacks, namely NAIVEATTACK and DOORPING. NAIVEATTACK simply adds triggers to the raw data at the initial distillation phase, while DOORPING iteratively updates the triggers during the entire distillation procedure. We conduct extensive evaluations on multiple datasets, architectures, and dataset distillation techniques. Empirical evaluation shows that NAIVEATTACK achieves decent attack success rate (ASR) scores in some cases, while DOORPING reaches higher ASR scores (close to 1.0) in all cases. Furthermore, we conduct a comprehensive ablation study to analyze the factors that may affect the attack performance. Finally, we evaluate multiple defense mechanisms against our backdoor attacks and show that our attacks can practically circumvent these defense mechanisms.

translated by 谷歌翻译

PMT-IQA: Progressive Multi-task Learning for Blind Image Quality Assessment

Qingyi Pan , Ning Guo , Letu Qingge , Jingyi Zhang , Pei Yang

分类：计算机视觉

2023-01-03

Blind image quality assessment (BIQA) remains challenging due to the diversity of distortion and image content variation, which complicate the distortion patterns crossing different scales and aggravate the difficulty of the regression problem for BIQA. However, existing BIQA methods often fail to consider multi-scale distortion patterns and image content, and little research has been done on learning strategies to make the regression model produce better performance. In this paper, we propose a simple yet effective Progressive Multi-Task Image Quality Assessment (PMT-IQA) model, which contains a multi-scale feature extraction module (MS) and a progressive multi-task learning module (PMT), to help the model learn complex distortion patterns and better optimize the regression issue to align with the law of human learning process from easy to hard. To verify the effectiveness of the proposed PMT-IQA model, we conduct experiments on four widely used public datasets, and the experimental results indicate that the performance of PMT-IQA is superior to the comparison approaches, and both MS and PMT modules improve the model's performance.

translated by 谷歌翻译

MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

Shuhao Shi , Kai Qiao , Jian Chen , Shuai Yang , Jie Yang , Baojie Song , Linyuan Wang , Bin Yan

分类：计算机视觉

2023-01-03

The development of social media user stance detection and bot detection methods rely heavily on large-scale and high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built based on the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain and user tweet features as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when introducing multiple relations. By analyzing experiment results, we identify effective approaches for account detection and provide potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.

translated by 谷歌翻译