智能论文笔记

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Robust Incremental Smoothing and Mapping (riSAM)

Daniel McGann , John G. Rogers III , Michael Kaess

分类：机器人

2022-09-28

本文提出了一种用于在线增量同时本地化和映射（SLAM）的强大优化方法。由于在存在感知混叠的情况下数据关联的NP硬度，可拖动（大约）数据关联方法将产生错误的测量。我们需要猛烈的后端，在达到在线效率限制的同时，在存在异常值的情况下，可以在存在异常值的情况下将其收敛到准确的解决方案。现有的强大SLAM方法要么对离群值敏感，对初始化越来越敏感，要么无法提供在线效率。我们提出了强大的增量平滑和映射（RISAM）算法，这是一种基于渐变的非跨识别性的稳健后端优化器，用于增量大满贯。我们在基准测试数据集上证明了我们的算法实现在线效率，优于现有的在线方法，并匹配或改善现有的离线方法的性能。

translated by 谷歌翻译

How Does It Feel? Self-Supervised Costmap Learning for Off-Road Vehicle Traversability

Mateo Guaman Castro , Samuel Triest , Wenshan Wang , Jason M. Gregory , Felix Sanchez , John G. Rogers III , Sebastian Scherer

分类：机器人 | 机器学习

2022-09-22

估计越野环境中的地形横穿性需要关于机器人和这些地形之间复杂相互作用动态的推理。但是，建立准确的物理模型，或创建有益的标签来以有监督的方式学习模型是有挑战性的。我们提出了一种方法，该方法通过将外部感受性的环境信息与本体感受性的地形相互作用反馈相结合，以自我监督的方式将遍历性成本映像结合在一起。此外，我们提出了一种将机器人速度纳入Costmap预测管道中的新型方法。我们在具有挑战性的越野地形上，在多个大型，自动的全地形车辆（ATV）上验证了我们的方法，并在单独的大型地面机器人上易于集成。我们的短尺寸导航结果表明，使用我们学到的Costmaps可以使整体航行更顺畅，并为机器人提供了对机器人与不同地形类型（例如草和砾石）之间相互作用的更细粒度的了解。我们的大规模导航试验表明，与基于占用率的导航基线相比，我们可以将干预措施的数量减少多达57％，这是在挑战400 m至3150 m不等的越野课程中。

translated by 谷歌翻译

NAUTS: Negotiation for Adaptation to Unstructured Terrain Surfaces

Sriram Siva , Maggie Wigness , John G. Rogers , Long Quang , Hao Zhang

分类：机器人

2022-07-27

当机器人在具有非结构化地形的现实世界越野环境中运行时，适应其导航政策的能力对于有效且安全的导航至关重要。但是，越野地形为机器人导航带来了一些挑战，包括动态障碍和地形不确定性，导致遍历或导航故障效率低下。为了应对这些挑战，我们通过谈判引入了一种新颖的适应方法，使地面机器人能够通过谈判过程来调整其导航行为。我们的方法首先学习了各种导航政策的预测模型，以充当地形感知的本地控制器和计划者。然后，通过新的谈判过程，我们的方法从与环境的各种政策互动中学习，以在线方式达成最佳政策组合，以使机器人导航适应即时的非结构性越野地形。此外，我们实施了一种新的优化算法，该算法为执行过程中实时实时提供机器人谈判提供了最佳解决方案。实验结果已经验证了我们通过谈判的适应方法优于机器人导航的先前方法，尤其是在看不见和不确定的动态地形上。

translated by 谷歌翻译

A Multi-purpose Real Haze Benchmark with Quantifiable Haze Levels and Ground Truth

Priya Narayanan , Xin Hu , Zhenyu Wu , Matthew D Thielke , John G Rogers , Andre V Harrison , John A D'Agostino , James D Brown , Long P Quang , James R Uplinger

分类：计算机视觉

2022-06-13

由于存在浓烟或阴霾，从室外视觉环境收集的图像通常会降解。在这些退化的视觉环境（DVE）中，在场景理解中进行研究的关键挑战是缺乏代表性的基准数据集。这些数据集需要评估降级设置中的最新对象识别和其他计算机视觉算法。在本文中，我们通过引入带有朦胧和无雾图像的第一个配对的真实图像基准数据集以及原位的雾化密度测量来解决其中的一些限制。该数据集是在受控的环境中生产的，其专业烟雾产生机器覆盖了整个场景，并由从无人机（UAV）（UAV）和无人接地车（UGV）的角度捕获的图像组成。我们还评估了一组代表性的最先进的飞行方法以及数据集中的对象探测器。本文介绍的完整数据集，包括地面真相对象分类框和雾密度测量值，为社区提供了以下网址评估其算法的信息：https：//a2i2-archangel.vision。该数据集的一个子集已用于在CVPR UG2 2022挑战的雾痕中进行对象检测。

translated by 谷歌翻译

WayFAST: Navigation with Predictive Traversability in the Field

Mateus Valverde Gasparino , Arun Narenthiran Sivakumar , Yixiao Liu , Andres Eduardo Baquero Velasquez , Vitor Akihiro Hisano Higuti , John Rogers , Huy Tran , Girish Chowdhary

分类：机器人 | 人工智能 | 计算机视觉 | 机器学习

2022-03-22

我们提出了一种自我监督的方法，用于预测需要良好牵引力才能导航的轮式移动机器人的可穿越路径。我们的算法称为Wayfast（无路线自动驾驶系统用于遍历性），使用RGB和深度数据以及导航经验，自主在室外非结构化环境中自主生成可遍历的路径。我们的主要灵感是，可以使用动力动力学模型估算滚动机器人的牵引力。使用在线退化的视野估计器提供的牵引力估计值，我们能够以自我监督的方式训练遍历性预测神经网络，而无需以前的方法使用的启发式方法。我们通过在各种环境中进行广泛的现场测试来证明Wayfast的有效性，从沙滩到森林檐篷和积雪覆盖的草田不等。我们的结果清楚地表明，Wayfast可以学会避免几何障碍物以及不可传输的地形，例如雪，这很难避免使用仅提供几何数据（例如LiDAR）的传感器。此外，我们表明，基于在线牵引力估计的培训管道比其他基于启发式的方法更有效率。

translated by 谷歌翻译

Self-Reflective Terrain-Aware Robot Adaptation for Consistent Off-Road Ground Navigation

Sriram Siva , Maggie Wigness , John G. Rogers , Long Quang , Hao Zhang

分类：机器人 | 机器学习

2021-11-12

地面机器人需要遍历非结构化和毫无准备的地形的关键能力，避免障碍，以完成灾害响应的现实机器人应用中的完整任务。当机器人在诸如森林等越野场环境中运行时，机器人的实际行为通常与其预期或计划的行为相匹配，这是由于地形和机器人本身的特征的变化。因此，机器人适应对于一致行为生成的能力对于非结构化的越野地形上的可动性至关重要。为了解决挑战，我们提出了一种新颖的自我反思地形感知的方法，用于造型机器人的自我反思，以产生一致的控制，以导航非结构化的越野地形，这使得机器人能够通过机器人自我进行更准确地执行预期行为。反思，同时适应不同的非结构化地形。为了评估我们的方法的性能，我们使用具有各种功能的实际机器人进行广泛的实验，这些功能在不同的非结构化越野地形上变化。综合实验结果表明，我们的自我反光地形感知的适应方法使得地机器人能够产生一致的导航行为，并且优于比较的先前和基线技术。

translated by 谷歌翻译

A Step Towards Efficient Evaluation of Complex Perception Tasks in Simulation

Jonathan Sadeghi , Blaine Rogers , James Gunn , Thomas Saunders , Sina Samangooei , Puneet Kumar Dokania , John Redford

分类：机器学习 | 人工智能 | 计算机视觉 | 机器人

2021-09-28

目前已在表征包含深层的学习模式进行部署到任何安全关键方案之前系统的错误行为越来越感兴趣。然而，表征此行为，通常需要模型，可以对复杂的现实世界的任务极其耗费计算的大规模测试。例如，任务涉及计算密集型对象检测器作为其组成部分之一。在这项工作中，我们提出了一个方法，使使用简化的低高保真模拟器高效的大规模测试和不执行昂贵的深度学习模型的计算成本。我们的方法依赖于设计测试对应的任务的计算密集型部件的高效替代模型。我们通过培训PIXOR和CenterPoint的激光雷达探测器有效的替代模型，同时证明了模拟的精度保持评估在卡拉模拟器减少计算费用的自动驾驶任务的表现证明了我们方法的有效性。

translated by 谷歌翻译

Large Language Models as Corporate Lobbyists

John J. Nay

分类：自然语言处理

2023-01-03

We demonstrate a proof-of-concept of a large language model conducting corporate lobbying related activities. We use an autoregressive large language model (OpenAI's text-davinci-003) to determine if proposed U.S. Congressional bills are relevant to specific public companies and provide explanations and confidence levels. For the bills the model deems as relevant, the model drafts a letter to the sponsor of the bill in an attempt to persuade the congressperson to make changes to the proposed legislation. We use hundreds of ground-truth labels of the relevance of a bill to a company to benchmark the performance of the model, which outperforms the baseline of predicting the most common outcome of irrelevance. However, we test the ability to determine the relevance of a bill with the previous OpenAI GPT-3 model (text-davinci-002), which was state-of-the-art on many language tasks until text-davinci-003 was released on November 28, 2022. The performance of text-davinci-002 is worse than simply always predicting that a bill is irrelevant to a company. These results suggest that, as large language models continue to improve core natural language understanding capabilities, performance on corporate lobbying related tasks will continue to improve. We then discuss why this could be problematic for societal-AI alignment.

translated by 谷歌翻译

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang , David M. Blei , John P. Cunningham

分类： (统计)机器学习 | 机器学习

2023-01-02

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.

translated by 谷歌翻译