智能论文笔记

SpectraNet: Learned Recognition of Artificial Satellites From High Contrast Spectroscopic Imagery

J. Zachary Gazak , Ian McQuaid , Ryan Swindle , Matthew Phelps , Justin Fletcher

分类：机器学习

2022-01-10

有效的空间交通管理需要积极识别人造卫星。从观察到的数据中提取对象识别的当前方法需要空间分辨的图像，其限制对低地球轨道中的对象的标识。然而，大多数人造卫星在地球静止轨道上运行在距离的距离中，禁止基于地面的观察者解析空间信息。本文演示了一种物体识别解决方案，利用修改的残余卷积神经网络将远程不变光谱数据映射到对象标识。我们报告了模拟64级卫星问题超过80％的分类精度 - 即使在卫星正在进行恒定，随机重新定位的情况下。由这些结果驱动的天文观察活动，九级问题的精度为72％，平均每类的100个示例，按照模拟预期执行。我们展示了通过辍学，随机重量平均（SWA）和SWA集中的分层贝叶斯推断的应用，以测量空间交通管理中的分类不确定性 - 临界部件，其中日常决策昂贵的空间资产并承担地缘政治后果。

translated by 谷歌翻译

System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

Indranil Sur , Zachary Daniels , Abrar Rahman , Kamil Faber , Gianmarco J. Gallardo , Tyler L. Hayes , Cameron E. Taylor , Mustafa Burak Gurbuz , James Smith , Sahana Joshi

分类：机器学习 | 人工智能

2022-12-08

As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems-level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently-developed LL components can be integrated into a single realized system. We also introduce an evaluation environment in order to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment.

translated by 谷歌翻译

3D Neural Field Generation using Triplane Diffusion

J. Ryan Shue , Eric Ryan Chan , Ryan Po , Zachary Ankner , Jiajun Wu , Gordon Wetzstein

分类：计算机视觉 | 人工智能

2022-11-30

Diffusion models have emerged as the state-of-the-art for image generation, among other tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural fields. Our approach pre-processes training data, such as ShapeNet meshes, by converting them to continuous occupancy fields and factoring them into a set of axis-aligned triplane feature representations. Thus, our 3D training scenes are all represented by 2D feature planes, and we can directly train existing 2D diffusion models on these representations to generate 3D neural fields with high quality and diversity, outperforming alternative approaches to 3D-aware generation. Our approach requires essential modifications to existing triplane factorization pipelines to make the resulting features easy to learn for the diffusion model. We demonstrate state-of-the-art results on 3D generation on several object classes from ShapeNet.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Optimality Guarantees for Particle Belief Approximation of POMDPs

Michael H. Lim , Tyler J. Becker , Mykel J. Kochenderfer , Claire J. Tomlin , Zachary N. Sunberg

分类：人工智能 | 机器人 | (统计)机器学习

2022-10-10

Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood weighting have shown practical effectiveness, a general theory characterizing the approximation error of the particle filtering techniques that these algorithms use has not previously been proposed. Our main contribution is bounding the error between any POMDP and its corresponding finite sample particle belief MDP (PB-MDP) approximation. This fundamental bridge between PB-MDPs and POMDPs allows us to adapt any sampling-based MDP algorithm to a POMDP by solving the corresponding particle belief MDP, thereby extending the convergence guarantees of the MDP algorithm to the POMDP. Practically, this is implemented by using the particle filter belief transition model as the generative model for the MDP solver. While this requires access to the observation density model from the POMDP, it only increases the transition sampling complexity of the MDP solver by a factor of $\mathcal{O}(C)$, where $C$ is the number of particles. Thus, when combined with sparse sampling MDP algorithms, this approach can yield algorithms for POMDPs that have no direct theoretical dependence on the size of the state and observation spaces. In addition to our theoretical contribution, we perform five numerical experiments on benchmark POMDPs to demonstrate that a simple MDP algorithm adapted using PB-MDP approximation, Sparse-PFT, achieves performance competitive with other leading continuous observation POMDP solvers.

translated by 谷歌翻译

Learning Clinical Concepts for Predicting Risk of Progression to Severe COVID-19

Helen Zhou , Cheng Cheng , Kelly J. Shields , Gursimran Kochhar , Tariq Cheema , Zachary C. Lipton , Jeremy C. Weiss

分类：机器学习 | (统计)机器学习

2022-08-28

随着COVID-19现在普遍存在，对高危个体的识别至关重要。利用来自宾夕法尼亚州西南部主要医疗保健提供者的数据，我们开发了预测严重Covid-19进展的生存模型。在这项工作中，我们在依赖许多功能的更准确模型和依赖一些与临床医生直觉相一致的功能的模型之间面临一个权衡。使事情变得复杂，许多EHR功能往往较低，从而降低了较小模型的准确性。在这项研究中，我们开发了两组高性能风险评分：（i）由所有可用功能构建的无约束模型；（ii）在训练风险预测因子之前，在培训风险预测因子之前就学习一小部分临床概念的管道。学到的概念提高了相应特征（C-Index 0.858 vs. 0.844）的性能，并在评估样本外（随后的时间段）时证明了（i）的改进。我们的模型表现优于先前的工作（C-Index 0.844-0.872 vs. 0.598-0.810）。

translated by 谷歌翻译

The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Jennifer J. Sun , Andrew Ulmer , Dipam Chakraborty , Brian Geuther , Edward Hayes , Heng Jia , Vivek Kumar , Zachary Partridge , Alice Robie , Catherine E. Schretter

分类：机器学习 | 人工智能 | 计算机视觉

2022-07-21

现实世界的行为通常是由多种代理之间复杂的相互作用来塑造的。为了可靠地研究多代理行为，无监督和自我监督的学习的进步使从轨迹数据中学到了各种不同的行为表示。迄今为止，还没有一组统一的基准测试，可以在广泛的行为分析设置中进行定量和系统地比较方法。我们的目的是通过引入来自现实世界行为神经科学实验的大规模，多代理轨迹数据集来解决这一问题，该数据集涵盖了一系列行为分析任务。我们的数据集由来自通用模型生物的轨迹数据组成，其中有960万帧的小鼠数据和440万帧的飞行数据，在各种实验环境中，例如不同的菌株，相互作用的长度和光遗传学刺激。框架的子集还包括专家注销的行为标签。我们数据集的改进对应于跨多种生物的行为表示，并能够捕获常见行为分析任务的差异。

translated by 谷歌翻译

Plex: Towards Reliability using Pretrained Large Model Extensions

Dustin Tran , Jeremiah Liu , Michael W. Dusenberry , Du Phan , Mark Collier , Jie Ren , Kehang Han , Zi Wang , Zelda Mariet , Huiyi Hu

分类：机器学习 | (统计)机器学习

2022-07-15

人工智能的最新趋势是将验证的模型用于语言和视觉任务，这些模型已经实现了非凡的表现，但也令人困惑。因此，以各种方式探索这些模型的能力对该领域至关重要。在本文中，我们探讨了模型的可靠性，在其中我们将可靠的模型定义为一个不仅可以实现强大的预测性能，而且在许多涉及不确定性（例如选择性预测，开放式设置识别）的决策任务上，在许多决策任务上表现出色，而且表现良好。强大的概括（例如，准确性和适当的评分规则，例如在分布数据集中和分发数据集上的对数可能性）和适应性（例如，主动学习，几乎没有射击不确定性）。我们设计了40个数据集的10种任务类型，以评估视觉和语言域上可靠性的不同方面。为了提高可靠性，我们分别开发了VIT-PLEX和T5-PLEX，分别针对视觉和语言方式扩展了大型模型。 PLEX极大地改善了跨可靠性任务的最先进，并简化了传统协议，因为它可以改善开箱即用的性能，并且不需要设计分数或为每个任务调整模型。我们演示了高达1B参数的模型尺寸的缩放效果，并预处理数据集大小最多4B示例。我们还展示了PLEX在具有挑战性的任务上的功能，包括零射门的开放式识别，主动学习和对话语言理解中的不确定性。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译

Dojo: A Differentiable Physics Engine for Robotics

Taylor A. Howell , Simon Le Cleac'h , J. Zico Kolter , Mac Schwager , Zachary Manchester

分类：机器人

2022-03-02

我们提出了Dojo，这是一种用于机器人技术的可区分物理引擎，优先考虑稳定的模拟，准确的接触物理学以及相对于状态，动作和系统参数的可不同性。Dojo在低样本速率下实现稳定的模拟，并通过使用变异积分器来节省能量和动量。非线性互补性问题，具有用于摩擦的二阶锥体，模型硬接触，并使用自定义的Primal Dual内部点法可靠地解决。使用隐式功能定理利用内点方法的特殊属性，以有效计算通过接触事件提供有用信息的光滑梯度。我们展示了Dojo独特的模拟紧密接触能力，同时提供了许多示例，包括轨迹优化，强化学习和系统识别。

translated by 谷歌翻译