In the context of modern environmental and societal concerns, there is an increasing demand for methods able to identify management strategies for civil engineering systems that minimize structural failure risks while optimally planning inspection and maintenance (I&M) processes. Most available methods simplify the I&M decision problem to the component level due to the computational complexity associated with global optimization approaches under joint system-level state descriptions. In this paper, we propose an efficient algorithmic framework for inference and decision-making under uncertainty for engineering systems exposed to deteriorating environments, providing optimal management strategies directly at the system level. In our approach, the decision problem is formulated as a partially observable Markov decision process, whose dynamics are encoded in Bayesian network conditional structures. The methodology can handle environments under equal or general, unequal deterioration correlations among components, through Gaussian hierarchical structures and dynamic Bayesian networks. In terms of policy optimization, we adopt a deep decentralized multi-agent actor-critic (DDMAC) reinforcement learning approach, in which the policies are approximated by actor neural networks guided by a critic network. By including deterioration dependence in the simulated environment, and by formulating the cost model at the system level, DDMAC policies intrinsically consider the underlying system effects. This is demonstrated through numerical experiments conducted on a 9-component system and a steel frame subject to fatigue deterioration. Results show that DDMAC policies offer substantial benefits compared with state-of-the-art heuristic approaches. The inherent consideration of system effects by DDMAC policies can also be interpreted from the learned policies.
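To make the DDMAC structure concrete, here is a minimal sketch on an invented toy system: each component gets its own softmax policy ("actor"), a shared linear value function ("critic") supplies a single TD-error signal that guides every actor, and the cost model is defined at the system level (repair costs plus a series-system failure penalty). The environment, cost figures, deterioration rates, and linear approximators are all illustrative stand-ins for the paper's neural networks, not its implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
K, S, A = 3, 4, 2        # components, damage states (S-1 = failed), actions

def step(state, actions):
    """One year of deterioration; action 1 repairs, 0 does nothing."""
    nxt = state.copy()
    for k in range(K):
        if actions[k] == 1:
            nxt[k] = 0                                   # repair resets damage
        elif nxt[k] < S - 1 and rng.random() < 0.3:
            nxt[k] += 1                                  # stochastic worsening
    # System-level cost: repair costs plus a series-system failure penalty.
    cost = 10.0 * actions.sum() + (100.0 if (nxt == S - 1).any() else 0.0)
    return nxt, -cost                                    # reward = negative cost

def features(state):
    """One-hot encoding of the joint component states."""
    f = np.zeros(K * S)
    f[np.arange(K) * S + state] = 1.0
    return f

actors = [np.zeros((A, K * S)) for _ in range(K)]        # one actor per component
critic = np.zeros(K * S)                                 # shared linear critic

def policy(k, f):
    logits = actors[k] @ f
    p = np.exp(logits - logits.max())
    return p / p.sum()

alpha, beta, gamma = 0.005, 0.005, 0.95
for episode in range(200):
    state = np.zeros(K, dtype=int)
    for t in range(30):
        f = features(state)
        probs = [policy(k, f) for k in range(K)]
        acts = np.array([rng.choice(A, p=probs[k]) for k in range(K)])
        state, r = step(state, acts)
        f2 = features(state)
        td = r + gamma * critic @ f2 - critic @ f        # one TD error guides all
        critic = critic + beta * td * f
        for k in range(K):                               # decentralized PG step
            grad = -probs[k][:, None] * f[None, :]
            grad[acts[k]] += f
            actors[k] += alpha * td * grad
```

Because the penalty fires whenever any component fails, every actor's gradient carries the system effect even though each actor only chooses its own component's action.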
The pre-training of large language models usually requires massive amounts of resources, both in terms of computation and data. Frequently used web sources such as Common Crawl might contain enough noise to make this pre-training sub-optimal. In this work, we experiment with different sampling methods from the Spanish version of mC4, and propose a novel data-centric technique which we name $\textit{perplexity sampling}$, which enables the pre-training of language models in roughly half the number of steps and using one fifth of the data. The resulting models are comparable to the current state of the art, and even achieve better results for certain tasks. Our work is proof of the versatility of Transformers, and paves the way for small teams to train models on a limited budget. Our models are available at this $\href{https://huggingface.co/bertin-project}{URL}$.
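The core idea of perplexity sampling can be sketched in a few lines: score each document's perplexity under a reference model, then keep documents with a probability that peaks around the middle of the perplexity distribution, downsampling both boilerplate (too easy) and noise (too hard). The paper's exact weighting functions and reference model differ; everything below (the add-one unigram stand-in, the `spread` parameter) is invented for illustration.

```python
import math
import random
from collections import Counter

def unigram_perplexity(doc, counts, total, vocab):
    # Stand-in for the real reference LM: add-one-smoothed unigram perplexity.
    tokens = doc.split()
    logp = sum(math.log((counts[t] + 1) / (total + vocab)) for t in tokens)
    return math.exp(-logp / max(len(tokens), 1))

def perplexity_sample(docs, rng, spread=0.5):
    """Keep each document with probability peaked around the median perplexity."""
    counts = Counter(t for d in docs for t in d.split())
    total, vocab = sum(counts.values()), len(counts)
    ppl = [unigram_perplexity(d, counts, total, vocab) for d in docs]
    med = sorted(ppl)[len(ppl) // 2]
    kept = []
    for d, p in zip(docs, ppl):
        # Documents that are too "easy" (boilerplate) or too "hard" (noise)
        # are downsampled; mid-perplexity text is kept most often.
        w = math.exp(-((p - med) ** 2) / (2 * (spread * med) ** 2))
        if rng.random() < w:
            kept.append(d)
    return kept

corpus = ["the cat sat on the mat", "the the the the the",
          "zq xv jkl pn wr", "a dog ran in the park"]
sample = perplexity_sample(corpus, random.Random(0))
```

The sampling is data-centric in the sense that only the corpus changes; the model and training loop are untouched.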
The Cosmology and Astrophysics with Machine Learning Simulations (CAMELS) project was developed to combine cosmology with astrophysics through thousands of cosmological hydrodynamic simulations and machine learning. CAMELS contains 4,233 cosmological simulations, 2,049 N-body and 2,184 state-of-the-art hydrodynamic simulations, sampling a vast volume in parameter space. In this paper, we present the CAMELS public data release, describing the characteristics of the CAMELS simulations and the various data products generated from them, including halo, subhalo, galaxy, and void catalogues, power spectra, bispectra, Lyman-$\alpha$ spectra, probability distribution functions, halo radial profiles, and X-ray photon lists. We also release catalogues containing billions of galaxies from CAMELS-SAM: a large collection of N-body simulations combined with the Santa Cruz semi-analytic model. We release all the data, comprising more than 350 terabytes and including 143,922 snapshots, millions of halos, galaxies, and summary statistics. We provide further technical details on how to access, download, read, and process the data at \url{https://camels.readthedocs.io}.
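Among the released data products are power spectra; as an illustration of the kind of summary statistic involved, here is a minimal monopole power-spectrum estimator for a gridded overdensity field. The box size and grid are placeholders, and a real analysis would load a CAMELS snapshot rather than a random Gaussian field:

```python
import numpy as np

def power_spectrum(delta, box_size, n_bins=16):
    """Monopole power spectrum of an overdensity grid via FFT + spherical binning."""
    n = delta.shape[0]
    dk = np.fft.fftn(delta) * (box_size / n) ** 3      # discrete -> continuum FT
    pk3d = np.abs(dk) ** 2 / box_size ** 3             # P(k) estimator per mode
    freqs = 2 * np.pi * np.fft.fftfreq(n, d=box_size / n)
    kx, ky, kz = np.meshgrid(freqs, freqs, freqs, indexing="ij")
    kmag = np.sqrt(kx ** 2 + ky ** 2 + kz ** 2).ravel()
    pk = pk3d.ravel()
    bins = np.linspace(kmag[kmag > 0].min(), kmag.max(), n_bins + 1)
    which = np.digitize(kmag, bins)                    # k = 0 mode falls outside
    k_cen, p_mean = [], []
    for b in range(1, n_bins + 1):                     # average modes per shell
        m = which == b
        if m.any():
            k_cen.append(kmag[m].mean())
            p_mean.append(pk[m].mean())
    return np.array(k_cen), np.array(p_mean)

rng = np.random.default_rng(1)
delta = rng.normal(size=(32, 32, 32))                  # white-noise placeholder
k, p = power_spectrum(delta, box_size=25.0)            # CAMELS-like box in Mpc/h
```

For a white-noise field the estimator should return a roughly flat spectrum, which makes it easy to sanity-check before pointing it at real snapshot data.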
Medical AI has tremendous potential to advance healthcare by supporting evidence-based medical practice, personalizing patient treatment, reducing costs, and improving provider and patient experience. We argue that unlocking this potential requires a systematic way to measure the performance of medical AI models on large-scale heterogeneous data. To meet this need, we are building MedPerf, an open framework for benchmarking machine learning in the medical domain. MedPerf will enable federated evaluation, in which models are securely distributed to different evaluation facilities, thus empowering healthcare organizations to assess and verify the performance of AI models in an efficient and human-supervised process, while prioritizing privacy. We describe the current challenges healthcare and AI communities face, the need for an open platform, the design philosophy of MedPerf, its current implementation status, and our roadmap. We call for researchers and organizations to join us in creating the MedPerf open benchmarking platform.
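The federated-evaluation idea can be sketched in a few lines: the model travels to each facility, the records stay put, and only aggregate metrics are returned for pooling. Everything below (site names, the toy model, the accuracy metric) is invented for illustration and is not the MedPerf API:

```python
def evaluate_on_site(model, local_records):
    """Run evaluation inside one facility; only aggregate metrics leave the site."""
    correct = sum(model(x) == y for x, y in local_records)
    return {"n": len(local_records), "accuracy": correct / len(local_records)}

def federated_benchmark(model, sites):
    """Collect per-site reports and pool them, weighted by sample count."""
    reports = {name: evaluate_on_site(model, recs) for name, recs in sites.items()}
    total = sum(r["n"] for r in reports.values())
    pooled = sum(r["accuracy"] * r["n"] for r in reports.values()) / total
    return reports, pooled

# Two hypothetical "facilities" whose records are never pooled centrally:
model = lambda x: int(x > 0.5)
sites = {
    "hospital_a": [(0.9, 1), (0.2, 0), (0.7, 0)],
    "hospital_b": [(0.4, 0), (0.8, 1)],
}
reports, pooled = federated_benchmark(model, sites)
```

The privacy property lives in the structure: `federated_benchmark` never sees `local_records`, only the per-site metric dictionaries.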
Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing their relative performance on a large suite of tasks. Most published results on deep RL benchmarks compare point estimates of aggregate performance, such as mean and median scores across tasks, ignoring the statistical uncertainty implied by the use of a finite number of training runs. Beginning with the Arcade Learning Environment (ALE), the shift towards computationally demanding benchmarks has led to the practice of evaluating only a small number of runs per task, exacerbating the statistical uncertainty in point estimates. In this paper, we argue that reliable evaluation in the few-run deep RL regime cannot ignore the uncertainty in results without running the risk of slowing down progress in the field. We illustrate this point using a case study on the Atari 100k benchmark, where we find substantial discrepancies between conclusions drawn from point estimates alone and those from a more thorough statistical analysis. With the aim of increasing the field's confidence in results reported with a handful of runs, we advocate reporting interval estimates of aggregate performance and propose performance profiles to account for the variability in results, as well as more robust and efficient aggregate metrics, such as interquartile mean scores, that achieve small uncertainty in results. Using such statistical tools, we scrutinize performance evaluations of existing algorithms on other widely used RL benchmarks, including the ALE, Procgen, and the DeepMind Control Suite, again revealing discrepancies in prior comparisons. Our findings call for a change in how we evaluate performance in deep RL, for which we present a more rigorous evaluation methodology, accompanied by an open-source library, to prevent unreliable results from stagnating the field.
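The proposed aggregate metrics are straightforward to compute. Below is a sketch of the interquartile mean together with a stratified-bootstrap interval estimate over a runs-by-tasks score matrix; the accompanying open-source library provides the canonical implementations, and the score data here is synthetic:

```python
import numpy as np

def iqm(scores):
    """Interquartile mean: mean of the middle 50% of scores (robust aggregate)."""
    x = np.sort(scores, axis=None)
    n = len(x)
    return x[n // 4 : n - n // 4].mean()

def stratified_bootstrap_ci(score_matrix, n_boot=2000, alpha=0.05, seed=0):
    """95% CI for the IQM over (runs x tasks) scores, resampling runs per task."""
    rng = np.random.default_rng(seed)
    n_runs, n_tasks = score_matrix.shape
    stats = []
    for _ in range(n_boot):
        # Stratified: resample runs independently within each task (column).
        idx = rng.integers(0, n_runs, size=(n_runs, n_tasks))
        stats.append(iqm(np.take_along_axis(score_matrix, idx, axis=0)))
    lo, hi = np.quantile(stats, [alpha / 2, 1 - alpha / 2])
    return lo, hi

# Usage: 5 runs on 10 tasks -- the few-run regime the paper warns about.
rng = np.random.default_rng(42)
scores = rng.normal(loc=1.0, scale=0.5, size=(5, 10))
point = iqm(scores)
lo, hi = stratified_bootstrap_ci(scores)
```

Reporting `(point, lo, hi)` rather than `point` alone is the change in practice the abstract advocates.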
Civil and maritime engineering systems, ranging from bridges to offshore platforms and wind turbines, must be efficiently managed, as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with the main objective of efficiently controlling the risk associated with structural failures. Addressing this complexity, risk-based inspection planning methodologies, supported typically by dynamic Bayesian networks, evaluate a set of pre-defined heuristic decision rules to reasonably simplify the decision problem. However, the resulting policies may be compromised by the limited space considered in the definition of the decision rules. Avoiding this limitation, partially observable Markov decision processes (POMDPs) provide a principled mathematical methodology for stochastic optimal control under uncertain action outcomes and observations, in which the optimal actions are prescribed as a function of the entire, dynamically updated, state probability distribution. In this paper, we combine dynamic Bayesian networks with POMDPs in a joint framework for optimal inspection and maintenance planning, and we provide the formulation for developing both infinite- and finite-horizon POMDPs in a structural reliability context. The proposed methodology is implemented and tested for the case of a structural component subject to fatigue deterioration, demonstrating the capability of state-of-the-art POMDP solvers for solving the underlying planning optimization problem. In the numerical experiments, POMDP- and heuristic-based policies are thoroughly compared, and the results showcase that POMDP-based policies achieve substantially lower costs than their counterparts corresponding to traditional problem settings.
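The belief-state machinery underlying this framework reduces to two linear-algebra steps: a prediction step through the deterioration transition model, as encoded in the dynamic Bayesian network, and a Bayesian update on the inspection outcome. A minimal sketch, with transition and observation matrices invented for illustration:

```python
import numpy as np

# Damage states: 0 = intact ... 3 = failed. Rows of T encode one-step deterioration.
T = np.array([[0.9, 0.1, 0.0, 0.0],
              [0.0, 0.8, 0.2, 0.0],
              [0.0, 0.0, 0.7, 0.3],
              [0.0, 0.0, 0.0, 1.0]])

# Imperfect inspection: P(indication | state); columns: no-detection, detection.
# More damaged states are detected with increasing probability.
O = np.array([[0.9, 0.1],
              [0.7, 0.3],
              [0.3, 0.7],
              [0.1, 0.9]])

def predict(belief):
    """Prior update (one deterioration step), as in the dynamic Bayesian network."""
    return belief @ T

def bayes_update(belief, obs):
    """Posterior update after an inspection outcome obs in {0, 1}."""
    post = belief * O[:, obs]
    return post / post.sum()

b = np.array([1.0, 0.0, 0.0, 0.0])       # new component: surely intact
for year in range(5):
    b = predict(b)                        # five years of deterioration
b_post = bayes_update(b, obs=1)           # inspection indicates damage
```

A POMDP policy then maps the full distribution `b_post`, rather than a point estimate of the state, to the next inspection or maintenance action.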
While the capabilities of autonomous systems have been steadily improving in recent years, these systems still struggle to rapidly explore previously unknown environments without the aid of GPS-assisted navigation. The DARPA Subterranean (SubT) Challenge aimed to fast track the development of autonomous exploration systems by evaluating their performance in real-world underground search-and-rescue scenarios. Subterranean environments present a plethora of challenges for robotic systems, such as limited communications, complex topology, visually-degraded sensing, and harsh terrain. The presented solution enables long-term autonomy with minimal human supervision by combining a powerful and independent single-agent autonomy stack with higher-level mission management operating over a flexible mesh network. The autonomy suite deployed on quadruped and wheeled robots was fully independent, freeing the human supervisor to loosely monitor the mission and make high-impact strategic decisions. We also discuss lessons learned from fielding our system at the SubT Final Event, relating to vehicle versatility, system adaptability, and re-configurable communications.
Attention mechanisms form a core component of several successful deep learning architectures, and are based on one key idea: ''The output depends only on a small (but unknown) segment of the input.'' In several practical applications like image captioning and language translation, this is mostly true. In trained models with an attention mechanism, the outputs of an intermediate module that encodes the segment of input responsible for the output are often used as a way to peek into the `reasoning` of the network. We make such a notion more precise for a variant of the classification problem that we term selective dependence classification (SDC) when used with attention model architectures. Under such a setting, we demonstrate various error modes where an attention model can be accurate but fail to be interpretable, and show that such models do occur as a result of training. We illustrate various situations that can accentuate and mitigate this behaviour. Finally, we use our objective definition of interpretability for SDC tasks to evaluate a few attention model learning algorithms designed to encourage sparsity and demonstrate that these algorithms help improve interpretability.
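A minimal sketch of the setting: single-query dot-product attention over input segments, where the attention weights are the quantity commonly read as the network's "reasoning". The keys, values, and query below are toy placeholders chosen so the weights clearly pick out one segment; the abstract's point is precisely that this reading and the model's true dependence need not agree:

```python
import numpy as np

def attention(query, keys, values):
    """Single-query scaled dot-product attention over input segments."""
    d = keys.shape[1]
    scores = keys @ query / np.sqrt(d)
    w = np.exp(scores - scores.max())
    w = w / w.sum()                      # attention weights over segments
    return w, w @ values                 # the weights double as the 'explanation'

rng = np.random.default_rng(0)
keys = np.eye(5, 8)                      # five input segments, orthogonal keys
values = rng.normal(size=(5, 8))
query = 4.0 * keys[3]                    # query aligned with segment 3
w, out = attention(query, keys, values)
# Under the interpretability reading, argmax(w) names the segment the output
# "depends on"; the error modes in the abstract are cases where an accurate
# model's weights point at the wrong segment.
```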
Recent advances in deep learning have enabled us to address the curse of dimensionality (COD) by solving problems in higher dimensions. A subset of these approaches to addressing the COD has led us to solving high-dimensional PDEs. This has resulted in opening doors to solving a variety of real-world problems ranging from mathematical finance to stochastic control for industrial applications. Although feasible, these deep learning methods are still constrained by training time and memory. Tackling these shortcomings, Tensor Neural Networks (TNN) demonstrate that they can provide significant parameter savings while attaining the same accuracy as the classical Dense Neural Network (DNN). In addition, we also show how TNN can be trained faster than DNN for the same accuracy. Besides TNN, we also introduce Tensor Network Initializer (TNN Init), a weight initialization scheme that leads to faster convergence with smaller variance for an equivalent parameter count as compared to a DNN. We benchmark TNN and TNN Init by applying them to solve the parabolic PDE associated with the Heston model, which is widely used in financial pricing theory.
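The parameter savings can be illustrated with a two-core tensor-train factorization of a dense weight matrix; this is a generic sketch of the mechanism, with invented sizes and ranks, not the paper's architecture:

```python
import numpy as np

rng = np.random.default_rng(0)
m1, m2, n1, n2, r = 8, 8, 8, 8, 4       # a (64 -> 64) layer with TT-rank 4

# Two tensor-train cores stand in for a dense 64x64 weight matrix:
# W[(i1,i2),(j1,j2)] = sum_a G1[i1,j1,a] * G2[a,i2,j2]
G1 = rng.normal(size=(m1, n1, r))
G2 = rng.normal(size=(r, m2, n2))

def tt_apply(x):
    """Apply the factorized weight to x without materializing the full matrix."""
    X = x.reshape(n1, n2)
    t = np.einsum("abk,jk->abj", G2, X)       # contract the n2 index
    y = np.einsum("ija,abj->ib", G1, t)       # contract n1 and the rank index
    return y.reshape(m1 * m2)

# Equivalent dense weight, formed here only to verify the contraction:
W = np.einsum("ija,abk->ibjk", G1, G2).reshape(m1 * m2, n1 * n2)

dense_params = W.size                          # 4096
tt_params = G1.size + G2.size                  # 512: an 8x parameter saving
x = rng.normal(size=n1 * n2)
```

The forward pass never builds `W`, so both memory and parameter count scale with the TT-rank rather than with the full matrix size, which is the source of the savings the abstract reports.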
Artificial neural networks can learn complex, salient data features to achieve a given task. On the opposite end of the spectrum, mathematically grounded methods such as topological data analysis allow users to design analysis pipelines fully aware of data constraints and symmetries. We introduce a class of persistence-based neural network layers. Persistence-based layers allow the users to easily inject knowledge about symmetries (equivariance) respected by the data, are equipped with learnable weights, and can be composed with state-of-the-art neural architectures.
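Persistence-based layers build on the persistence computations of topological data analysis. As a minimal, dependency-free illustration of the underlying object (not the paper's layers), here is 0-dimensional sublevel-set persistence of a function sampled on a path graph, computed with union-find: components are born at local minima and the younger component dies at each merge.

```python
def persistence_pairs(values):
    """0-dimensional sublevel-set persistence pairs of a function on a path graph."""
    n = len(values)
    parent = list(range(n))
    birth = list(values)                 # birth value tracked at each root
    processed = [False] * n

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]    # path halving
            i = parent[i]
        return i

    pairs = []
    for i in sorted(range(n), key=values.__getitem__):   # sweep by function value
        nbrs = {find(j) for j in (i - 1, i + 1) if 0 <= j < n and processed[j]}
        if nbrs:
            root = min(nbrs, key=birth.__getitem__)      # oldest component survives
            parent[i] = root
            for c in nbrs - {root}:
                pairs.append((birth[c], values[i]))      # younger component dies
                parent[c] = root
        # else: local minimum, a new component is born at values[i]
        processed[i] = True
    return pairs   # the oldest component never dies (essential class)

pairs = persistence_pairs([1.0, 3.0, 0.0, 2.0])
```

A persistence-based layer would turn such (birth, death) pairs into differentiable features; the pairing itself is equivariant under value-preserving symmetries of the input, which is the kind of structural knowledge the abstract describes injecting.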