智能论文笔记

Measuring the Carbon Intensity of AI in Cloud Instances

Jesse Dodge , Taylor Prewitt , Remi Tachet Des Combes , Erika Odmark , Roy Schwartz , Emma Strubell , Alexandra Sasha Luccioni , Noah A. Smith , Nicole DeCario , Will Buchanan

分类：机器学习

2022-06-10

通过提供前所未有的计算资源访问，云计算能够在机器学习等技术中快速增长，其计算需求产生了高能源成本和相应的碳足迹。结果，最近的奖学金呼吁更好地估计AI的温室气体影响：当今的数据科学家无法轻松或可靠地访问该信息的测量，从而排除了可行策略的发展。向用户提供有关软件碳强度的信息的云提供商是一种基本的垫脚石，以最大程度地减少排放。在本文中，我们提供了一个测量软件碳强度的框架，并建议通过使用每个能量单元使用基于位置和特定时间的边际排放数据来测量运行碳排放。我们为一组自然语言处理和计算机视觉的现代模型提供了操作软件强度的测量，以及各种模型尺寸，包括预处理61亿个参数语言模型。然后，我们评估了一套用于减少Microsoft Azure Cloud Compute平台排放的方法套件：使用不同地理区域中的云实例，在一天中的不同时间使用云实例，并在边际碳强度高于某个阈值时动态暂停云实例。我们证实了先前的结果，即数据中心的地理区域在给定云实例的碳强度中起着重要作用，并发现选择合适的区域可能具有最大的运营排放减少影响。我们还表明，一天中的时间对操作软件碳强度有显着影响。最后，我们最终提出了有关机器学习从业人员如何使用软件碳强度信息来减少环境影响的建议。

translated by 谷歌翻译

Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch

Shangtong Zhang , Remi Tachet , Romain Laroche

分类：机器学习

2021-11-04

在本文中，我们在表格设置中建立了违法演员批评算法的全球最优性和收敛速度，而不使用密度比来校正行为政策的状态分布与目标政策之间的差异。我们的工作超出了现有的工作原理，最佳的策略梯度方法中的现有工作中使用确切的策略渐变来更新策略参数时，我们使用近似和随机更新步骤。我们的更新步骤不是渐变更新，因为我们不使用密度比以纠正状态分布，这与从业者做得好。我们的更新是近似的，因为我们使用学习的评论家而不是真正的价值函数。我们的更新是随机的，因为在每个步骤中，更新仅为当前状态操作对完成。此外，我们在分析中删除了现有作品的几个限制性假设。我们的工作中的核心是基于其均匀收缩性能的时源性Markov链中的通用随机近似算法的有限样本分析。

translated by 谷歌翻译

GraphCast: Learning skillful medium-range global weather forecasting

Remi Lam , Alvaro Sanchez-Gonzalez , Matthew Willson , Peter Wirnsberger , Meire Fortunato , Alexander Pritzel , Suman Ravuri , Timo Ewalds , Ferran Alet , Zach Eaton-Rosen

分类：机器学习

2022-12-24

We introduce a machine-learning (ML)-based weather simulator--called "GraphCast"--which outperforms the most accurate deterministic operational medium-range weather forecasting system in the world, as well as all previous ML baselines. GraphCast is an autoregressive model, based on graph neural networks and a novel high-resolution multi-scale mesh representation, which we trained on historical weather data from the European Centre for Medium-Range Weather Forecasts (ECMWF)'s ERA5 reanalysis archive. It can make 10-day forecasts, at 6-hour time intervals, of five surface variables and six atmospheric variables, each at 37 vertical pressure levels, on a 0.25-degree latitude-longitude grid, which corresponds to roughly 25 x 25 kilometer resolution at the equator. Our results show GraphCast is more accurate than ECMWF's deterministic operational forecasting system, HRES, on 90.0% of the 2760 variable and lead time combinations we evaluated. GraphCast also outperforms the most accurate previous ML-based weather forecasting model on 99.2% of the 252 targets it reported. GraphCast can generate a 10-day forecast (35 gigabytes of data) in under 60 seconds on Cloud TPU v4 hardware. Unlike traditional forecasting methods, ML-based forecasting scales well with data: by training on bigger, higher quality, and more recent data, the skill of the forecasts can improve. Together these results represent a key step forward in complementing and improving weather modeling with ML, open new opportunities for fast, accurate forecasting, and help realize the promise of ML-based simulation in the physical sciences.

translated by 谷歌翻译

PIZZA: A new benchmark for complex end-to-end task-oriented parsing

Konstantine Arkoudas , Nicolas Guenon des Mesnards , Melanie Rubino , Sandesh Swamy , Saarthak Khanna , Weiqi Sun , Khan Haidar

分类：自然语言处理 | 机器学习

2022-12-01

Much recent work in task-oriented parsing has focused on finding a middle ground between flat slots and intents, which are inexpressive but easy to annotate, and powerful representations such as the lambda calculus, which are expressive but costly to annotate. This paper continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, whose semantics cannot be captured by flat slots and intents. We perform an extensive evaluation of deep-learning techniques for task-oriented parsing on this dataset, including different flavors of seq2seq systems and RNNGs. The dataset comes in two main versions, one in a recently introduced utterance-level hierarchical notation that we call TOP, and one whose targets are executable representations (EXR). We demonstrate empirically that training the parser to directly generate EXR notation not only solves the problem of entity resolution in one fell swoop and overcomes a number of expressive limitations of TOP notation, but also results in significantly greater parsing accuracy.

translated by 谷歌翻译

Découvrir de nouvelles classes dans des données tabulaires

Colin Troisemaine , Joachim Flocon-Cholet , Stéphane Gosselin , Sandrine Vaton , Alexandre Reiffers-Masson , Vincent Lemaire

分类：机器学习

2022-11-28

In Novel Class Discovery (NCD), the goal is to find new classes in an unlabeled set given a labeled set of known but different classes. While NCD has recently gained attention from the community, no framework has yet been proposed for heterogeneous tabular data, despite being a very common representation of data. In this paper, we propose TabularNCD, a new method for discovering novel classes in tabular data. We show a way to extract knowledge from already known classes to guide the discovery process of novel classes in the context of tabular data which contains heterogeneous variables. A part of this process is done by a new method for defining pseudo labels, and we follow recent findings in Multi-Task Learning to optimize a joint objective function. Our method demonstrates that NCD is not only applicable to images but also to heterogeneous tabular data.

translated by 谷歌翻译

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

Daniil Tiapkin , Denis Belomestny , Daniele Calandriello , Eric Moulines , Remi Munos , Alexey Naumov , Mark Rowland , Michal Valko , Pierre Menard

分类： (统计)机器学习 | 机器学习

2022-09-28

我们考虑在以$ s $状态的地平线$ h $和$ a $ ACTIVE的偶发性，有限的，依赖于阶段的马尔可夫决策过程的环境中进行强化学习。代理商的性能是在与环境互动以$ t $插件互动后的遗憾来衡量的。我们提出了一种乐观的后验抽样算法（OPSRL），这是一种简单的后验抽样变体，仅需要许多后样品对数，$ h $，$ s $，$ a $和$ t $ a $ h $ s $ s $ a $ a $和$ t $一对。对于OPSRL，我们保证最多可容纳订单的高概率遗憾，$ \ wideTilde {\ mathcal {o}}}（\ sqrt {h^3sat}）$忽略$ \ text {poly} \ log（hsat）$项。新型的新型技术成分是线性形式的新型抗浓缩不等式，可能具有独立感兴趣。具体而言，我们将Alfers and Dinges [1984]的Beta分布的基于正常近似的下限扩展到Dirichlet分布。我们的界限匹配订单$ \ omega（\ sqrt {h^3sat}）$的下限，从而回答了Agrawal和Jia [2017b]在情节环境中提出的空旷问题。

translated by 谷歌翻译

ScoreCAM GNN: une explication optimale des réseaux profonds sur graphes

Adrien Raison , Pascal Bourdon , David Helbert

分类：机器学习

2022-07-26

深层网络的解释性正在成为深度学习社区中的一个核心问题。在图形上学习是相同的，这是许多现实世界中存在的数据结构。在本文中，我们提出了一种比最新方法更优化，更轻，一致和更好利用评估图的拓扑的方法。

translated by 谷歌翻译

CheckINN: Wide Range Neural Network Verification in Imandra

Remi Desmartin , Grant Passmore , Ekaterina Komendantskaya , Matthew Daggitt

分类：人工智能

2022-07-21

神经网络越来越依赖于复杂安全系统（例如自动驾驶汽车）的组成部分。对在更大的验证周期中嵌入神经网络验证的工具和方法的需求很高。但是，由于关注的广泛验证属性，很难进行神经网络验证，通常每个验证属性仅适用于专用求解器中的验证。在本文中，我们展示了最初设计用于验证，验证和仿真金融基础架构的功能编程语言的Imandra如何为神经网络验证提供整体基础架构。我们开发了一个新颖的图书馆Checkinn，该图书馆在Imandra的神经网络上形式化，并涵盖了神经网络验证的不同重要方面。

translated by 谷歌翻译

PBRE: A Rule Extraction Method from Trained Neural Networks Designed for Smart Home Services

Mingming Qiu , Elie Najm , Remi Sharrock , Bruno Traverson

分类：人工智能

2022-07-18

当同时部署大量传感器和执行器的多个服务时，设计智能家庭服务是一项复杂的任务。它可能依赖于基于知识或数据驱动的方法。前者可以使用基于规则的方法静态设计服务，后者可以使用学习方法动态地发现居民的偏好。但是，这些方法都不完全令人满意，因为规则不能涵盖所有可能改变的可能情况，而学习方法可能会做出有时对居民无法理解的决定。在本文中，提出了PBRE（基于教学的规则提取器），以从学习方法中提取规则，以实现智能家庭系统的动态规则生成。预期的优势是采用了基于规则的方法的解释性和学习方法的动态性。我们将PBRE与现有规则提取方法进行比较，结果显示PBRE的性能更好。我们还应用PBRE从NRL（基于神经网络的强化学习）代表的智能家庭服务中提取规则。结果表明，PBRE可以帮助NRL模拟的服务向居民提出可理解的建议。

translated by 谷歌翻译

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Julien Perolat , Bart de Vylder , Daniel Hennes , Eugene Tarassov , Florian Strub , Vincent de Boer , Paul Muller , Jerome T. Connor , Neil Burch , Thomas Anthony

分类：人工智能

2022-06-30

我们介绍了DeepNash，这是一种能够学习从头开始播放不完美的信息游戏策略的自主代理，直到人类的专家级别。 Stratego是人工智能（AI）尚未掌握的少数标志性棋盘游戏之一。这个受欢迎的游戏具有$ 10^{535} $节点的巨大游戏树，即，$ 10^{175} $倍的$倍于GO。它具有在不完美的信息下需要决策的其他复杂性，类似于德克萨斯州Hold'em扑克，该扑克的游戏树较小（以$ 10^{164} $节点为单位）。 Stratego中的决策是在许多离散的动作上做出的，而动作与结果之间没有明显的联系。情节很长，在球员获胜之前经常有数百次动作，而Stratego中的情况则不能像扑克中那样轻松地分解成管理大小的子问题。由于这些原因，Stratego几十年来一直是AI领域的巨大挑战，现有的AI方法几乎没有达到业余比赛水平。 Deepnash使用游戏理论，无模型的深钢筋学习方法，而无需搜索，该方法学会通过自我播放来掌握Stratego。 DeepNash的关键组成部分的正则化NASH Dynamics（R-NAD）算法通过直接修改基础多项式学习动力学来收敛到近似NASH平衡，而不是围绕它“循环”。 Deepnash在Stratego中击败了现有的最先进的AI方法，并在Gravon Games平台上获得了年度（2022年）和历史前3名，并与人类专家竞争。

translated by 谷歌翻译