智能论文笔记

Greenhouse gases emissions: estimating corporate non-reported emissions using interpretable machine learning

Jeremi Assael , Thibaut Heurtebize , Laurent Carlier , François Soupé

分类：机器学习 | 人工智能

2022-12-21

As of 2022, greenhouse gases (GHG) emissions reporting and auditing are not yet compulsory for all companies and methodologies of measurement and estimation are not unified. We propose a machine learning-based model to estimate scope 1 and scope 2 GHG emissions of companies not reporting them yet. Our model, specifically designed to be transparent and completely adapted to this use case, is able to estimate emissions for a large universe of companies. It shows good out-of-sample global performances as well as good out-of-sample granular performances when evaluating it by sectors, by countries or by revenues buckets. We also compare our results to those of other providers and find our estimates to be more accurate. Thanks to the proposed explainability tools using Shapley values, our model is fully interpretable, the user being able to understand which factors split explain the GHG emissions for each particular company.

translated by 谷歌翻译

FIESTA: FIber gEneration and bundle Segmentation in Tractography using Autoencoders

Félix Dumais , Jon Haitz Legarreta , Carl Lemaire , Philippe Poulin , François Rheault , Laurent Petit , Maxime Descoteaux , Pierre-Marc Jodoin

分类：计算机视觉 | 机器学习

2022-11-30

White matter bundle segmentation is a cornerstone of modern tractography to study the brain's structural connectivity in domains such as neurological disorders, neurosurgery, and aging. In this study, we present FIESTA (FIber gEneration and bundle Segmentation in Tractography using Autoencoders), a reliable and robust, fully automated, and easily semi-automatically calibrated pipeline based on deep autoencoders that can dissect and fully populate WM bundles. Our framework allows the transition from one anatomical bundle definition to another with marginal calibrating time. This pipeline is built upon FINTA, CINTA, and GESTA methods that demonstrated how autoencoders can be used successfully for streamline filtering, bundling, and streamline generation in tractography. Our proposed method improves bundling coverage by recovering hard-to-track bundles with generative sampling through the latent space seeding of the subject bundle and the atlas bundle. A latent space of streamlines is learned using autoencoder-based modeling combined with contrastive learning. Using an atlas of bundles in standard space (MNI), our proposed method segments new tractograms using the autoencoder latent distance between each tractogram streamline and its closest neighbor bundle in the atlas of bundles. Intra-subject bundle reliability is improved by recovering hard-to-track streamlines, using the autoencoder to generate new streamlines that increase each bundle's spatial coverage while remaining anatomically meaningful. Results show that our method is more reliable than state-of-the-art automated virtual dissection methods such as RecoBundles, RecoBundlesX, TractSeg, White Matter Analysis and XTRACT. Overall, these results show that our framework improves the practicality and usability of current state-of-the-art bundling framework

translated by 谷歌翻译

Variational inference of fractional Brownian motion with linear computational complexity

Hippolyte Verdier , François Laurent , Christian Vestergaard , Jean-Baptiste Masson , Alhassan Cassé

分类：机器学习

2022-03-15

我们引入了基于仿真的摊销贝叶斯推理方案，以推断随机步行的参数。我们的方法通过无可能的方法了解了步行参数的后验分布。在第一步中，对图形神经网络进行了模拟数据培训，以学习随机步行的优化低维摘要统计数据。在第二步中，可逆神经网络使用变分推断从学习的汇总统计数据中产生参数的后验分布。我们应用我们的方法来从单轨迹推断布朗尼运动模型的参数。摊销推理过程的计算复杂性与轨迹长度线性缩放，其精度比例与cram {\'e} r-rao相似，在较大的长度上结合。该方法对位置噪声是强大的，并且比训练期间看到的轨迹更长的轨迹更长。最后，我们适应了该方案，以表明环境中的有限去相关时间可以从单个轨迹中推断出来。

translated by 谷歌翻译

A Novel Occupancy Mapping Framework for Risk-Aware Path Planning in Unstructured Environments

Johann Laconte , Abderrahim Kasmi , François Pomerleau , Roland Chapuis , Laurent Malaterre , Christophe Debain , Romuald Aufrère

分类：机器人

2020-11-16

在自治机器人的背景下，最重要的任务之一是在导航期间防止对机器人的潜在损坏。为此目的，通常认为必须处理已知的概率障碍，然后计算与每个障碍物碰撞的概率。然而，在复杂的场景或非结构化环境中，可能难以检测这种障碍。在这些情况下，使用公制地图，其中每个位置存储占用的信息。最常见的公制地图类型是贝叶斯占用地图。然而，由于其离散性，这种类型的地图对计算持续路径的风险评估不太适合。因此，我们介绍了一种名为Lambda领域的新型地图，专门用于风险评估。我们首先提出了一种方法来计算这样的地图和对路径的通用风险的期望。然后，我们用一个用例展示了我们的通用配方的益处，该用例将风险定义为路径上的预期碰撞力。使用这种风险定义和Lambda领域，我们表明我们的框架能够在具有物理度量的同时进行经典路径规划。此外，Lambda领域为处理非结构化环境（如高草）提供了一种自然的方式。如果标准环境表示始终会产生围绕此类障碍的轨迹，我们的框架允许机器人在意识到所采取的风险时穿过草地。

translated by 谷歌翻译

Explaining Machine Learning Models using Entropic Variable Projection

François Bachoc , Fabrice Gamboa , Max Halford , Jean-Michel Loubes , Laurent Risser

分类： (统计)机器学习 | 机器学习

2018-10-18

在本文中，我们提出了一种新的可解释性形式主义，旨在阐明测试集的每个输入变量如何影响机器学习模型的预测。因此，我们根据训练有素的机器学习决策规则提出了一个群体的解释性形式，它们是根据其对输入变量分布的可变性的反应。为了强调每个输入变量的影响，这种形式主义使用信息理论框架，该框架量化了基于熵投影的所有输入输出观测值的影响。因此，这是第一个统一和模型不可知的形式主义，使数据科学家能够解释输入变量之间的依赖性，它们对预测错误的影响以及它们对输出预测的影响。在大型样本案例中提供了熵投影的收敛速率。最重要的是，我们证明，计算框架中的解释具有低算法的复杂性，使其可扩展到现实生活中的大数据集。我们通过解释通过在各种数据集上使用XGBoost，随机森林或深层神经网络分类器（例如成人收入，MNIST，CELEBA，波士顿住房，IRIS以及合成的）上使用的复杂决策规则来说明我们的策略。最终，我们明确了基于单个观察结果的解释性策略石灰和摇摆的差异。可以通过使用自由分布的Python工具箱https://gems-ai.aniti.fr/来复制结果。

translated by 谷歌翻译

Long-Term Online Multi-Session Graph-Based SPLAM with Memory Management

Mathieu Labbé , François Michaud

分类：机器人

2022-12-30

For long-term simultaneous planning, localization and mapping (SPLAM), a robot should be able to continuously update its map according to the dynamic changes of the environment and the new areas explored. With limited onboard computation capabilities, a robot should also be able to limit the size of the map used for online localization and mapping. This paper addresses these challenges using a memory management mechanism, which identifies locations that should remain in a Working Memory (WM) for online processing from locations that should be transferred to a Long-Term Memory (LTM). When revisiting previously mapped areas that are in LTM, the mechanism can retrieve these locations and place them back in WM for online SPLAM. The approach is tested on a robot equipped with a short-range laser rangefinder and a RGB-D camera, patrolling autonomously 10.5 km in an indoor environment over 11 sessions while having encountered 139 people.

translated by 谷歌翻译

A Generalization of ViT/MLP-Mixer to Graphs

Xiaoxin He , Bryan Hooi , Thomas Laurent , Adam Perold , Yann LeCun , Xavier Bresson

分类：计算机视觉

2022-12-27

Graph Neural Networks (GNNs) have shown great potential in the field of graph representation learning. Standard GNNs define a local message-passing mechanism which propagates information over the whole graph domain by stacking multiple layers. This paradigm suffers from two major limitations, over-squashing and poor long-range dependencies, that can be solved using global attention but significantly increases the computational cost to quadratic complexity. In this work, we propose an alternative approach to overcome these structural limitations by leveraging the ViT/MLP-Mixer architectures introduced in computer vision. We introduce a new class of GNNs, called Graph MLP-Mixer, that holds three key properties. First, they capture long-range dependency and mitigate the issue of over-squashing as demonstrated on the Long Range Graph Benchmark (LRGB) and the TreeNeighbourMatch datasets. Second, they offer better speed and memory efficiency with a complexity linear to the number of nodes and edges, surpassing the related Graph Transformer and expressive GNN models. Third, they show high expressivity in terms of graph isomorphism as they can distinguish at least 3-WL non-isomorphic graphs. We test our architecture on 4 simulated datasets and 7 real-world benchmarks, and show highly competitive results on all of them.

translated by 谷歌翻译

Neural Networks beyond explainability: Selective inference for sequence motifs

Antoine Villié , Philippe Veber , Yohann de Castro , Laurent Jacob

分类：机器学习 | (统计)机器学习

2022-12-23

Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM, a selective inference procedure to test the association between these extracted features and the predicted phenotype. In particular, we discuss how training a one-layer convolutional network is formally equivalent to selecting motifs maximizing some association score. We adapt existing sampling-based selective inference procedures by quantizing this selection over an infinite set to a large but finite grid. Finally, we show that sampling under a specific choice of parameters is sufficient to characterize the composite null hypothesis typically used for selective inference-a result that goes well beyond our particular framework. We illustrate the behavior of our method in terms of calibration, power and speed and discuss its power/speed trade-off with a simpler data-split strategy. SEISM paves the way to an easier analysis of neural networks used in regulatory genomics, and to more powerful methods for genome wide association studies (GWAS).

translated by 谷歌翻译

Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning

Chris Lengerich , Gabriel Synnaeve , Amy Zhang , Hugh Leather , Kurt Shuster , François Charton , Charysse Redwood

分类：自然语言处理 | 机器学习

2022-12-21

Traditional approaches to RL have focused on learning decision policies directly from episodic decisions, while slowly and implicitly learning the semantics of compositional representations needed for generalization. While some approaches have been adopted to refine representations via auxiliary self-supervised losses while simultaneously learning decision policies, learning compositional representations from hand-designed and context-independent self-supervised losses (multi-view) still adapts relatively slowly to the real world, which contains many non-IID subspaces requiring rapid distribution shift in both time and spatial attention patterns at varying levels of abstraction. In contrast, supervised language model cascades have shown the flexibility to adapt to many diverse manifolds, and hints of self-learning needed for autonomous task transfer. However, to date, transfer methods for language models like few-shot learning and fine-tuning still require human supervision and transfer learning using self-learning methods has been underexplored. We propose a self-supervised loss policy called contrastive distillation which manifests latent variables with high mutual information with both source and target tasks from weights to tokens. We show how this outperforms common methods of transfer learning and suggests a useful design axis of trading off compute for generalizability for online transfer. Contrastive distillation is improved through sampling from memory and suggests a simple algorithm for more efficiently sampling negative examples for contrastive losses than random sampling.

translated by 谷歌翻译

Temporal Disaggregation of the Cumulative Grass Growth

Thomas Guyet , Laurent Spillemaecker , Simon Malinowski , Anne-Isabelle Graux

分类：机器学习 | 人工智能

2022-12-21

Information on the grass growth over a year is essential for some models simulating the use of this resource to feed animals on pasture or at barn with hay or grass silage. Unfortunately, this information is rarely available. The challenge is to reconstruct grass growth from two sources of information: usual daily climate data (rainfall, radiation, etc.) and cumulative growth over the year. We have to be able to capture the effect of seasonal climatic events which are known to distort the growth curve within the year. In this paper, we formulate this challenge as a problem of disaggregating the cumulative growth into a time series. To address this problem, our method applies time series forecasting using climate information and grass growth from previous time steps. Several alternatives of the method are proposed and compared experimentally using a database generated from a grassland process-based model. The results show that our method can accurately reconstruct the time series, independently of the use of the cumulative growth information.

translated by 谷歌翻译