智能论文笔记

Project Achoo: A Practical Model and Application for COVID-19 Detection from Recordings of Breath, Voice, and Cough

Alexander Ponomarchuk , Ilya Burenko , Elian Malkin , Ivan Nazarov , Vladimir Kokh , Manvel Avetisian , Leonid Zhukov

分类：机器学习

2021-07-12

Covid-19大流行为感染检测和监测解决方案产生了重大的兴趣和需求。在本文中，我们提出了一种机器学习方法，可以使用在消费者设备上进行的录音来快速分离Covid-19。该方法将信号处理方法与微调深层学习网络相结合，提供了信号去噪，咳嗽检测和分类的方法。我们还开发并部署了一个移动应用程序，使用症状检查器与语音，呼吸和咳嗽信号一起使用，以检测Covid-19感染。该应用程序对两个开放的数据集和最终用户在测试版测试期间收集的嘈杂数据显示了鲁棒性能。

translated by 谷歌翻译

Posterior samples of source galaxies in strong gravitational lenses with score-based priors

Alexandre Adam , Adam Coogan , Nikolay Malkin , Ronan Legin , Laurence Perreault-Levasseur , Yashar Hezaveh , Yoshua Bengio

分类：计算机视觉 | 机器学习

2022-11-07

Inferring accurate posteriors for high-dimensional representations of the brightness of gravitationally-lensed sources is a major challenge, in part due to the difficulties of accurately quantifying the priors. Here, we report the use of a score-based model to encode the prior for the inference of undistorted images of background galaxies. This model is trained on a set of high-resolution images of undistorted galaxies. By adding the likelihood score to the prior score and using a reverse-time stochastic differential equation solver, we obtain samples from the posterior. Our method produces independent posterior samples and models the data almost down to the noise level. We show how the balance between the likelihood and the prior meet our expectations in an experiment with out-of-distribution data.

translated by 谷歌翻译

GFlowOut: Dropout with Generative Flow Networks

Dianbo Liu , Moksh Jain , Bonaventure Dossou , Qianli Shen , Salem Lahlou , Anirudh Goyal , Nikolay Malkin , Chris Emezue , Dinghuai Zhang , Nadhir Hassen

分类：机器学习 | 人工智能

2022-10-24

Bayesian Inference offers principled tools to tackle many critical problems with modern neural networks such as poor calibration and generalization, and data inefficiency. However, scaling Bayesian inference to large architectures is challenging and requires restrictive approximations. Monte Carlo Dropout has been widely used as a relatively cheap way for approximate Inference and to estimate uncertainty with deep neural networks. Traditionally, the dropout mask is sampled independently from a fixed distribution. Recent works show that the dropout mask can be viewed as a latent variable, which can be inferred with variational inference. These methods face two important challenges: (a) the posterior distribution over masks can be highly multi-modal which can be difficult to approximate with standard variational inference and (b) it is not trivial to fully utilize sample-dependent information and correlation among dropout masks to improve posterior estimation. In this work, we propose GFlowOut to address these issues. GFlowOut leverages the recently proposed probabilistic framework of Generative Flow Networks (GFlowNets) to learn the posterior distribution over dropout masks. We empirically demonstrate that GFlowOut results in predictive distributions that generalize better to out-of-distribution data, and provide uncertainty estimates which lead to better performance in downstream tasks.

translated by 谷歌翻译

Learning GFlowNets from partial episodes for improved convergence and stability

Kanika Madan , Jarrid Rector-Brooks , Maksym Korablyov , Emmanuel Bengio , Moksh Jain , Andrei Nica , Tom Bosc , Yoshua Bengio , Nikolay Malkin

分类：机器学习 | (统计)机器学习

2022-09-26

生成流动网络（GFLOWNETS）是一种算法家族，用于训练在非均衡目标密度下离散对象的顺序采样器，并已成功用于各种概率建模任务。现有的Gflownets培训目标是国家本地的，或者是过渡的本地，或者在整个采样轨迹上传播奖励信号。我们认为，这些替代方案代表了梯度偏见变化权衡的相反目的，并提出了一种利用这种权衡以减轻其有害影响的方法。受到强化学习的TD（$ \ lambda $）算法的启发，我们介绍了一个subtrajectory Balance或subtb（$ \ lambda $），这是一个GFLOWNET培训目标，可以从不同长度的部分动作子序列中学习。我们表明，SubTB（$ \ lambda $）会在先前研究和新环境中加速采样器的收敛，并在具有更长的动作序列和比以前的可能性更长的环境中培训Gflownets。我们还对随机梯度动力学进行了比较分析，阐明了GFLOWNET训练中的偏差变化权衡以及亚条件平衡的优势。

translated by 谷歌翻译

Unifying Generative Models with GFlowNets

Dinghuai Zhang , Ricky T. Q. Chen , Nikolay Malkin , Yoshua Bengio

分类：机器学习 | 人工智能 | (统计)机器学习

2022-09-06

有许多用于深层生成建模的框架，每个框架通常都有自己的特定培训算法和推理方法。我们提供了有关现有深层生成模型与GFLOWNET框架之间的连接的简短说明，阐明了它们的重叠特征，并通过Markovian轨迹通过学习镜头来提供统一的观点。这为统一培训和推理算法提供了一种手段，并提供了构建生成模型团聚的途径。

translated by 谷歌翻译

Diffusion models as plug-and-play priors

Alexandros Graikos , Nikolay Malkin , Nebojsa Jojic , Dimitris Samaras

分类：机器学习 | 计算机视觉

2022-06-17

我们考虑在模型中推断高维数据$ \ mathbf {x} $的问题，该模型由先前的$ p（\ mathbf {x}）$和辅助约束$ c（\ mathbf {x}，\ mathbf）{y}）$。在本文中，先验是一个独立训练的denoising扩散生成模型。辅助约束预计将具有可区分的形式，但可能来自不同的来源。这种推理的可能性将扩散模型转换为插件模块，从而允许在适应新域和任务（例如条件生成或图像分割）中进行一系列潜在应用。扩散模型的结构使我们能够通过通过固定的denoising网络迭代分化来执行近似推断，每个步骤在每个步骤中都有不同量的噪声。考虑到评估其健身的许多噪声版本的$ \ mathbf {x} $是一种新颖的搜索机制，可能导致新算法用于解决组合优化问题。

translated by 谷歌翻译

Resolving label uncertainty with implicit posterior models

Esther Rolf , Nikolay Malkin , Alexandros Graikos , Ana Jojic , Caleb Robinson , Nebojsa Jojic

分类：机器学习 | (统计)机器学习

2022-02-28

我们提出了一种在数据样本集合中共同推断标签的方法，其中每个样本都包含一个观察和对标签的先验信念。通过隐式假设存在一种生成模型，可区分预测因子是后部，我们得出了一个训练目标，该目标允许在弱信念下学习。该配方统一了各种机器学习设置；弱信念可以以嘈杂或不完整的标签形式出现，由辅助输入的不同预测机制给出的可能性，或反映出有关手头问题结构的知识的常识性先验。我们证明了有关各种问题的建议算法：通过负面培训示例进行分类，从排名中学习，弱和自我监督的空中成像细分，视频框架的共段以及粗糙的监督文本分类。

translated by 谷歌翻译

Generative Flow Networks for Discrete Probabilistic Modeling

Dinghuai Zhang , Nikolay Malkin , Zhen Liu , Alexandra Volokhova , Aaron Courville , Yoshua Bengio

分类：机器学习 | (统计)机器学习

2022-02-03

我们提出了基于能量的生成流网络（EB-GFN），这是一种用于高维离散数据的新型概率建模算法。基于生成流网络（GFLOWNETS）的理论，我们通过随机数据构建政策对生成过程进行建模，从而将昂贵的MCMC探索摊销为从Gflownet采样的固定动作中。我们展示了Gflownets如何在模式之间进行大致进行大型Gibbs采样以混合。我们提出了一个框架，以共同训练具有能量功能的Gflownet，以便Gflownet学会从能量分布中进行采样，而能量则以近似MLE目标学习，并从GFLOWNET中使用负样本。我们证明了EB-GFN对各种概率建模任务的有效性。代码可在https://github.com/zdhnarsil/eb_gfn上公开获取。

translated by 谷歌翻译

Mining self-similarity: Label super-resolution with epitomic representations

Nikolay Malkin , Anthony Ortiz , Caleb Robinson , Nebojsa Jojic

分类：计算机视觉

2020-04-24

我们表明，基于补丁的模型，例如展示，可以对使用深卷积神经网络的语义分割和标签超分辨率的最新状态具有卓越的性能。我们推导出一种新的培训算法，其允许从非常大的数据集中学习并从拓扑表征中推导出标签超分辨率算法作为统计推理算法。我们说明了我们在陆地覆盖映射和医学图像分析任务的方法。

translated by 谷歌翻译